ValueError: setting an array element with a sequence. The requested array has an inhomogeneous shape after 2 dimensions. The detected shape was (5513, 1) + inhomogeneous part

I'm working on malaria detection with TensorFlow. I tried to solve the problem by reading the documentation and looking at other people with similar problems and solutions. I actively use Git, and you can look at the Python script (malaria_detection.py) from the link below. This is my first time running into this error and I'm still learning computer vision, so I appreciate any help and feedback.

You can find the code at my GitHub link or below:

import tensorflow as tf
import numpy as np
import matplotlib.pyplot as plt
import tensorflow_datasets as tfds
from keras.models import Model
from keras.layers import Conv2D, MaxPool2D, Dense, Flatten, InputLayer, \
    BatchNormalization, Input, Layer, Dropout, RandomFlip, RandomRotation,\
    Resizing, Rescaling
from keras.optimizers import Adam
from keras.losses import BinaryCrossentropy
from keras.metrics import BinaryAccuracy, FalseNegatives, FalsePositives, \
    TrueNegatives, TruePositives, Precision, Recall, AUC
from sklearn.metrics import confusion_matrix, roc_curve
import seaborn as sns
from keras.callbacks import Callback, CSVLogger, EarlyStopping, LearningRateScheduler, \
    ModelCheckpoint, ReduceLROnPlateau
from keras.regularizers import L2
from keras import Sequential
import cv2

dataset, dataset_info = tfds.load('malaria',
                                  with_info=True,
                                  as_supervised=True,
                                  shuffle_files=True,
                                  split=['train'])

print(dataset)
print(dataset_info)

def splits(dataset, TRAIN_RATIO, VAL_RATIO, TEST_RATIO):
    DATASET_SIZE = len(dataset)

    train_dataset = dataset.take(int(TRAIN_RATIO * DATASET_SIZE))

    val_test_dataset = dataset.skip(int(TRAIN_RATIO * DATASET_SIZE))
    val_dataset = val_test_dataset.take(int(VAL_RATIO * DATASET_SIZE))

    test_dataset = val_test_dataset.skip(int(VAL_RATIO * DATASET_SIZE))
    return train_dataset, val_dataset, test_dataset

TRAIN_RATIO = 0.6
VAL_RATIO = 0.2
TEST_RATIO = 0.2

#dataset = tf.data.Dataset.range(10)
train_dataset, val_dataset, test_dataset = splits(dataset[0],
                                                  TRAIN_RATIO,
                                                  VAL_RATIO,
                                                  TEST_RATIO)
print(list(train_dataset.take(1).as_numpy_iterator()),
      list(val_dataset.take(1).as_numpy_iterator()),
      list(test_dataset.take(1).as_numpy_iterator()))

for data in dataset[0].take(4):
    print(data)

for i, (image, label) in enumerate(train_dataset.take(16)):
    ax = plt.subplot(4, 4, i + 1)
    plt.imshow(image)
    plt.title(dataset_info.features['label'].int2str(label))
    plt.axis('off')
    plt.show()

"""for i, (image, label) in enumerate(train_dataset.take(2)):
    plt.subplot(1, 4, 2*i + 1)
    plt.imshow(image)

    plt.subplot(1, 4, 2 * i + 2)
    plt.imshow(tf.image.adjust_saturation(image, 0.3))

    plt.title(dataset_info.features['label'].int2str(label))
    plt.axis('off')
    plt.show()"""

print(dataset_info.features['label'].int2str(1))

def visualize(original, augmented):
    plt.subplot(1, 2, 1)
    plt.imshow(original)

    plt.subplot(1, 2, 2)
    plt.imshow(augmented)

original_image, label = next(iter(train_dataset))

augmented_image = tf.image.adjust_saturation(original_image, saturation_factor = 0.3)
visualize(original_image, augmented_image)

IM_SIZE = 224

def resize_rescale(image, label):
    return tf.image.resize(image, (IM_SIZE, IM_SIZE))/255.0, label

#tf.keras.layer resizing and rescaling
resize_rescale_layers = Sequential([
    Resizing(IM_SIZE, IM_SIZE),
    Rescaling(1.0/255),
])

#tf.image augment
def augment(image, label):

    image, label = resize_rescale(image, label)

    image = tf.image.rot90(image)
    #image = tf.image.adjust_saturation(image, saturation_factor = 0.3)
    image = tf.image.flip_left_right(image)

    return image, label

class RotNinety(Layer):
    def __init__(self):
        super().__init__()

    def call(self, image):
        return tf.image.rot90(image)

#tf.keras.layers augment
augment_layers = Sequential([
    RotNinety(),
    RandomFlip(mode='horizontal',),
])

def augment_layer(image, label):
    return augment_layers(resize_rescale_layers(image), training = True), label

#test_dataset = test_dataset.map(resize_rescale_layers)
#print(test_dataset)

#for image, label in train_dataset.take(1):
    #print(image, label)

BATCH_SIZE = 32
train_dataset = (
    train_dataset.shuffle(buffer_size = 8, reshuffle_each_iteration = True)
    #.map(augment_layer)
    .batch(1)
    .prefetch(tf.data.AUTOTUNE)
)

val_dataset = (
    val_dataset
    .shuffle(buffer_size = 8,reshuffle_each_iteration = True)
    #.map(resize_rescale_layers)
    .batch(1)
    .prefetch(tf.data.AUTOTUNE)
)

print(train_dataset)
print(val_dataset)

IM_SIZE = 224
dropout_rate = 0.3
regularization_rate = 0.01

lenet_model = Sequential([
    InputLayer(input_shape=(None, None, 3)),

    resize_rescale_layers,
    augment_layers,

    Conv2D(filters = 6,
           kernel_size = 3,
           strides=1,
           padding='valid',
           activation= 'relu',
           ), #kernel_regularizer = L2(regularization_rate)
    BatchNormalization(),
    MaxPool2D(pool_size=2, strides=2),
    #Dropout(rate = dropout_rate),

    Conv2D(filters = 16,
           kernel_size = 3,
           strides=1,
           padding='valid',
           activation= 'relu',
           ), #kernel_regularizer= L2(regularization_rate)
    BatchNormalization(),
    MaxPool2D(pool_size=2, strides=2),

    Flatten(),

    Dense(100, activation= "relu", kernel_regularizer= L2(regularization_rate)),
    BatchNormalization(),
    Dropout(rate = dropout_rate),

    Dense(10, activation= "relu", kernel_regularizer= L2(regularization_rate)),
    BatchNormalization(),
    Dense(1, activation= "sigmoid", kernel_regularizer= L2(regularization_rate)),
])

print(lenet_model.summary())

class LossCallback(Callback):
    def on_epoch_end(self, epoch, logs):
        print("\n For Epoch Number {} the model has a loss of {} ".format(epoch+1, logs["loss"]))

    def on_batch_end(self, batch, logs):
        print("\n For Batch Number {} the model has a loss of {} ".format(batch+1, logs["loss"]))

csv_callback = CSVLogger(
    'logs.csv', separator=',', append=True
)

es_callback = EarlyStopping(
    monitor='val_loss',
    min_delta=0,
    patience=2,
    verbose=1,
    mode='auto',
    baseline=None,
    restore_best_weights=False
)

def scheduler(epoch, lr):
    if epoch <= 3:
        return lr
    else:
        return lr * tf.math.exp(-0.1)

scheduler_callback = LearningRateScheduler(scheduler, verbose=1)

checkpoint_callback = ModelCheckpoint(
    'checkpoints/',
    monitor = 'val_loss',
    verbose = 0,
    save_best_only = False,
    save_weights_only = True,
    mode = 'auto',
    save_freq='epoch',
    options=None,
    initial_value_threshold=None,
)

plateau_callback = ReduceLROnPlateau(
    monitor='val_accuracy', factor=0.1, patience=5, verbose=1
)

metrics = [TruePositives(name='tp'),FalsePositives(name='fp'),
           TrueNegatives(name='tn'), FalseNegatives(name='fn'),
            BinaryAccuracy(name='accuracy'), Precision(name='precision'),
           Recall(name='recall'), AUC(name='auc')]

lenet_model.compile(optimizer=Adam(learning_rate=0.01),
                    loss=BinaryCrossentropy(),
                    metrics=metrics,
                    run_eagerly=False)

history = lenet_model.fit(train_dataset,
                          validation_data=val_dataset,
                          epochs = 1,
                          verbose = 1,) #callbacks= [plateau_callback]

image = cv2.imread('cell1.png')
print(image.shape)
image = tf.expand_dims(image, axis = 0)
print(image.shape)

print(lenet_model.predict(image))

plt.plot(history.history['loss'])
plt.plot(history.history['val_loss'])
plt.title('Model loss')
plt.ylabel('loss')
plt.xlabel('epoch')
plt.legend(['train_loss', 'val_loss'])
plt.show()

print(lenet_model.predict(train_dataset.take(1)).shape)

test_dataset = test_dataset.batch(1)

print(lenet_model.evaluate(test_dataset))

def parasite_or_not(x):
    if x < 0.5:
        return 'P'
    else:
        return 'U'

labels = []
inp = []

for x, y in test_dataset.as_numpy_iterator():
    labels.append(y)
    inp.append(x)

labels = np.array([i[0] for i in labels])
predicted = lenet_model.predict(np.array(inp)[:,0,...])

threshold = 0.6265

cm = confusion_matrix(labels, predicted > threshold)
print(cm)

plt.figure(figsize=(8, 8))
sns.heatmap(cm, annot=True,)
plt.title('Confusion matrix - {}'.format(threshold))
plt.ylabel('Actual')
plt.xlabel('Predicted')
plt.show()

fp, tp, thresholds = roc_curve(labels, predicted)
print(len(fp), len(tp), len(thresholds))

fp, tp, thresholds = roc_curve(labels, predicted)
plt.plot(fp, tp)
plt.xlabel("False Positive rate")
plt.ylabel("True Positive rate")

plt.grid()

skip = 20

for i in range(0, len(thresholds), skip):
    plt.text(fp[i], tp[i], thresholds[i])

plt.show()

print(parasite_or_not(lenet_model.predict(test_dataset.take(1))[0][0]))

for i, (image, label) in enumerate(test_dataset.take(9)):

    ax = plt.subplot(3, 3, i + 1)
    plt.imshow(image[0])
    plt.title(str(parasite_or_not(label.numpy()[0])) + ":" +
              str(parasite_or_not(lenet_model.predict(image)[0][0])))

    plt.axis('off')
    plt.show()

#Functional API

func_input = Input(shape=(IM_SIZE, IM_SIZE, 3), name="Input Image")

x = Conv2D(filters = 6,
           kernel_size = 3,
           strides=1,
           padding='valid',
           activation= 'relu')(func_input)
x = BatchNormalization()(x)
x = MaxPool2D(pool_size=2, strides=2)(x)

x = Conv2D(filters = 16,
           kernel_size = 3,
           strides=1,
           padding='valid',
           activation= 'relu')(x)
x = BatchNormalization()(x)

x = MaxPool2D(pool_size=2, strides=2)(x)

x = Flatten()(x)

x = Dense(100, activation= "relu")(x)
x = BatchNormalization()(x)

x = Dense(10, activation= "relu")(x)
x = BatchNormalization()(x)

feature_extractor_seq_model = tf.keras.Sequential([
    InputLayer(input_shape=(IM_SIZE, IM_SIZE, 3)),

    Conv2D(filters=6,
           kernel_size=3,
           strides=1,
           padding='valid',
           activation='relu'),
    BatchNormalization(),
    MaxPool2D(pool_size=2, strides=2),

    Conv2D(filters=16,
           kernel_size=3,
           strides=1,
           padding='valid',
           activation='relu'),
    BatchNormalization(),
    MaxPool2D(pool_size=2, strides=2),

])
print(feature_extractor_seq_model.summary())

func_output = Dense(1, activation= "sigmoid")(x)

lenet_model = Model(func_input, func_output, name = "Lenet Model")
print(lenet_model.summary())

#Model Subclassing

class FeatureExtractor(Layer):
    def __init__(self, filters, kernel_size, strides, padding, activation,
                 pool_size):
        super(FeatureExtractor, self).__init__()

        self.conv_1 = Conv2D(filters = filters,
                             kernel_size = kernel_size,
                             strides = strides,
                             padding = padding,
                             activation = activation)
        self.batch_1 = BatchNormalization()
        self.pool_1 = MaxPool2D(pool_size=pool_size, strides=2*strides)

        self.conv_2 = Conv2D(filters = filters*2,
                             kernel_size = kernel_size,
                             strides = strides,
                             padding = padding,
                             activation = activation)
        self.batch_2 = BatchNormalization()
        self.pool_2 = MaxPool2D(pool_size=pool_size, strides=2*strides)

    def call(self, x, training):
        x = self.conv_1(x)
        x = self.batch_1(x)
        x = self.pool_1(x)

        x = self.conv_2(x)
        x = self.batch_2(x)
        x = self.pool_2(x)

        return x

feature_sub_classed = FeatureExtractor(8, 3, 1, "valid", "relu", 2)

func_input = Input(shape=(IM_SIZE, IM_SIZE, 3), name="Input Image")

x = feature_sub_classed(func_input)

x = Flatten()(x)

x = Dense(100, activation="relu")(x)
x = BatchNormalization()(x)

x = Dense(10, activation="relu")(x)
x = BatchNormalization()(x)

func_output = Dense(1, activation="sigmoid")(x)

lenet_model_func = Model(func_input, func_output, name="Lenet_Model")
print(lenet_model_func.summary())

class LenetModel(Model):
    def __init__(self):
        super(LenetModel, self).__init__()

        self.feature_extractor = FeatureExtractor(8, 3, 1, "valid", "relu", 2)

        self.flatten = Flatten()

        self.dense_1 = Dense(100, activation="relu")
        self.batch_1 = BatchNormalization()

        self.dense_2 = Dense(10, activation="relu")
        self.batch_2 = BatchNormalization()

        self.dense_3 = Dense(1, activation="sigmoid")

    def call(self, x, training):
        x = self.feature_extractor(x)
        x = self.flatten(x)
        x = self.dense_1(x)
        x = self.batch_1(x)
        x = self.dense_2(x)
        x = self.batch_2(x)
        x = self.dense_3(x)

        return x

lenet_sub_classed = LenetModel()
lenet_sub_classed(tf.zeros([1, 224, 224, 3]))
print(lenet_sub_classed.summary())

class NeuralearnDense(Layer):
    def __init__(self, output_units, activation):
        super(NeuralearnDense, self).__init__()
        self.output_units = output_units
        self.activation = activation

    def build(self, input_features_shape):
        self.w = self.add_weight(shape=(input_features_shape[-1], self.output_units), initializer="random_normal",
                                 trainable=True)
        self.b = self.add_weight(shape=(self.output_units,), initializer="random_normal", trainable=True)

    def call(self, input_features):

        pre_output = tf.matmul(input_features, self.w) + self.b

        if (self.activation == "relu"):
            return tf.nn.relu(pre_output)

        elif (self.activation == "sigmoid"):
            return tf.math.sigmoid(pre_output)

        else:
            return pre_output

IM_SIZE = 224
lenet_custom_model = tf.keras.Sequential([
    InputLayer(input_shape=(IM_SIZE, IM_SIZE, 3)),

    Conv2D(filters = 6,
           kernel_size = 3,
           strides=1,
           padding='valid',
           activation= 'relu'),
    BatchNormalization(),
    MaxPool2D(pool_size=2, strides=2),

    Conv2D(filters = 16,
           kernel_size = 3,
           strides=1,
           padding='valid',
           activation= 'relu'),
    BatchNormalization(),
    MaxPool2D(pool_size=2, strides=2),

    Flatten(),

    NeuralearnDense(100, activation= "relu"),
    BatchNormalization(),
    NeuralearnDense(10, activation= "relu"),
    BatchNormalization(),
    NeuralearnDense(1, activation= "sigmoid"),
])

print(lenet_custom_model.summary())

lenet_custom_model.compile(optimizer=Adam(learning_rate=0.01),
                          loss=BinaryCrossentropy(),
                          metrics='accuracy')

history = lenet_custom_model.fit(train_dataset,
                                validation_data=val_dataset,
                                epochs=3,
                                verbose=1)

Output:

Traceback (most recent call last):
File "/Users/atatekeli/PycharmProjects/comp-vision-projects/tensorflow-comp-vision/Malaria Detection/malaria_detection.py", line 278, in
    else:
ValueError: setting an array element with a sequence. The requested array has an inhomogeneous shape after 2 dimensions. The detected shape was (5513, 1) + inhomogeneous part.

https://github.com/Killpit/comp-vision-projects/tree/main/tensorflow-comp-vision/Malaria%20Detection

Hi @Ata_Tekeli, the problem is that inp is a list of image arrays with different sizes, so converting that list to an array with np.array(inp) causes this error.

For example

l = [[1, 2, 3], [1, 2]]
np.array(l)
>>> ValueError: setting an array element with a sequence.
The requested array has an inhomogeneous shape after 1 dimensions.
The detected shape was (2,) + inhomogeneous part.

Before converting a list of arrays of different sizes to a NumPy array, make sure that all the arrays in the list have the same size (one possible sketch below). Thank you.
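
For this script, a minimal sketch of that fix (an assumption on my part, not the only option; to_fixed_size and test_dataset_fixed are names introduced here for illustration) would be to resize every test image to IM_SIZE before collecting it, since the model's own Resizing/Rescaling layers handle the rest at inference:

def to_fixed_size(image, label):
    # Resize only; lenet_model already resizes and rescales internally.
    return tf.image.resize(image, (IM_SIZE, IM_SIZE)), label

test_dataset_fixed = test_dataset.unbatch().map(to_fixed_size).batch(1)

labels = []
inp = []
for x, y in test_dataset_fixed.as_numpy_iterator():
    labels.append(y)
    inp.append(x)

labels = np.array([i[0] for i in labels])
inp = np.array(inp)[:, 0, ...]  # now homogeneous: (N, IM_SIZE, IM_SIZE, 3)
predicted = lenet_model.predict(inp)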