Problem with stateful LSTM and static batch size

Arno_Kimeswenger · September 11, 2024, 3:18pm

Hi,
I want to use a simple LSTM model with stateful=True. Therefore I use

import numpy as np
import tensorflow as tf

window_length = 3
batch_size = 1
number_time_series = 1

timeseries_1 = np.arange(150).reshape(-1, 1)
# timeseries_2 = np.arange(150).reshape(-1, 1)

X = tf.keras.utils.timeseries_dataset_from_array(
    timeseries_1,
    # np.concatenate([timeseries_1, timeseries_2], axis=1),
    targets=timeseries_1[window_length:],
    sequence_stride=window_length,
    sequence_length=window_length,
    batch_size=batch_size
)

# list(X)

model = tf.keras.models.Sequential()
model.add(tf.keras.layers.InputLayer(batch_input_shape=(batch_size, window_length, number_time_series)))
model.add(tf.keras.layers.LSTM(units=8, stateful=True))
model.add(tf.keras.layers.Dense(units=1))
model.compile(loss=tf.keras.losses.Huber(), optimizer="adam", metrics=["mae"])


class ResetStatesCallback(tf.keras.callbacks.Callback):
    def on_epoch_begin(self, epoch, logs):
        for layer in self.model.layers:
            if hasattr(layer, "reset_states"):
                layer.reset_states()


model.fit(X, epochs=3, batch_size=batch_size, callbacks=[ResetStatesCallback()], shuffle=False)

But when I run this I get

Input tensor `sequential_1/lstm_1/ReadVariableOp:0` enters the loop with shape (1, 8), but has shape (None, 8) after one iteration. To allow the shape to vary across iterations, use the `shape_invariants` argument of tf.while_loop to specify a less-specific shape.

Arguments received by LSTM.call():
  • sequences=tf.Tensor(shape=(None, None, 1), dtype=float32)
  • initial_state=None
  • mask=None
  • training=True

In some tutorials, e.g.
https://machinelearningmastery.com/understanding-stateful-lstm-recurrent-neural-networks-python-keras/
I can find something like

model.add(tf.keras.layers.LSTM(units=8, batch_input_shape=(batch_size, window_length, number_time_series), stateful=True))

I also think that it worked one year ago with this solution, but it seems that there is no argument batch_inpupt_shape in LSTM

ValueError: Unrecognized keyword arguments passed to LSTM: {'batch_input_shape': (1, 3, 1)}

Can you please give me a hint? Thank you very much!
Arno

Arno_Kimeswenger · September 11, 2024, 3:48pm

I think that the problem is that X has the shape element_spec=(TensorSpec(shape=(None, None, 1).
I tried

def enforce_shape(x, y):
    x = tf.ensure_shape(x, [batch_size, window_length, number_time_series])  # Set the shape for inputs
    y = tf.ensure_shape(y, [batch_size, 1])  # Set the shape for targets
    return x, y

# Apply the shape enforcement using map
X = X.map(enforce_shape)

and it the model is fitted without errors. Is there a “nicer” way to solve the issue?
Thanks, Arno

Kiran_Sai_Ramineni · September 12, 2024, 8:23am

Hi @Arno_Kimeswenger, The other way is adding preprocessing layers in the model architecture. But the purpose of both will be the same one is preprocessing the data before passing to the model and the other will be preprocessing the data after passing to the model. Thank You.

Arno_Kimeswenger · September 12, 2024, 2:24pm

Thank you very much!

Topic		Replies	Views
Help! I dont understand "input_shape" more General Discussion models , datasets , keras , help_request	3	2270	July 15, 2021
LSTM false predictions when timestep changes General Discussion help_request , keras , models	4	982	November 29, 2021
LSTM input size Keras models , help_request	1	1534	September 5, 2023
Time Series Data: Sequence Classification General Discussion timeseries , classification	1	67	June 5, 2024
I get error in epoch line General Discussion tfkeras , training , datasets	3	1092	January 27, 2024

Problem with stateful LSTM and static batch size

Related topics