Save a tensorflow model with a transformer layer

Constantin_Werner · January 23, 2022, 4:56pm

Hi

I trained a model with the following architecture:

bert_config = BertConfig.from_pretrained(MODEL_NAME)
bert_config.output_hidden_states = True
backbone = TFAutoModelForSequenceClassification.from_pretrained(MODEL_NAME,config=bert_config)

input_ids = tf.keras.layers.Input(shape=(MAX_LENGTH,), name='input_ids', dtype='int32')
features = backbone(input_ids)[1][-1]
pooling =  tf.keras.layers.GlobalAveragePooling1D()(features)
dense = tf.keras.layers.Dense(len(label2id), name='output',activation=tf.nn.softmax)(pooling)
    
model = tf.keras.Model(inputs=[input_ids], outputs = [dense])

and saved model in different ways. The first one is

model.save_weights('/content/drive/MyDrive/weights/weights')

and the second one is

model.save_weights('/content/drive/MyDrive/weights/weights.h5')

So, I am able to load my model (model.load_weights()) from both of these options without any error. Moreover, inference is fine. In short, everything works as I expect.

But if I start a new session and load my model again then inference is bad, like model has random weights instead of my saved weights.

I was trying other options of saving models as well, but they do not work also. Probably there is a special way to save a model with transformer layer?

Thanks in advance!

Bhack · January 25, 2022, 3:05pm

Do you have experienced the same problem with model.save?

Constantin_Werner · January 25, 2022, 3:24pm

Hi

Unfortunately, yes

Bhack · January 25, 2022, 4:01pm

Have you tried to post this on the Hugginface forum?
As it seems that you are using one of their classes TFAutoModelForSequenceClassification

Edit:
I suppose your post is this one:

Bhack · January 25, 2022, 4:07pm

Have you tried if it works with the tf.function and model signature approach:

Bhack · January 25, 2022, 4:12pm

See also:
https://github.com/huggingface/transformers/issues/3246

Samuel_Chiji · January 6, 2023, 4:20am

infact your code is ok just follow my first post and you will get reproducible results anytime you run on colab, even if your session expired, even you dont want function you can just do it this way just after importing these two modules:

from tensorflow.python.framework import ops
import tensorflow as tf

seed=42
ops.reset_default_graph()
tf.random.set_seed(seed)
np.random.seed(seed)

Samuel_Chiji · January 6, 2023, 4:20am

see the problem is that tensorflow reinitialize variables when your session expires, so to solve this problem, you have to manually set your own random seed after importing tensorflow and numpy.

create a function this way:

from tensorflow.python.framework import ops
import tensorflow as tf
SEED=42 #choose any seed of your choice
def reproducibleResult(seed:int):
ops.reset_default_graph()
tf.random.set_seed(seed)
np.random.seed(seed)

#call your function and problem solved
reproducibleResult(SEED)

Topic		Replies	Views
Try out ConvNeXt in Keras! TensorFlow models , keras , tfhub , education	26	6138	November 8, 2023
Keras custom models deteriorates after save and reload General Discussion models , keras , help_request	20	1936	August 19, 2021
Tensorflow saved model loading issue General Discussion models , keras , help_request	2	6259	August 4, 2021
Convert Keras '.h5' Model to TensorFlow SavedModel (saved_model.pb) General Discussion models , keras , help_request	5	4911	July 28, 2021
Option to save model as keras model in tensorflow object detection api General Discussion models , object-detection , keras , help_request	1	1093	June 7, 2022

Save a tensorflow model with a transformer layer

Related topics