Fine-tuning a pre-trained model while replacing one of the pre-trained layers with a new PyTorch layer

Hi everyone!

I want to fine-tune a pre-trained BERT model from the official BERT repository, replacing one of the pre-trained dense layers with a custom PyTorch layer.

I’ve been trying to implement this, but I haven’t been able to figure it out. So far I’ve learned that the get_assignment_map_from_checkpoint function matches the current graph’s variables against the checkpoint’s variables and returns an assignment map, which is then passed to tf.train.init_from_checkpoint.
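If you stay on the TensorFlow side, one way to keep a replaced layer out of checkpoint initialization is to filter its variables out of that assignment map before calling tf.train.init_from_checkpoint. Here is a minimal TF1-style sketch, assuming modeling.py from the official BERT repo is importable; the checkpoint path and the scope name `bert/encoder/layer_11/output/dense` are placeholders for your own setup:

```python
import tensorflow as tf  # TF1-style APIs, as used by the official BERT repo
import modeling          # modeling.py from the official BERT repository

init_checkpoint = "uncased_L-12_H-768_A-12/bert_model.ckpt"  # placeholder path
tvars = tf.trainable_variables()

# Build the usual mapping from checkpoint variables to graph variables.
assignment_map, initialized_variable_names = (
    modeling.get_assignment_map_from_checkpoint(tvars, init_checkpoint))

# Drop the replaced layer's variables from the map so they keep their fresh
# initialization instead of being overwritten with checkpoint weights.
# The scope name below is a placeholder for whichever layer you swap out.
replaced_scope = "bert/encoder/layer_11/output/dense"
assignment_map = {
    name: value
    for name, value in assignment_map.items()
    if not name.startswith(replaced_scope)
}

tf.train.init_from_checkpoint(init_checkpoint, assignment_map)
```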

Any tips/advice/suggestions would be greatly appreciated. Thank you in advance for your help!

Hello @Conor_Warren

To my understanding, one possible way of doing this is to load the model checkpoint, identify the layer you want to work on, and then copy the weights of that layer into the replacement PyTorch layer (see the sketch below).
The general way of working with TensorFlow models in PyTorch is to convert them to PyTorch, verify that the converted model reaches similar inference accuracy, and then work with the model in PyTorch.
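A minimal sketch of that idea, assuming the Hugging Face transformers library as the conversion path (it also ships a convert_bert_original_tf_checkpoint_to_pytorch script for the official TF checkpoints). The layer index (11) and the `MyDense` module are placeholders for whichever layer you actually want to replace:

```python
import torch
import torch.nn as nn
from transformers import BertModel

# Load the converted pre-trained model (here from the hub for simplicity).
model = BertModel.from_pretrained("bert-base-uncased")

class MyDense(nn.Module):
    """Hypothetical custom replacement for one of BERT's dense layers."""
    def __init__(self, in_features, out_features):
        super().__init__()
        self.linear = nn.Linear(in_features, out_features)

    def forward(self, x):
        return self.linear(x)

# Swap out the output dense layer of encoder block 11 and, per the advice
# above, copy the pre-trained weights into the replacement layer.
old = model.encoder.layer[11].output.dense
new = MyDense(old.in_features, old.out_features)
with torch.no_grad():
    new.linear.weight.copy_(old.weight)
    new.linear.bias.copy_(old.bias)
model.encoder.layer[11].output.dense = new
```

After the swap, running the same inputs through the original and modified models and comparing outputs is an easy way to verify that the replacement behaves as expected before fine-tuning.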

Thank you.