Hi.
Given the attached architecture, the model is already trained and has the weights loaded. I would like to perform an ablation study by disabling or removing one component (example: text). after some research I have found that I can create an empty input containing zeros and fed it to the network but I am not sure about that.
Please can you share some doc or tell me how to do it.
I don’t want to create the model from scratch that handle the tree components (scene, face and audio)
Thank you