Here is my TF/Keras implementation of the recent Compositional Attention paper from Mila, which disentangles the search and retrieval components of the attention mechanism. It can be used as a drop-in replacement for standard multi-head attention and outperforms it on some tasks. A minimal sketch of the mechanism is included below for anyone who wants the idea at a glance before reading the code.
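The sketch below is not the repo's actual code; the layer name `CompositionalAttention`, the argument names (`num_searches`, `num_retrievals`, `head_dim`), and the exact weight shapes are my own assumptions based on the paper's description. The idea it illustrates: S search heads each produce attention weights, R retrieval heads each project values, every search retrieves with every value head, and a second soft attention lets each search pick which retrieval it keeps.

```python
import tensorflow as tf
from tensorflow import keras


class CompositionalAttention(keras.layers.Layer):
    """Minimal sketch: S searches x R retrievals with soft retrieval selection."""

    def __init__(self, num_searches=4, num_retrievals=2, head_dim=32, **kwargs):
        super().__init__(**kwargs)
        self.S = num_searches    # number of search heads (Q/K pairs)
        self.R = num_retrievals  # number of retrieval heads (V projections)
        self.d = head_dim        # per-head dimension

    def build(self, input_shape):
        model_dim = int(input_shape[-1])
        init = "glorot_uniform"
        # Search projections: one Q and one K per search head.
        self.wq = self.add_weight(name="wq", shape=(self.S, model_dim, self.d), initializer=init)
        self.wk = self.add_weight(name="wk", shape=(self.S, model_dim, self.d), initializer=init)
        # Retrieval projections: one V per retrieval head.
        self.wv = self.add_weight(name="wv", shape=(self.R, model_dim, self.d), initializer=init)
        # Retrieval-selection query (per search) and a shared key projection.
        self.wq_sel = self.add_weight(name="wq_sel", shape=(self.S, model_dim, self.d), initializer=init)
        self.wk_sel = self.add_weight(name="wk_sel", shape=(self.d, self.d), initializer=init)
        # Output projection back to the model dimension.
        self.wo = self.add_weight(name="wo", shape=(self.S * self.d, model_dim), initializer=init)

    def call(self, x):
        # x: (batch, seq_len, model_dim)
        scale = tf.sqrt(tf.cast(self.d, x.dtype))
        q = tf.einsum("btm,smd->bstd", x, self.wq)   # (B, S, T, d)
        k = tf.einsum("btm,smd->bstd", x, self.wk)   # (B, S, T, d)
        v = tf.einsum("btm,rmd->brtd", x, self.wv)   # (B, R, T, d)

        # Search: one attention matrix per search head.
        attn = tf.nn.softmax(tf.einsum("bstd,bsud->bstu", q, k) / scale, axis=-1)

        # Retrieve every value head with every search head: (B, S, R, T, d).
        o = tf.einsum("bstu,brud->bsrtd", attn, v)

        # Retrieval selection: each search softly chooses among the R retrievals.
        q_sel = tf.einsum("btm,smd->bstd", x, self.wq_sel)       # (B, S, T, d)
        k_sel = tf.einsum("bsrtd,de->bsrte", o, self.wk_sel)     # (B, S, R, T, d)
        sel = tf.nn.softmax(
            tf.einsum("bstd,bsrtd->bsrt", q_sel, k_sel) / scale, axis=2)  # over R

        # Weighted sum over retrievals, then merge the search heads.
        out = tf.einsum("bsrt,bsrtd->bstd", sel, o)              # (B, S, T, d)
        out = tf.transpose(out, [0, 2, 1, 3])                    # (B, T, S, d)
        out = tf.reshape(out, [tf.shape(x)[0], tf.shape(x)[1], self.S * self.d])
        return tf.einsum("btn,nm->btm", out, self.wo)            # (B, T, model_dim)


# Quick shape check with a hypothetical configuration.
layer = CompositionalAttention(num_searches=4, num_retrievals=2, head_dim=32)
y = layer(tf.random.normal([2, 10, 64]))  # -> (2, 10, 64)
```

Setting `num_retrievals` equal to `num_searches` and fixing the selection to the identity would recover something close to standard multi-head attention, which is why it works as a drop-in replacement.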
Nice work! Congrats!