Image classification with ConvMixer

Sayak_Paul · October 18, 2021, 4:34pm

What happens when we apply pure convolution blocks to patches of images? That is the idea behind ConvMixer, the architecture that has recently been talked about a lot on Twitter. We can train a network with 0.8 million parameters for 10 epochs on CIFAR-10 and get ~83% top-1 test accuracy without having to use any fancy regularization.
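To give a rough feel for where the parameters of such a network sit, here is a small pure-Python sketch that counts them stage by stage. It follows the layer layout described in the ConvMixer paper: a strided patch-embedding convolution, then repeated blocks of a depthwise convolution and a pointwise convolution, each followed by batch normalization, and finally a linear classifier. The function name `convmixer_param_count` and all default hyperparameters are assumptions made for illustration. The post does not state the exact configuration, so the total printed here need not match the 0.8 million figure above.

```python
def convmixer_param_count(filters=256, depth=8, kernel_size=5, patch_size=2,
                          in_channels=3, num_classes=10):
    """Count parameters of a ConvMixer-style network, stage by stage.

    All default values are illustrative assumptions, not the post's settings.
    """
    # Batch norm: scale and shift (trainable) plus running mean and variance
    # (non-trainable), so four values per channel.
    batch_norm = 4 * filters

    # Patch embedding: a patch_size x patch_size convolution with the same stride.
    # Weights plus biases, followed by batch norm (the activation has no parameters).
    stem = (patch_size * patch_size * in_channels * filters + filters) + batch_norm

    # Depthwise convolution: one kernel_size x kernel_size filter per channel.
    # It mixes spatial locations only.
    depthwise = (kernel_size * kernel_size * filters + filters) + batch_norm

    # Pointwise (1 x 1) convolution: mixes channels only.
    pointwise = (filters * filters + filters) + batch_norm

    blocks = depth * (depthwise + pointwise)

    # Global average pooling has no parameters; a linear layer produces the logits.
    head = filters * num_classes + num_classes

    return {"stem": stem, "blocks": blocks, "head": head,
            "total": stem + blocks + head}


counts = convmixer_param_count()
for name, n in counts.items():
    print(f"{name:>6}: {n:,}")
```

Playing with `filters`, `depth`, and `kernel_size` shows that the pointwise convolutions account for most of the parameters, since their cost grows with the square of the channel count.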

There are a few visualizations of the internals of ConvMixer that might be useful for the community.

Learned patch embeddings:

Convolution kernel from the middle of the network showing varying locality spans:
