3 months ago

MLP-Mixer: An all-MLP Architecture for Vision

Ilya Tolstikhin Neil Houlsby Alexander Kolesnikov Lucas Beyer Xiaohua Zhai Thomas Unterthiner Jessica Yung Andreas Steiner Daniel Keysers Jakob Uszkoreit

Abstract

Convolutional Neural Networks (CNNs) are the go-to model for computer vision. Recently, attention-based networks, such as the Vision Transformer, have also become popular. In this paper we show that while convolutions and attention are both sufficient for good performance, neither of them are necessary. We present MLP-Mixer, an architecture based exclusively on multi-layer perceptrons (MLPs). MLP-Mixer contains two types of layers: one with MLPs applied independently to image patches (i.e. "mixing" the per-location features), and one with MLPs applied across patches (i.e. "mixing" spatial information). When trained on large datasets, or with modern regularization schemes, MLP-Mixer attains competitive scores on image classification benchmarks, with pre-training and inference cost comparable to state-of-the-art models. We hope that these results spark further research beyond the realms of well established CNNs and Transformers.

Code Repositories

BR-IDL/PaddleViT/blob/main/image_classification/MLP-Mixer

paddle

ericleixd/mlpMixer-MindSpore

mindspore

bangoc123/mlp-mixer

Mentioned in GitHub

KiUngSong/Vision

pytorch

Mentioned in GitHub

jm12138/MLP-Mixer-Paddle

paddle

jankrepl/mildlyoverfitted

jax

jeonsworld/MLP-Mixer-Pytorch

pytorch

Mentioned in GitHub

PaddlePaddle/PASSL

paddle

rwightman/pytorch-image-models

pytorch

Mentioned in GitHub

asarigun/MixerGANsformer

pytorch

Mentioned in GitHub

lucidrains/mlp-mixer-pytorch

pytorch

Mentioned in GitHub

sayakpaul/MLP-Mixer-CIFAR10

Mentioned in GitHub

jaketae/mlp-mixer

pytorch

Mentioned in GitHub

liuruiyang98/Jittor-MLP

jax

Mentioned in GitHub

luutn2002/mixer_test

pytorch

Mentioned in GitHub

ashishpatel26/Vision-Transformer-Keras-Tensorflow-Pytorch-Examples

pytorch

google-research/vision_transformer

Official

jax

Mentioned in GitHub

martinsbruveris/tensorflow-image-models

Mentioned in GitHub

IMvision12/keras-vision-models

pytorch

Mentioned in GitHub

sradc/nd-mlp-mixer

Mentioned in GitHub

zer0sh0t/artificial_intelligence/tree/master/vision_models/mlp_mixer

pytorch

rishikksh20/MLP-Mixer-pytorch

pytorch

Mentioned in GitHub

xuwkk/task_aware_machine_unlearning

pytorch

Mentioned in GitHub

04RR/SOTA-Vision

pytorch

Mentioned in GitHub

Elman295/MLP-Mixer-for-MNIST-Classification

yangyucheng000/mlpMixer

mindspore

MiuGod0126/Mlp-Mixer-Paddle

paddle

Mentioned in GitHub

lavish619/MLP-Mixer-PyTorch

pytorch

Mentioned in GitHub

omihub777/mlp-mixer-cifar

pytorch

Mentioned in GitHub

Benjamin-Etheredge/mlp-mixer-keras

Mentioned in GitHub

ttt496/VisionTransformer

jax

Mentioned in GitHub

imad08/MLP-Mixer

pytorch

DarshanDeshpande/jax-models

jax

Mentioned in GitHub

leondgarse/keras_cv_attention_models/tree/main/keras_cv_attention_models/mlp_family

Oguzhanercan/MLP-Mixer

pytorch

Mentioned in GitHub

Mayurji/Image-Classification-PyTorch

pytorch

Mentioned in GitHub

engichang1467/kan-mixer

pytorch

Mentioned in GitHub

labmlai/annotated_deep_learning_paper_implementations

pytorch

engichang1467/MLP-Mixer-Reimplementation

pytorch

Mentioned in GitHub

YeongHyeon/MLP-Mixer-TF2

isaaccorley/mlp-mixer-pytorch

pytorch

Mentioned in GitHub

xmu-xiaoma666/MLP-Mixer-pytorch

pytorch

Mentioned in GitHub

xmu-xiaoma666/External-Attention-pytorch

pytorch

Mentioned in GitHub

YeongHyeon/MLP-Mixer-PyTorch

pytorch

leaderj1001/Bag-of-MLP

pytorch

Mentioned in GitHub

Nguyendat-bit/MLP-Mixer

Mentioned in GitHub

himanshu-dutta/MLPMixer-pytorch

pytorch

Mentioned in GitHub

mli-lab/imaging_mlps

pytorch

Mentioned in GitHub

qwopqwop200/MLP-Mixer-tf2

Benchmarks

Benchmark	Methodology	Metrics
image-classification-on-imagenet	ViT-L/16 Dosovitskiy et al. (2021)	Top 1 Accuracy: 85.3%
image-classification-on-imagenet	Mixer-H/14 (JFT-300M pre-train)	Hardware Burden: Operations per network pass: Top 1 Accuracy: 87.94%
image-classification-on-imagenet	Mixer-B/16	Number of params: 46M Top 1 Accuracy: 76.44%
image-classification-on-imagenet-real	Mixer-H/14 (JFT-300M pre-train)	Accuracy: 87.86% Params: 409M
image-classification-on-imagenet-real	Mixer-H/14- 448 (JFT-300M pre-train)	Accuracy: 90.18% Params: 409M
image-classification-on-omnibenchmark	MLP-Mixer	Average Top-1 Accuracy: 32.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

MLP-Mixer: An all-MLP Architecture for Vision

Ilya Tolstikhin Neil Houlsby Alexander Kolesnikov Lucas Beyer Xiaohua Zhai Thomas Unterthiner Jessica Yung Andreas Steiner Daniel Keysers Jakob Uszkoreit2 more

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters

Ilya Tolstikhin Neil Houlsby Alexander Kolesnikov Lucas Beyer Xiaohua Zhai Thomas Unterthiner Jessica Yung Andreas Steiner Daniel Keysers Jakob Uszkoreit