PyTorch reimplementation of the Mixer (MLP-Mixer: An all-MLP Architecture for Vision)

Last update: Dec 08, 2022

Overview

MLP-Mixer

PyTorch reimplementation of Google's repository for MLP-Mixer (not yet updated on the master branch), released with the paper MLP-Mixer: An all-MLP Architecture for Vision by Ilya Tolstikhin, Neil Houlsby, Alexander Kolesnikov, Lucas Beyer, Xiaohua Zhai, Thomas Unterthiner, Jessica Yung, Daniel Keysers, Jakob Uszkoreit, Mario Lucic, and Alexey Dosovitskiy.

In this paper, the authors show that an architecture built only from multi-layer perceptrons (MLPs), with no convolutions (CNN) and no self-attention (Transformer), reaches performance close to the state of the art on image classification benchmarks.

MLP-Mixer (Mixer for short) consists of per-patch linear embeddings, Mixer layers, and a classifier head. Each Mixer layer contains one token-mixing MLP and one channel-mixing MLP, and each of these MLPs consists of two fully-connected layers with a GELU nonlinearity between them. Other components include skip-connections, dropout, and a linear classifier head.
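The structure described above can be sketched in a few lines of PyTorch. This is a minimal illustration, not the code from this repository: the class names (`MlpBlock`, `MixerLayer`, `MlpMixerSketch`) and the small default sizes are made up for the example. It also assumes, following the paper, that layer normalization is applied before each MLP, that the per-patch embedding is implemented as a strided convolution, and that tokens are averaged before the linear head.

```python
import torch
from torch import nn


class MlpBlock(nn.Module):
    """Two fully-connected layers with a GELU nonlinearity in between."""

    def __init__(self, dim: int, hidden_dim: int, dropout: float = 0.0):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, hidden_dim),
            nn.GELU(),
            nn.Dropout(dropout),
            nn.Linear(hidden_dim, dim),
            nn.Dropout(dropout),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)


class MixerLayer(nn.Module):
    """One token-mixing MLP and one channel-mixing MLP, each with a skip-connection."""

    def __init__(self, num_tokens: int, hidden_dim: int,
                 tokens_mlp_dim: int, channels_mlp_dim: int, dropout: float = 0.0):
        super().__init__()
        self.norm1 = nn.LayerNorm(hidden_dim)  # assumption: pre-norm, as in the paper
        self.token_mix = MlpBlock(num_tokens, tokens_mlp_dim, dropout)
        self.norm2 = nn.LayerNorm(hidden_dim)
        self.channel_mix = MlpBlock(hidden_dim, channels_mlp_dim, dropout)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x has shape (batch, tokens, channels).
        y = self.norm1(x).transpose(1, 2)          # (batch, channels, tokens)
        x = x + self.token_mix(y).transpose(1, 2)  # mix information across patches
        x = x + self.channel_mix(self.norm2(x))    # mix information across channels
        return x


class MlpMixerSketch(nn.Module):
    """Per-patch linear embedding -> Mixer layers -> linear classifier head."""

    def __init__(self, image_size: int = 32, patch_size: int = 8, num_classes: int = 10,
                 num_layers: int = 2, hidden_dim: int = 64,
                 tokens_mlp_dim: int = 32, channels_mlp_dim: int = 128):
        super().__init__()
        num_tokens = (image_size // patch_size) ** 2
        # A convolution with kernel size == stride == patch size applies the same
        # linear projection to every non-overlapping patch.
        self.embed = nn.Conv2d(3, hidden_dim, kernel_size=patch_size, stride=patch_size)
        self.layers = nn.Sequential(*[
            MixerLayer(num_tokens, hidden_dim, tokens_mlp_dim, channels_mlp_dim)
            for _ in range(num_layers)
        ])
        self.norm = nn.LayerNorm(hidden_dim)
        self.head = nn.Linear(hidden_dim, num_classes)

    def forward(self, images: torch.Tensor) -> torch.Tensor:
        x = self.embed(images)              # (batch, channels, H/P, W/P)
        x = x.flatten(2).transpose(1, 2)    # (batch, tokens, channels)
        x = self.layers(x)
        x = self.norm(x).mean(dim=1)        # average over tokens
        return self.head(x)                 # (batch, num_classes)


if __name__ == "__main__":
    model = MlpMixerSketch()
    logits = model(torch.randn(2, 3, 32, 32))
    print(logits.shape)
```

The token-mixing MLP acts on the transposed tensor, so its weights are shared across all channels, while the channel-mixing MLP is shared across all patches. For reference, the paper's Mixer-B/16 configuration uses 12 layers, hidden size 768, token-mixing width 384, and channel-mixing width 3072; the defaults above are deliberately tiny so the sketch runs quickly on a CPU.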

Usage

1. Download Pre-trained model (Google's Official Checkpoint)

Available models: Mixer-B_16, Mixer-L_16
- ImageNet pre-trained models
  - Mixer-B_16, Mixer-L_16
- ImageNet-21k pre-trained models
  - Mixer-B_16, Mixer-L_16

# ImageNet pre-trained (replace {MODEL_NAME} with Mixer-B_16 or Mixer-L_16)
wget https://storage.googleapis.com/mixer_models/imagenet1k/{MODEL_NAME}.npz

# ImageNet-21k pre-trained
wget https://storage.googleapis.com/mixer_models/imagenet21k/{MODEL_NAME}.npz
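If you prefer to fetch the checkpoints from Python, the same URLs can be assembled programmatically. The sketch below is not part of this repository: the helper names `checkpoint_url` and `download_checkpoint`, the `upstream` keys, and the default `checkpoint` output directory (chosen to match the path used in the fine-tuning command) are assumptions made for illustration. It uses only the standard library.

```python
import os
from urllib.request import urlretrieve

BASE_URL = "https://storage.googleapis.com/mixer_models"
# Hypothetical short keys mapped to the bucket directories used in the wget commands.
UPSTREAM_DIRS = {"imagenet": "imagenet1k", "imagenet21k": "imagenet21k"}
AVAILABLE_MODELS = ("Mixer-B_16", "Mixer-L_16")


def checkpoint_url(model_name: str, upstream: str = "imagenet21k") -> str:
    """Build the download URL for one of the official .npz checkpoints."""
    if model_name not in AVAILABLE_MODELS:
        raise ValueError(f"unknown model {model_name!r}; expected one of {AVAILABLE_MODELS}")
    if upstream not in UPSTREAM_DIRS:
        raise ValueError(f"unknown upstream {upstream!r}; expected one of {tuple(UPSTREAM_DIRS)}")
    return f"{BASE_URL}/{UPSTREAM_DIRS[upstream]}/{model_name}.npz"


def download_checkpoint(model_name: str, upstream: str = "imagenet21k",
                        out_dir: str = "checkpoint") -> str:
    """Download a checkpoint into out_dir and return the local file path."""
    os.makedirs(out_dir, exist_ok=True)
    target = os.path.join(out_dir, f"{model_name}.npz")
    urlretrieve(checkpoint_url(model_name, upstream), target)
    return target


if __name__ == "__main__":
    print(checkpoint_url("Mixer-B_16", "imagenet"))
```

Because the file name does not encode the upstream dataset, downloading both variants of the same model into one directory overwrites the first file; use a separate `out_dir` per upstream dataset if you need both.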

2. Fine-tuning

python3 train.py --name cifar10-100_500 --model_type Mixer-B_16 --pretrained_dir checkpoint/Mixer-B_16.npz

Reproducing Mixer results

upstream	model	dataset	acc (official), %
ImageNet	Mixer-B/16	cifar10	96.72
ImageNet	Mixer-L/16	cifar10	96.59
ImageNet-21k	Mixer-B/16	cifar10	96.82
ImageNet-21k	Mixer-L/16	cifar10	96.34

Reference

Google's Vision Transformer and MLP-Mixer

Citations

@article{tolstikhin2021,
  title={MLP-Mixer: An all-MLP Architecture for Vision},
  author={Tolstikhin, Ilya and Houlsby, Neil and Kolesnikov, Alexander and Beyer, Lucas and Zhai, Xiaohua and Unterthiner, Thomas and Yung, Jessica and Keysers, Daniel and Uszkoreit, Jakob and Lucic, Mario and Dosovitskiy, Alexey},
  journal={arXiv preprint arXiv:2105.01601},
  year={2021}
}

Owner

Eunkwang Jeon
