Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision

Last update: Dec 23, 2022

Overview

MLP-Mixer: An all-MLP Architecture for Vision

This repo contains PyTorch implementation of MLP-Mixer: An all-MLP Architecture for Vision.

Usage :

import torch
import numpy as np
from mlp-mixer import MLPMixer

img = torch.ones([1, 3, 224, 224])

model = MLPMixer(in_channels=3, image_size=224, patch_size=16, num_classes=1000,
                 dim=512, depth=8, token_dim=256, channel_dim=2048)

parameters = filter(lambda p: p.requires_grad, model.parameters())
parameters = sum([np.prod(p.size()) for p in parameters]) / 1_000_000
print('Trainable Parameters: %.3fM' % parameters)

out_img = model(img)

print("Shape of out :", out_img.shape)  # [B, in_channels, image_size, image_size]

Citation :

@misc{tolstikhin2021mlpmixer,
      title={MLP-Mixer: An all-MLP Architecture for Vision}, 
      author={Ilya Tolstikhin and Neil Houlsby and Alexander Kolesnikov and Lucas Beyer and Xiaohua Zhai and Thomas Unterthiner and Jessica Yung and Daniel Keysers and Jakob Uszkoreit and Mario Lucic and Alexey Dosovitskiy},
      year={2021},
      eprint={2105.01601},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledgement :

Some component borrowed from ViT code of @lucidrains repo : https://github.com/lucidrains/vit-pytorch

Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision

Related tags

Overview

MLP-Mixer: An all-MLP Architecture for Vision

Usage :

Citation :

Acknowledgement :

Owner

Rishikesh (ऋषिकेश)

PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)

Evaluating different engineering tricks that make RL work

Collection of in-progress libraries for entity neural networks.

Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks

A universal memory dumper using Frida

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务

Fermi Problems: A New Reasoning Challenge for AI

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

a pytorch implementation of auto-punctuation learned character by character

This repo contains the official code and pre-trained models for the Dynamic Vision Transformer (DVT).

A curated list of awesome Active Learning

(ICCV 2021 Oral) Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation.

Time series annotation library.

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

This is the source code for the experiments related to the paper Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

On the adaptation of recurrent neural networks for system identification

PyTorch implementation for our NeurIPS 2021 Spotlight paper "Long Short-Term Transformer for Online Action Detection".

Solver for Large-Scale Rank-One Semidefinite Relaxations

Camera-caps - Examine the camera capabilities for V4l2 cameras

Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.

Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision

Related tags

Overview

MLP-Mixer: An all-MLP Architecture for Vision

Usage :

Citation :

Acknowledgement :

Owner

Rishikesh (ऋषिकेश)

PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)

Evaluating different engineering tricks that make RL work

Collection of in-progress libraries for entity neural networks.

Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks

A universal memory dumper using Frida

“英特尔创新大师杯”深度学习挑战赛 赛道3：CCKS2021中文NLP地址相关性任务

Fermi Problems: A New Reasoning Challenge for AI

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

a pytorch implementation of auto-punctuation learned character by character

This repo contains the official code and pre-trained models for the Dynamic Vision Transformer (DVT).

A curated list of awesome Active Learning

(ICCV 2021 Oral) Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation.

Time series annotation library.

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

This is the source code for the experiments related to the paper Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

On the adaptation of recurrent neural networks for system identification

PyTorch implementation for our NeurIPS 2021 Spotlight paper "Long Short-Term Transformer for Online Action Detection".

Solver for Large-Scale Rank-One Semidefinite Relaxations

Camera-caps - Examine the camera capabilities for V4l2 cameras

Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务