This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".

Last update: Dec 12, 2022

Related tags

Overview

AS-MLP architecture for Image Classification

Model Zoo

Image Classification on ImageNet-1K

Network	Resolution	Top-1 (%)	Params	FLOPs	Throughput (image/s)	model
AS-MLP-T	224x224	81.3	28M	4.4G	1047	onedrive
AS-MLP-S	224x224	83.1	50M	8.5G	619	onedrive
AS-MLP-B	224x224	83.3	88M	15.2G	455	onedrive

Usage

Install

Clone this repo:

git clone https://github.com/svip-lab/AS-MLP
cd AS-MLP

Create a conda virtual environment and activate it:

conda create -n asmlp python=3.7 -y
conda activate asmlp

Install CUDA==10.1 with cudnn7 following the official installation instructions
Install PyTorch==1.7.1 and torchvision==0.8.2 with CUDA==10.1:

conda install pytorch==1.7.1 torchvision==0.8.2 cudatoolkit=10.1 -c pytorch

Install timm==0.3.2:

pip install timm==0.3.2

Install cupy-cuda101:

pip install cupy-cuda101

Install Apex:

git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./

Install other requirements:

pip install opencv-python==4.4.0.46 termcolor==1.1.0 yacs==0.1.8

Evaluation

To evaluate a pre-trained AS-MLP on ImageNet val, run:

bash train_scripts/test.sh

Training from scratch

To train a AS-MLP on ImageNet from scratch, run:

bash train_scripts/train.sh

You can easily reproduce our results. Enjoy!

Throughput

To measure the throughput, run:

bash train_scripts/get_throughput.sh

Citation

If this project is helpful for you, you can cite our paper:

@article{Lian_2021_ASMLP,
  author = {Lian, Dongze and Yu, Zehao and Sun, Xing and Gao, Shenghua},
  title = {AS-MLP: An Axial Shifted MLP Architecture for Vision},
  journal={arXiv preprint arXiv:2107.08391},
  year = {2021}
}

Acknowledgement

The code is built upon Swin-Transformer

This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".

Related tags

Overview

AS-MLP architecture for Image Classification

Model Zoo

Image Classification on ImageNet-1K

Usage

Install

Evaluation

Training from scratch

Throughput

Citation

Acknowledgement

Owner

SVIP Lab

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

LLVM-based compiler for LightGBM gradient-boosted trees. Speeds up prediction by ≥10x.

On Evaluation Metrics for Graph Generative Models

Collision risk estimation using stochastic motion models

Perturb-and-max-product: Sampling and learning in discrete energy-based models

Gesture-controlled Video Game. Just swing your finger and play the game without touching your PC

The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization".

Use VITS and Opencpop to develop singing voice synthesis; Maybe it will VISinger.

Pytorch implementation of "Training a 85.4% Top-1 Accuracy Vision Transformer with 56M Parameters on ImageNet"

Machine learning library for fast and efficient Gaussian mixture models

Course materials for Fall 2021 "CIS6930 Topics in Computing for Data Science" at New College of Florida

CATE: Computation-aware Neural Architecture Encoding with Transformers

Go from graph data to a secure and interactive visual graph app in 15 minutes. Batteries-included self-hosting of graph data apps with Streamlit, Graphistry, RAPIDS, and more!

Code implementation of "Sparsity Probe: Analysis tool for Deep Learning Models"

Weighing Counts: Sequential Crowd Counting by Reinforcement Learning

ReferFormer - Official Implementation of ReferFormer

Pyramid addon for OpenAPI3 validation of requests and responses.

GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition

Repo for the ACMMM20 submission: "Personalized breath based biometric authentication with wearable multimodality".

UMPNet: Universal Manipulation Policy Network for Articulated Objects