MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

A pytorch implementation of MicroNet. If you use this code in your research please consider citing

@article{li2021micronet, title={MicroNet: Improving Image Recognition with Extremely Low FLOPs}, author={Li, Yunsheng and Chen, Yinpeng and Dai, Xiyang and Chen, Dongdong and Liu, Mengchen and Yuan, Lu and Liu, Zicheng and Zhang, Lei and Vasconcelos, Nuno}, journal={arXiv preprint arXiv:2108.05894}, year={2021} }

Requirements

Linux or macOS with Python ≥ 3.6.
Anaconda3, PyTorch ≥ 1.5 with matched torchvision

Models

Model	#Param	MAdds	Top-1	download
MicroNet-M3	2.6M	21M	62.5	model
MicroNet-M2	2.4M	12M	59.4	model
MicroNet-M1	1.8M	6M	51.4	model
MicroNet-M0	1.0M	4M	46.6	model

Evaluate MicroNet on ImageNet

Download the pretrained MicroNet M0-M3 with the link above. The scripts used for evaluation can be found here. For example, if you want to test MicroNet-M3, you can use the following command.

sh scripts/eval_micronet_m3.sh /path/to/imagenet /path/to/output /path/to/pretrained_model

Train MicroNet on ImageNet

The scripts used for training MicroNet M0-M3 can be found here and can be implemented as follows (You can choose to use different scripts for 2 gpu or 4 gpu training based on the resources you can access).

sh scripts/train_micronet_m3_4gpu.sh /path/to/imagenet /path/to/output

MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

Related tags

Overview

MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

Requirements

Models

Evaluate MicroNet on ImageNet

Train MicroNet on ImageNet

Owner

Yunsheng Li

VOneNet: CNNs with a Primary Visual Cortex Front-End

This is an example of object detection on Micro bacterium tuberculosis using Mask-RCNN

PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations" [arXiv 2022].

这是一个yolox-pytorch的源码，可以用于训练自己的模型。

Source code of article "Towards Toxic and Narcotic Medication Detection with Rotated Object Detector"

Implementation of "Learning to Match Features with Seeded Graph Matching Network" ICCV2021

Code for generating a single image pretraining dataset

PyTorch implementation of Rethinking Positional Encoding in Language Pre-training

NAS-FCOS: Fast Neural Architecture Search for Object Detection (CVPR 2020)

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View

a Lightweight library for sequential learning agents, including reinforcement learning

The code for SAG-DTA: Prediction of Drug–Target Affinity Using Self-Attention Graph Network.

A distributed, plug-n-play algorithm for multi-robot applications with a priori non-computable objective functions

Lightweight, Python library for fast and reproducible experimentation :microscope:

PFFDTD is an open-source FDTD simulator for 3D room acoustics

A Protein-RNA Interface Predictor Based on Semantics of Sequences

Detecting Blurred Ground-based Sky/Cloud Images

PyTorch implementations of algorithms for density estimation

Learning to Predict Gradients for Semi-Supervised Continual Learning