MobileFormer

An implementation of MobileFormer proposed by Yinpeng Chen, Xiyang Dai et al.

Including

[1] Mobile-Former proposed in: 
                        Yinpeng Chen, Xiyang Dai et al., Mobile-Former: Bridging MobileNet and Transformer. 
                        arxiv.org/abs/2108.05895
[2] Dynamtic ReLU proposed in: 
                        Yinpeng Chen, Xiyang Dai et al., Dynamtic ReLU. 
                        arxiv.org/abs/2003.10027v2
[3] Lite-BottleNeck proposed in: 
                        Yunsheng Li, Yinpeng Chen et al., MicroNet: Improving Image Recognition with Extremely Low FLOPs. 
                        arxiv.org/abs/2108.05894v1
[4] Adam-W proposed in:
                        Ilya Loshchilov & Frank Hutter, Decoupled Weight Decay Regularization.
                        arxiv.org/abs/1711.05101v3
[5] Mixup proposed in:
                        Hongyi Zhang, Moustapha Cisse et al., Mixup: Beyond Empircal Risk Minimization.
                        arxiv.org/abs/1710.09412
[6] Multi-FocalLoss (not used), focal loss is proposed in:
                        Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Dollár, Focal Loss for Dense Object Detection.
                        arxiv.org/abs/1708.02002

Note

(1) Due to the expanded DW conv used in strided Mobile-Former blocks, 
    the out_channel should be divisible by expand_size of the next block.
(2) Adam-W and Mixup is embedded in train.py.
(3) Use run() in train.py to train('run') or search('search'). There is an example in the train.py.

'###### The '#'s #######'

'##### are aligned #####'

No pre-train parameters for now.

An implementation of MobileFormer

Related tags

Overview

MobileFormer

Including

Note

'###### The '#'s #######'

'##### are aligned #####'

Owner

slwang9353

Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields.

ColBERT: Contextualized Late Interaction over BERT (SIGIR'20)

The final project of "Applying AI to EHR Data" of "AI for Healthcare" nanodegree - Udacity.

PromptDet: Expand Your Detector Vocabulary with Uncurated Images

The PyTorch implementation of paper REST: Debiased Social Recommendation via Reconstructing Exposure Strategies

Companion code for the paper Theoretical characterization of uncertainty in high-dimensional linear classification

:hot_pepper: R²SQL: "Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing." (AAAI 2021)

HyDiff: Hybrid Differential Software Analysis

This is the code for CVPR 2021 oral paper: Jigsaw Clustering for Unsupervised Visual Representation Learning

Implementation of the paper "Shapley Explanation Networks"

1st Solution For NeurIPS 2021 Competition on ML4CO Dual Task

Specificity-preserving RGB-D Saliency Detection

A large-scale database for graph representation learning

A library for preparing, training, and evaluating scalable deep learning hybrid recommender systems using PyTorch.

Official implementation for the paper "Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection"

Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

Genetic feature selection module for scikit-learn

This project generates news headlines using a Long Short-Term Memory (LSTM) neural network.

Config files for my GitHub profile.

Towards uncontrained hand-object reconstruction from RGB videos