awesome-MIM

Reading list for research topics in Masked Image Modeling (MIM).

We list the most popular methods for MIM; if we missed something, please submit a request. (Note: the date shown is that of the first arXiv version, but the paper link may not point to the earliest version.)

Self-supervised Vision Transformers as backbone models.

Date	Method	Conference	Title	Code
2021-06-14	BEiT	ICLR 2022 (Oral)	BEiT: BERT Pre-Training of Image Transformers	BEiT
2021-11-11	MAE	arXiv 2021	Masked Autoencoders Are Scalable Vision Learners	MAE
2021-11-15	iBOT	arXiv 2021	iBOT: Image BERT Pre-Training with Online Tokenizer	iBOT
2021-11-18	SimMIM	arXiv 2021	SimMIM: A Simple Framework for Masked Image Modeling	SimMIM
2021-12-16	MaskFeat	arXiv 2021	Masked Feature Prediction for Self-Supervised Visual Pre-Training	None
2021-12-20	SplitMask	arXiv 2021	Are Large-scale Datasets Necessary for Self-Supervised Pre-training?	None
2022-01-31	ADIOS	arXiv 2022	Adversarial Masking for Self-Supervised Learning	None
2022-02-07	CAE	arXiv 2022	Context Autoencoder for Self-Supervised Representation Learning	None
2022-02-07	CIM	arXiv 2022	Corrupted Image Modeling for Self-Supervised Visual Pre-Training	None

Owner

ligang
