UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

Last update: Jan 02, 2023

Related tags

Overview

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

This is the official PyTorch implementation for UniMoCo paper:

@article{dai2021unimoco,
  author  = {Zhigang Dai and Bolun Cai and Yugeng Lin and Junying Chen},
  title   = {UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning},
  journal = {arXiv preprint arXiv:2103.10773},
  year    = {2021},
}

In UniMoCo, we generalize MoCo to a unified contrastive learning framework, which supports unsupervised, semi-supervised and full-supervised visual representation learning. Based on MoCo, we maintain a label queue to store supervised labels. With the label queue, we can construct the multi-hot target on-the-fly, which represents postives and negatives of the given query. Besides, we propose a unified contrastive loss to deal with arbitrary number of positives and negatives. There is a comparison between MoCo and UniMoCo.

ImageNet Pre-training

Data Preparation

Install PyTorch and ImageNet dataset following the official PyTorch ImageNet training code.

Pre-training

To perform supervised contrastive learning of ResNet-50 model on ImageNet with 8 gpus for 800 epochs, run:

python main_unimoco.py \
  -a resnet50 \
  --lr 0.03 \
  --batch-size 256 \
  --epochs 800 \
  --dist-url 'tcp://localhost:10001' \
  --multiprocessing-distributed --world-size 1 --rank 0 \
  --mlp \
  --moco-t 0.2 \
  --aug-plus \
  --cos \
  [your imagenet-folder with train and val folders]

By default, the script performs full-supervised contrasitve learning.

Set --supervised-list to perform semi-supervised contrastive learning with different label ratios. For exmaple, 60% labels: --supervised-list ./label_info/60percent.txt.

This script uses all the default hyper-parameters as described in the MoCo v2.

Results

ImageNet Linear classification and COCO detection 1x schedule (R50-C4) results:

model	ratios	top-1 acc.	top-5 acc.	COCO AP
UniMoCo	0%	71.1	90.1	39.0
UniMoCo	10%	72.0	90.3	39.3
UniMoCo	30%	75.1	92.5	39.6
UniMoCo	60%	76.2	93.0	39.8
UniMoCo	100%	76.4	93.1	39.6

Check more details about linear classification and detection fine-tuning on MoCo.

Models are coming soon.

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

Related tags

Overview

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

ImageNet Pre-training

Data Preparation

Pre-training

Results

License

Owner

dddzg

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021

This repository contains the code for the paper Neural RGB-D Surface Reconstruction

Source code of NeurIPS 2021 Paper ''Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration''

Graph Regularized Residual Subspace Clustering Network for hyperspectral image clustering

Julia package for contraction of tensor networks, based on the sweep line algorithm outlined in the paper General tensor network decoding of 2D Pauli codes

This repository contains Prior-RObust Bayesian Optimization (PROBO) as introduced in our paper "Accounting for Gaussian Process Imprecision in Bayesian Optimization"

Galactic and gravitational dynamics in Python

Lightweight Salient Object Detection in Optical Remote Sensing Images via Feature Correlation

Autonomous Driving on Curvy Roads without Reliance on Frenet Frame: A Cartesian-based Trajectory Planning Method

[NeurIPS 2021] Garment4D: Garment Reconstruction from Point Cloud Sequences

ServiceX Transformer that converts flat ROOT ntuples into columnwise data

[NeurIPS 2021] “Improving Contrastive Learning on Imbalanced Data via Open-World Sampling”,

This is a pytorch implementation of the NeurIPS paper GAN Memory with No Forgetting.

Rethinking of Pedestrian Attribute Recognition: A Reliable Evaluation under Zero-Shot Pedestrian Identity Setting

Deep learning-based approach to discovering Granger causality networks in multivariate time series

A sample pytorch Implementation of ACL 2021 research paper "Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction".

Neural style transfer in PyTorch.

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

Small little script to scrape, parse and check for active tor nodes. Can be used as proxies.