[CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning

Last update: Dec 28, 2022

Overview

Crafting Better Contrastive Views for Siamese Representation Learning (CVPR 2022 Oral)

2022-03-29: The paper was selected as a CVPR 2022 Oral paper!

2022-03-03: The paper was accepted by CVPR 2022!

This is the official PyTorch implementation of the ContrastiveCrop paper:

@article{peng2022crafting,
  title={Crafting Better Contrastive Views for Siamese Representation Learning},
  author={Peng, Xiangyu and Wang, Kai and Zhu, Zheng and You, Yang},
  journal={arXiv preprint arXiv:2202.03278},
  year={2022}
}

This repo includes PyTorch implementation of SimCLR, MoCo, BYOL and SimSiam, as well as their DDP training code.

Preparation

Create a python enviroment with pytorch >= 1.8.1.
pip install -r requirements.txt
Modify dataset root in the config file.

Pre-train

# MoCo, CIFAR-10, CCrop
python DDP_moco_ccrop.py configs/small/cifar10/moco_ccrop.py

# SimSiam, CIFAR-100, CCrop
python DDP_simsiam_ccrop.py configs/small/cifar100/simsiam_ccrop.py

# MoCo V2, IN-200, CCrop
python DDP_moco_ccrop.py configs/IN200/mocov2_ccrop.py

# MoCo V2, IN-1K, CCrop
python DDP_moco_ccrop.py configs/IN1K/mocov2_ccrop.py

We also recommend trying an even simpler version of ContrastiveCrop, named SimCCrop, that simply fixes a box at the center of the image with half height & width of that image. SimCCrop even does not require localization and thus adds NO extra training overhead. It should work well on almost 'object-centric' datasets.

# MoCo, SimCCrop
python DDP_moco_ccrop.py configs/small/cifar10/moco_simccrop.py
python DDP_moco_ccrop.py configs/small/cifar100/moco_simccrop.py

Linear Evaluation

# CIFAR-10
python DDP_linear.py configs/linear/cifar10_res18.py --load ./checkpoints/small/cifar10/moco_ccrop/last.pth

# CIFAR-100
python DDP_linear.py configs/linear/cifar100_res18.py --load ./checkpoints/small/cifar100/simsiam_ccrop/last.pth

# IN-200 
python DDP_linear.py configs/linear/IN200_res50.py --load ./checkpoints/IN200/mocov2_ccrop/last.pth

# IN-1K
python DDP_linear.py configs/linear/IN1K_res50.py --load ./checkpoints/IN1K/mocov2_ccrop/last.pth

More models and datasets coming soon.

[CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning

Related tags

Overview

Crafting Better Contrastive Views for Siamese Representation Learning (CVPR 2022 Oral)

Preparation

Pre-train

Linear Evaluation

Owner

MAg: a simple learning-based patient-level aggregation method for detecting microsatellite instability from whole-slide images

Use CLIP to represent video for Retrieval Task

Bunch of different tools which helps visualizing and annotating images for semantic/instance segmentation tasks

Improving Factual Completeness and Consistency of Image-to-text Radiology Report Generation

Auto-updating data to assist in investment to NEPSE

[Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 2021

Command-line tool for downloading and extending the RedCaps dataset.

Face recognition with trained classifiers for detecting objects using OpenCV

Implementation for Homogeneous Unbalanced Regularized Optimal Transport

PyTorch code for training MM-DistillNet for multimodal knowledge distillation

As a part of the HAKE project, includes the reproduced SOTA models and the corresponding HAKE-enhanced versions (CVPR2020).

SPT_LSA_ViT - Implementation for Visual Transformer for Small-size Datasets

The first dataset of composite images with rationality score indicating whether the object placement in a composite image is reasonable.

Self-Supervised Learning with Kernel Dependence Maximization

Codes for NeurIPS 2021 paper "Adversarial Neuron Pruning Purifies Backdoored Deep Models"

Powerful unsupervised domain adaptation method for dense retrieval.

Unsupervised captioning - Code for Unsupervised Image Captioning

Serverless proxy for Spark cluster

Unofficial PyTorch Implementation of "DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features"

[CVPR 2021] Unsupervised Degradation Representation Learning for Blind Super-Resolution