PyTorch Implementation of Region Similarity Representation Learning (ReSim)

Last update: Jan 03, 2023

Related tags

Overview

ReSim

This repository provides the PyTorch implementation of Region Similarity Representation Learning (ReSim) described in this paper:

@Article{xiao2021region,
  author  = {Tete Xiao and Colorado J Reed and Xiaolong Wang and Kurt Keutzer and Trevor Darrell},
  title   = {Region Similarity Representation Learning},
  journal = {arXiv preprint arXiv:2103.12902},
  year    = {2021},
}

tldr; ReSim maintains spatial relationships in the convolutional feature maps when performing instance contrastive pre-training, which is useful for region-related tasks such as object detection, segmentation, and dense pose estimation.

Installation

Assuming a conda environment:

conda create --name resim python=3.7
conda activate resim

# NOTE: if you are not using CUDA 10.2, you need to change the 10.2 in this command appropriately. 
# Code tested with torch 1.6 and 1.7
# (check CUDA version with e.g. `cat /usr/local/cuda/version.txt`)
conda install pytorch==1.6.0 torchvision==0.7.0 cudatoolkit=10.2 -c pytorch

Pre-training

This codebase is based on the original MoCo codebase -- see this README for more details.

To pre-train for 200 epochs using the ReSim-FPN implementation as described in the paper:

python main_moco.py -a resnet50 --lr 0.03 --batch-size 256 \
       --dist-url tcp://localhost:10005 --multiprocessing-distributed --world-size 1 --rank 0 \
       --mlp --moco-t 0.2 --aug-plus --cos --epochs 200 \
       /location/of/imagenet/data/folder

ResNet-50 Pre-trained Models

Checkpoint	Pre-train Epochs	COCO AP @2x	MoCo Checkpoint	Detectron Backbone
ReSim-FPN	400	41.9	Download	Download
ReSim-FPN	200	41.4	Download	Download
ReSim-C4	200	41.1	Download	Download

Detection

See these instructions for more details, but in brief:

# first install detectron2
# then place COCO-2017 dataset detection/datasets/coco

cd detection
python convert-pretrain-to-detectron2.py ../resim_fpn_checkpoint_latest.pth.tar detectron_resim_fpn_checkpoint_latest.pth.tar
python train_net.py --dist-url 'tcp://127.0.0.1:17654' --config-file configs/coco_R_50_FPN_2x_moco.yaml --num-gpus 8 MODEL.WEIGHTS detectron_resim_fpn_checkpoint_latest.pth.tar TEST.EVAL_PERIOD 180000 OUTPUT_DIR results/coco2x-resim-fpn SOLVER.CHECKPOINT_PERIOD 180000

License

This project is under the CC-BY-NC 4.0 license. See LICENSE.

PyTorch Implementation of Region Similarity Representation Learning (ReSim)

Related tags

Overview

ReSim

Installation

Pre-training

ResNet-50 Pre-trained Models

Detection

License

Owner

Tete Xiao

TensorLight - A high-level framework for TensorFlow

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

CvT-ASSD: Convolutional vision-Transformerbased Attentive Single Shot MultiBox Detector (ICTAI 2021 CCF-C 会议)The 33rd IEEE International Conference on Tools with Artificial Intelligence

Ejemplo Algoritmo Viterbi - Example of a Viterbi algorithm applied to a hidden Markov model on DNA sequence

Toontown: Galaxy, a new Toontown game based on Disney's Toontown Online

patchmatch和patchmatchstereo算法的python实现

A GOOD REPRESENTATION DETECTS NOISY LABELS

Human motion synthesis using Unity3D

Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.

[CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving

Machine learning evaluation metrics, implemented in Python, R, Haskell, and MATLAB / Octave

Fast and customizable reconnaissance workflow tool based on simple YAML based DSL.

SLAMP: Stochastic Latent Appearance and Motion Prediction

Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis

Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper

Part-Aware Data Augmentation for 3D Object Detection in Point Cloud

[CVPR 21] Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.

Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm under Mixed Illumination

This example implements the end-to-end MLOps process using Vertex AI platform and Smart Analytics technology capabilities