[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

Last update: Jul 27, 2022

Overview

TransMaS

This repository is the official pytorch implementation of the following paper:

NIPS2021 Mixed Supervised Object Detection by TransferringMask Prior and Semantic Similarity

Yan Liu^∗, Zhijie Zhang^∗, Li Niu^†, Junjie Chen, Liqing Zhang^†

MoE Key Lab of Artificial, IntelligenceDepartment of Computer Science and Engineering, Shanghai Jiao Tong University

Setup

Follow the instructions in Installation to build the projects.

Data

Follow instructions in README.old.md to setup COCO and VOC datasets folder and place the coco and voc files under folder ./datasets. Annotations for the COCO-60, and VOC datasets on Google Drive

coco60_train2017_21987.json, coco60_val2017_969.json : place under folder ./datasets/coco/annotations/
voc_2007_trainval.json, voc_2007_test.json: place under ./datasets/voc/VOC2007/

Checkpoints

We provide the model checkpoints of object detection network and MIL classifier. All checkpoint files are on Google Drive, place the files under folder ./output/coco60_to_voc/

Evaluation

The test results of Ours^*(single-scale) on VOC2007 test set in the main paper can be reproduced by executing the following commands:

python -m torch.distributed.launch --nproc_per_node=2 tools/test_net.py --config-file wsod/coco60_to_voc/mil_it0.yaml OUTPUT_DIR "output/coco60_to_voc/mil_it2" MODEL.WEIGHT "output/coco60_to_voc/mil_it2/model_final.pth" WEAK.CFG2 "output/coco60_to_voc/odn_it2/config.yml"

Resources

We have summarized the existing papers and codes on weak-shot learning in the following repository: https://github.com/bcmi/Awesome-Weak-Shot-Learning

Acknowledgements

Thanks to WSOD with Progressive Knowledge Transfer providing the base architecture, iterative training strategy, and data annotations for our project. We further transfer mask prior and semantic similarity to bridge the gap between novel categories and base categories by adding the code for Mask Generator and SimNet.

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

Related tags

Overview

TransMaS

Setup

Data

Checkpoints

Evaluation

Resources

Acknowledgements

Owner

BCMI

Official Repo for ICCV2021 Paper: Learning to Regress Bodies from Images using Differentiable Semantic Rendering

Code for paper Novel View Synthesis via Depth-guided Skip Connections

Deep Learning Models for Causal Inference

Platform-agnostic AI Framework 🔥

Wordle Env: A Daily Word Environment for Reinforcement Learning

alfred-py: A deep learning utility library for human

Code To Tune or Not To Tune? Zero-shot Models for Legal Case Entailment.

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

Awesome Weak-Shot Learning

Industrial Image Anomaly Localization Based on Gaussian Clustering of Pre-trained Feature

Flexible-Modal Face Anti-Spoofing: A Benchmark

Contrastive Multi-View Representation Learning on Graphs

GNN-based Recommendation Benchma

PyTorch Implementation of Sparse DETR

Use VITS and Opencpop to develop singing voice synthesis; Maybe it will VISinger.

A project for developing transformer-based models for clinical relation extraction

Make a Turtlebot3 follow a figure 8 trajectory and create a robot arm and make it follow a trajectory

[ WSDM '22 ] On Sampling Collaborative Filtering Datasets

A Transformer-Based Siamese Network for Change Detection

(ICCV 2021) ProHMR - Probabilistic Modeling for Human Mesh Recovery

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

Related tags

Overview

TransMaS

Setup

Data

Checkpoints

Evaluation

Resources

Acknowledgements

Owner

BCMI

Official Repo for ICCV2021 Paper: Learning to Regress Bodies from Images using Differentiable Semantic Rendering

Code for paper Novel View Synthesis via Depth-guided Skip Connections

Deep Learning Models for Causal Inference

Platform-agnostic AI Framework 🔥

Wordle Env: A Daily Word Environment for Reinforcement Learning

alfred-py: A deep learning utility library for **human**

Code To Tune or Not To Tune? Zero-shot Models for Legal Case Entailment.

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

Awesome Weak-Shot Learning

Industrial Image Anomaly Localization Based on Gaussian Clustering of Pre-trained Feature

Flexible-Modal Face Anti-Spoofing: A Benchmark

Contrastive Multi-View Representation Learning on Graphs

GNN-based Recommendation Benchma

PyTorch Implementation of Sparse DETR

Use VITS and Opencpop to develop singing voice synthesis; Maybe it will VISinger.

A project for developing transformer-based models for clinical relation extraction

Make a Turtlebot3 follow a figure 8 trajectory and create a robot arm and make it follow a trajectory

[ WSDM '22 ] On Sampling Collaborative Filtering Datasets

A Transformer-Based Siamese Network for Change Detection

(ICCV 2021) ProHMR - Probabilistic Modeling for Human Mesh Recovery

alfred-py: A deep learning utility library for human