Few-Shot Object Detection via Association and DIscrimination

Last update: Dec 18, 2022

Related tags

Overview

Few-Shot Object Detection via Association and DIscrimination

Code release of our NeurIPS 2021 paper: Few-Shot Object Detection via Association and DIscrimination.

Bibtex

@inproceedings{cao2021few,
  title={Few-Shot Object Detection via Association and DIscrimination},
  author={Cao, Yuhang and Wang, Jiaqi and Jin, Ying and Wu, Tong and Chen, Kai and Liu, Ziwei and Lin, Dahua},
  booktitle={Thirty-Fifth Conference on Neural Information Processing Systems},
  year={2021}
}

Arxiv: https://arxiv.org/abs/2111.11656

Install dependencies

Create a new environment: conda create -n fadi python=3.8 -y
Active the newly created environment: conda activate fadi
Install PyTorch and torchvision: conda install pytorch=1.7 torchvision cudatoolkit=10.2 -c pytorch -y
Install MMDetection: pip install mmdet==2.11.0
Install MMCV: pip install mmcv==1.2.5
Install MMCV-Full: pip install mmcv-full==1.2.5 -f https://download.openmmlab.com/mmcv/dist/cu102/torch1.7.0/index.html

Note:

Only tested on MMDet==2.11.0, MMCV==1.2.5, it may not be consistent with other versions.
The above instructions use CUDA 10.2, make sure you install the correct PyTorch, Torchvision and MMCV-Full that are consistent with your CUDA version.

Prepare dataset

We follow exact the same split with TFA, please download the dataset and split files as follows:

Download PASCAL VOC
Download split files

Create a directory data in the root directory, and the expected structure for data directory:

data/
    VOCdevkit
    few_shot_voc_split

Training & Testing

Base Training

FADI share the same base training stage with TFA, we directly convert the corresponding checkpoints from TFA in Detectron2 format to MMDetection format, please download the base training checkpoints following the table.

Name	Split	AP50	download
Base Model	1	80.8	model \| surgery
Base Model	2	81.9	model \| surgery
Base Model	3	82.0	model \| surgery

Create a directory models in the root directory, and the expected structure for models directory:

models/
    voc_split1_base.pth
    voc_split1_base_surgery.pth
    voc_split2_base.pth
    voc_split2_base_surgery.pth
    voc_split3_base.pth
    voc_split3_base_surgery.pth

Few-Shot Fine-tuning

FADI divides the few-shot fine-tuning stage into two steps, ie, association and discrimination,

Suppose we want to train a model for Pascal VOC split1, shot1 with 8 GPUs

1. Step 1: Association.

Getting the assigning scheme of the split:

python tools/associate.py 1

Aligning the feature distribution of the associated base and novel classes:

./tools/dist_train.sh configs/voc_split1/fadi_split1_shot1_association.py 8

2. Step 2: Discrimination

Building a discriminate feature space for novel classes with disentangling and set-specialized margin loss:

./tools/dist_train.sh configs/voc_split1/fadi_split1_shot1_discrimination.py 8

Holistically Training:

We also provide you a script tools/fadi_finetune.sh to holistically train a model for a specific split/shot by running:

./tools/fadi_finetune.sh 1 1

Evaluation

To evaluate the trained models, run

./tools/dist_test.sh configs/voc_split1/fadi_split1_shot1_discrimination.py [checkpoint] 8 --eval mAP --out res.pkl

Model Zoo

Pascal VOC split 1

Shot	nAP50	download
1	50.6	association \| discrimination
2	54.8	association \| discrimination
3	54.1	association \| discrimination
5	59.4	association \| discrimination
10	63.5	association \| discrimination

Pascal VOC split 2

Shot	nAP50	download
1	30.5	association \| discrimination
2	35.1	association \| discrimination
3	40.3	association \| discrimination
5	42.9	association \| discrimination
10	48.3	association \| discrimination

Pascal VOC split 3

Shot	nAP50	download
1	45.7	association \| discrimination
2	49.4	association \| discrimination
3	49.4	association \| discrimination
5	55.1	association \| discrimination
10	59.3	association \| discrimination

Few-Shot Object Detection via Association and DIscrimination

Related tags

Overview

Few-Shot Object Detection via Association and DIscrimination

Bibtex

Install dependencies

Prepare dataset

Training & Testing

Base Training

Few-Shot Fine-tuning

1. Step 1: Association.

2. Step 2: Discrimination

Holistically Training:

Evaluation

Model Zoo

Pascal VOC split 1

Pascal VOC split 2

Pascal VOC split 3

Owner

Cao Yuhang

PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World [ACL 2021]

基于tensorflow 2.x的图片识别工具集

Detecting Blurred Ground-based Sky/Cloud Images

CellRank's reproducibility repository.

Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper

Reviving Iterative Training with Mask Guidance for Interactive Segmentation

Auxiliary Raw Net (ARawNet) is a ASVSpoof detection model taking both raw waveform and handcrafted features as inputs, to balance the trade-off between performance and model complexity.

Automatic labeling, conversion of different data set formats, sample size statistics, model cascade

Hierarchical Attentive Recurrent Tracking

Reinforcement learning models in ViZDoom environment

Source code for PairNorm (ICLR 2020)

Deep deconfounded recommender (Deep-Deconf) for paper "Deep causal reasoning for recommendations"

A multi-scale unsupervised learning for deformable image registration

La source de mon module 'pyfade' disponible sur Pypi.

Python implementation of O-OFDMNet, a deep learning-based optical OFDM system,

quantize aware training package for NCNN on pytorch

Human Pose estimation with TensorFlow framework

Clean Machine Learning, a Coding Kata

Alfred-Restore-Iterm-Arrangement - An Alfred workflow to restore iTerm2 window Arrangements

[NeurIPS 2021] Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data