Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

Last update: Jan 06, 2023

Overview

Active Learning for Deep Object Detection via Probabilistic Modeling

This repository is the official PyTorch implementation of Active Learning for Deep Object Detection via Probabilistic Modeling, ICCV 2021.

The proposed method is implemented based on the SSD pytorch.

Our approach relies on mixture density networks to estimate, in a single forward pass of a single model, both localization and classification uncertainties, and leverages them in the scoring function for active learning.

Our method performs on par with multiple model-based methods (e.g., ensembles and MC-Dropout). Therefore, our method provides the best trade-off between accuracy and computational cost.

License

To view a NVIDIA Source Code License for this work, visit https://github.com/NVlabs/AL-MDN/blob/main/LICENSE

Requirements

For setup and data preparation, please refer to the README in SSD pytorch.

Code was tested in virtual environment with Python 3+ and Pytorch 1.1.

Training

Make directory mkdir weights and cd weights.
Download the FC-reduced VGG-16 backbone weight in the weights directory, and cd ...
If necessary, change the VOC_ROOT in data/voc0712.py or COCO_ROOT in data/coco.py.
Please refer to data/config.py for configuration.
Run the training code:

# Supervised learning
CUDA_VISIBLE_DEVICES=<GPU_ID> python train_ssd_gmm_supervised_learning.py

# Active learning
CUDA_VISIBLE_DEVICES=<GPU_ID> python train_ssd_gmm_active_learining.py

Evaluation

To evaluate on MS-COCO, change the COCO_ROOT_EVAL in data/coco_eval.py.
Run the evaluation code:

# Evaluation on PASCAL VOC
python eval_voc.py --trained_model <trained weight path>

# Evaluation on MS-COCO
python eval_coco.py --trained_model <trained weight path>

Visualization

Run the visualization code:

python demo.py --trained_model <trained weight path>

Citation

@InProceedings{Choi_2021_ICCV,
    author    = {Choi, Jiwoong and Elezi, Ismail and Lee, Hyuk-Jae and Farabet, Clement and Alvarez, Jose M.},
    title     = {Active Learning for Deep Object Detection via Probabilistic Modeling},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {10264-10273}
}

Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

Related tags

Overview

Active Learning for Deep Object Detection via Probabilistic Modeling

License

Requirements

Training

Evaluation

Visualization

Citation

Owner

NVIDIA Research Projects

SafePicking: Learning Safe Object Extraction via Object-Level Mapping, ICRA 2022

VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning

Second-Order Neural ODE Optimizer, NeurIPS 2021 spotlight

Caffe: a fast open framework for deep learning.

IDA file loader for UF2, created for the DEFCON 29 hardware badge

Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track (SIGIR 2021 Full Paper).

Implementation of "JOKR: Joint Keypoint Representation for Unsupervised Cross-Domain Motion Retargeting"

New AidForBlind - Various Libraries used like OpenCV and other mentioned in Requirements.txt

EMNLP 2021 paper The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers.

PyTorch implementation of saliency map-aided GAN for Auto-demosaic+denosing

TorchPQ is a python library for Approximate Nearest Neighbor Search (ANNS) and Maximum Inner Product Search (MIPS) on GPU using Product Quantization (PQ) algorithm.

SSD-based Object Detection in PyTorch

DiscoNet: Learning Distilled Collaboration Graph for Multi-Agent Perception [NeurIPS 2021]

Semantic Segmentation in Pytorch

Over-the-Air Ensemble Inference with Model Privacy

PyTorch CZSL framework containing GQA, the open-world setting, and the CGE and CompCos methods.

Fashion Entity Classification

GUPNet - Geometry Uncertainty Projection Network for Monocular 3D Object Detection

Calibrated Hyperspectral Image Reconstruction via Graph-based Self-Tuning Network.

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)