Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

Last update: Jan 06, 2023

Overview

Active Learning for Deep Object Detection via Probabilistic Modeling

This repository is the official PyTorch implementation of Active Learning for Deep Object Detection via Probabilistic Modeling, ICCV 2021.

The proposed method is implemented based on the SSD pytorch.

Our approach relies on mixture density networks to estimate, in a single forward pass of a single model, both localization and classification uncertainties, and leverages them in the scoring function for active learning.

Our method performs on par with multiple model-based methods (e.g., ensembles and MC-Dropout). Therefore, our method provides the best trade-off between accuracy and computational cost.

License

To view a NVIDIA Source Code License for this work, visit https://github.com/NVlabs/AL-MDN/blob/main/LICENSE

Requirements

For setup and data preparation, please refer to the README in SSD pytorch.

Code was tested in virtual environment with Python 3+ and Pytorch 1.1.

Training

Make directory mkdir weights and cd weights.
Download the FC-reduced VGG-16 backbone weight in the weights directory, and cd ...
If necessary, change the VOC_ROOT in data/voc0712.py or COCO_ROOT in data/coco.py.
Please refer to data/config.py for configuration.
Run the training code:

# Supervised learning
CUDA_VISIBLE_DEVICES=<GPU_ID> python train_ssd_gmm_supervised_learning.py

# Active learning
CUDA_VISIBLE_DEVICES=<GPU_ID> python train_ssd_gmm_active_learining.py

Evaluation

To evaluate on MS-COCO, change the COCO_ROOT_EVAL in data/coco_eval.py.
Run the evaluation code:

# Evaluation on PASCAL VOC
python eval_voc.py --trained_model <trained weight path>

# Evaluation on MS-COCO
python eval_coco.py --trained_model <trained weight path>

Visualization

Run the visualization code:

python demo.py --trained_model <trained weight path>

Citation

@InProceedings{Choi_2021_ICCV,
    author    = {Choi, Jiwoong and Elezi, Ismail and Lee, Hyuk-Jae and Farabet, Clement and Alvarez, Jose M.},
    title     = {Active Learning for Deep Object Detection via Probabilistic Modeling},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {10264-10273}
}

Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

Related tags

Overview

Active Learning for Deep Object Detection via Probabilistic Modeling

License

Requirements

Training

Evaluation

Visualization

Citation

Owner

NVIDIA Research Projects

A python package to perform same transformation to coco-annotation as performed on the image.

Pytorch implementation of Cut-Thumbnail in the paper Cut-Thumbnail:A Novel Data Augmentation for Convolutional Neural Network.

Powerful and efficient Computer Vision Annotation Tool (CVAT)

Some useful blender add-ons for SMPL skeleton's poses and global translation.

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Code of the paper "Multi-Task Meta-Learning Modification with Stochastic Approximation".

The PASS dataset: pretrained models and how to get the data - PASS: Pictures without humAns for Self-Supervised Pretraining

A Python library for generating new text from existing samples.

FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.

根据midi文件演奏“风物之诗琴”的脚本 "Windsong Lyre" auto play

This program uses trial auth token of Azure Cognitive Services to do speech synthesis for you.

Self-Adaptable Point Processes with Nonparametric Time Decays

Multi-task head pose estimation in-the-wild

This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild"

A C implementation for creating 2D voronoi diagrams

Self-Supervised Learning with Kernel Dependence Maximization

Supplemental learning materials for "Fourier Feature Networks and Neural Volume Rendering"

Experiments and examples converting Transformers to ONNX

A Python package for performing pore network modeling of porous media

An index of recommendation algorithms that are based on Graph Neural Networks.