PyTorch code for ICLR 2021 paper Unbiased Teacher for Semi-Supervised Object Detection

Last update: Dec 28, 2022

Unbiased Teacher for Semi-Supervised Object Detection

This is the PyTorch implementation of our paper:
Unbiased Teacher for Semi-Supervised Object Detection
Yen-Cheng Liu, Chih-Yao Ma, Zijian He, Chia-Wen Kuo, Kan Chen, Peizhao Zhang, Bichen Wu, Zsolt Kira, Peter Vajda
International Conference on Learning Representations (ICLR), 2021

[arXiv] [OpenReview] [Project]

Installation

Prerequisites

Linux or macOS with Python ≥ 3.6
PyTorch ≥ 1.5 and torchvision that matches the PyTorch installation.

Install PyTorch in Conda env

# create conda env
conda create -n detectron2 python=3.6
# activate the environment
conda activate detectron2
# install PyTorch >= 1.5 with GPU support
conda install pytorch torchvision -c pytorch
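
After installing, it can help to confirm that the environment satisfies the PyTorch ≥ 1.5 prerequisite. The following is a minimal sketch, not part of this repository: the helper names `parse_version` and `meets_minimum` are hypothetical, and the check compares only the major and minor version numbers.

```python
import re


def parse_version(version):
    """Return the leading (major, minor) pair of a version string such as '1.7.1+cu110'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError("unrecognized version string: {!r}".format(version))
    return int(match.group(1)), int(match.group(2))


def meets_minimum(version, minimum=(1, 5)):
    """True if the version is at least the minimum (default: the README's PyTorch >= 1.5)."""
    return parse_version(version) >= minimum


if __name__ == "__main__":
    try:
        import torch
        import torchvision
    except ImportError as exc:
        print("PyTorch is not importable in this environment:", exc)
    else:
        status = "ok" if meets_minimum(torch.__version__) else "older than 1.5"
        print("torch", torch.__version__, status)
        print("torchvision", torchvision.__version__)
        print("CUDA available:", torch.cuda.is_available())
```

Run it inside the activated `detectron2` environment. Whether torchvision matches the PyTorch build is not checked here; an import error from torchvision is usually the first sign of a mismatch.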

Build Detectron2 from Source

Follow Detectron2's INSTALL.md to build and install Detectron2 from source.

Dataset download

Download COCO dataset

# download images
wget http://images.cocodataset.org/zips/train2017.zip
wget http://images.cocodataset.org/zips/val2017.zip

# download annotations
wget http://images.cocodataset.org/annotations/annotations_trainval2017.zip

Organize the dataset as follows:

unbiased_teacher/
└── datasets/
    └── coco/
        ├── train2017/
        ├── val2017/
        └── annotations/
            ├── instances_train2017.json
            └── instances_val2017.json
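
The layout above can be checked before launching a long training job. This is a minimal sketch under stated assumptions: the helper `missing_coco_paths` is hypothetical (not part of this repository), and the default root `datasets/coco` assumes the script is run from the `unbiased_teacher/` directory.

```python
from pathlib import Path

# Entries taken from the directory tree above, relative to datasets/coco/.
EXPECTED_PATHS = [
    "train2017",
    "val2017",
    "annotations/instances_train2017.json",
    "annotations/instances_val2017.json",
]


def missing_coco_paths(coco_root):
    """Return the expected entries that do not exist under coco_root."""
    root = Path(coco_root)
    return [rel for rel in EXPECTED_PATHS if not (root / rel).exists()]


if __name__ == "__main__":
    missing = missing_coco_paths("datasets/coco")
    if missing:
        print("Missing under datasets/coco:")
        for rel in missing:
            print("  " + rel)
    else:
        print("COCO layout looks complete.")
```

The check only tests that the paths exist; it does not validate the image files or the annotation contents.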

Training

Train the Unbiased Teacher under 1% COCO-supervision

python train_net.py \
      --num-gpus 8 \
      --config configs/coco_supervision/faster_rcnn_R_50_FPN_sup1_run1.yaml \
       SOLVER.IMG_PER_BATCH_LABEL 16 SOLVER.IMG_PER_BATCH_UNLABEL 16

Train the Unbiased Teacher under 2% COCO-supervision

python train_net.py \
      --num-gpus 8 \
      --config configs/coco_supervision/faster_rcnn_R_50_FPN_sup2_run1.yaml \
       SOLVER.IMG_PER_BATCH_LABEL 16 SOLVER.IMG_PER_BATCH_UNLABEL 16

Train the Unbiased Teacher under 5% COCO-supervision

python train_net.py \
      --num-gpus 8 \
      --config configs/coco_supervision/faster_rcnn_R_50_FPN_sup5_run1.yaml \
       SOLVER.IMG_PER_BATCH_LABEL 16 SOLVER.IMG_PER_BATCH_UNLABEL 16

Train the Unbiased Teacher under 10% COCO-supervision

python train_net.py \
      --num-gpus 8 \
      --config configs/coco_supervision/faster_rcnn_R_50_FPN_sup10_run1.yaml \
       SOLVER.IMG_PER_BATCH_LABEL 16 SOLVER.IMG_PER_BATCH_UNLABEL 16
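
The four training commands above differ only in the supervision percentage embedded in the config filename. As a convenience, the sketch below assembles the same command line for a given percentage. The function `build_train_command` and its parameters are hypothetical helpers, not part of this repository; the flags and config paths are copied from the commands above.

```python
import shlex

# Supervision percentages that have a command listed in this README.
SUPPORTED_PERCENTS = (1, 2, 5, 10)


def build_train_command(percent, num_gpus=8, labeled=16, unlabeled=16):
    """Return the argument list for training at the given COCO-supervision percentage."""
    if percent not in SUPPORTED_PERCENTS:
        raise ValueError("no config listed for {}% supervision".format(percent))
    config = "configs/coco_supervision/faster_rcnn_R_50_FPN_sup{}_run1.yaml".format(percent)
    return [
        "python", "train_net.py",
        "--num-gpus", str(num_gpus),
        "--config", config,
        "SOLVER.IMG_PER_BATCH_LABEL", str(labeled),
        "SOLVER.IMG_PER_BATCH_UNLABEL", str(unlabeled),
    ]


if __name__ == "__main__":
    for percent in SUPPORTED_PERCENTS:
        # Quote each argument so the printed line can be pasted into a shell.
        print(" ".join(shlex.quote(arg) for arg in build_train_command(percent)))
```

The returned list can be passed directly to `subprocess.run` from the repository root if launching from Python is preferred over copying the printed line.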

Resume the training

python train_net.py \
      --resume \
      --num-gpus 8 \
      --config configs/coco_supervision/faster_rcnn_R_50_FPN_sup10_run1.yaml \
       SOLVER.IMG_PER_BATCH_LABEL 16 SOLVER.IMG_PER_BATCH_UNLABEL 16 MODEL.WEIGHTS <your weight>.pth

Evaluation

python train_net.py \
      --eval-only \
      --num-gpus 8 \
      --config configs/coco_supervision/faster_rcnn_R_50_FPN_sup10_run1.yaml \
       SOLVER.IMG_PER_BATCH_LABEL 16 SOLVER.IMG_PER_BATCH_UNLABEL 16 MODEL.WEIGHTS <your weight>.pth

Model Zoo

Coming soon

FAQ

Q: Why can't I reproduce the results presented in the paper when using a smaller batch size and fewer GPUs?

A: The results presented in the paper were obtained with 32 labeled images + 32 unlabeled images per batch, and a smaller batch size leads to lower accuracy. For example, in the 1% COCO-supervision setting, the model trained with 16 labeled images + 16 unlabeled images achieves 19.9 AP, compared with 20.75 AP at the full batch size, as shown in the following table.

Experiment GPUs	Batch size per node	Batch size	AP
8 GPUs/node; 4 nodes	8 labeled imgs + 8 unlabeled imgs	32 labeled imgs + 32 unlabeled imgs	20.75
8 GPUs/node; 1 node	16 labeled imgs + 16 unlabeled imgs	16 labeled imgs + 16 unlabeled imgs	19.9
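
The relationship between the "Batch size per node" and "Batch size" columns is a multiplication by the number of nodes. The sketch below reproduces that arithmetic for the two rows of the table; the function name `total_batch` is hypothetical and the code does not read or set any configuration option of this repository.

```python
def total_batch(per_node_labeled, per_node_unlabeled, num_nodes):
    """Return (labeled, unlabeled) images per batch summed over all nodes."""
    return per_node_labeled * num_nodes, per_node_unlabeled * num_nodes


if __name__ == "__main__":
    # Rows of the table above: (labeled per node, unlabeled per node, nodes).
    for labeled, unlabeled, nodes in [(8, 8, 4), (16, 16, 1)]:
        total_labeled, total_unlabeled = total_batch(labeled, unlabeled, nodes)
        print("{} node(s): {} labeled + {} unlabeled imgs per batch".format(
            nodes, total_labeled, total_unlabeled))
```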

Citing Unbiased Teacher

If you use Unbiased Teacher in your research or wish to refer to the results published in the paper, please use the following BibTeX entry.

@inproceedings{liu2021unbiased,
    title={Unbiased Teacher for Semi-Supervised Object Detection},
    author={Liu, Yen-Cheng and Ma, Chih-Yao and He, Zijian and Kuo, Chia-Wen and Chen, Kan and Zhang, Peizhao and Wu, Bichen and Kira, Zsolt and Vajda, Peter},
    booktitle={Proceedings of the International Conference on Learning Representations (ICLR)},
    year={2021},
}

Also, if you use Detectron2 in your research, please use the following BibTeX entry.

@misc{wu2019detectron2,
  author =       {Yuxin Wu and Alexander Kirillov and Francisco Massa and
                  Wan-Yen Lo and Ross Girshick},
  title =        {Detectron2},
  howpublished = {\url{https://github.com/facebookresearch/detectron2}},
  year =         {2019}
}

License

This project is licensed under the MIT License, as found in the LICENSE file.

Owner

Facebook Research