Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)

Last update: Dec 24, 2022

Related tags

Deep Learning GroupRCNN

Overview

Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)

By Shilong Zhang*, Zhuoran Yu*, Liyang Liu*, Xinjiang Wang, Aojun Zhou, Kai Chen

Abstract:

We study the problem of weakly semi-supervised object detection with points (WSSOD-P), where the training data is combined by a small set of fully annotated images with bounding boxes and a large set of weakly-labeled images with only a single point annotated for each instance. The core of this task is to train a point-to-box regressor on well labeled images that can be used to predict credible bounding boxes for each point annotation. Group R-CNN significantly outperforms the prior method Point DETR by 3.9 mAP with 5% well-labeled images, which is the most challenging scenario.

Install

The project has been fully tested under MMDetection V2.22.0 and MMCV V1.4.6, other versions may not be compatible. so you have to install mmcv and mmdetection firstly. You can refer to Installation of MMCV & Installation of MMDetection

Prepare the dataset

mmdetection
├── data
│   ├── coco
│   │   ├── annotations
│   │   │      ├──instances_train2017.json
│   │   │      ├──instances_val2017.json
│   │   ├── train2017
│   │   ├── val2017

You can generate point annotations with the command. It may take you several minutes for instances_train2017.json

python tools/generate_anns.py /data/coco/annotations/instances_train2017.json
python tools/generate_anns.py /data/coco/annotations/instances_val2017.json

Then you can find a point_ann directory, all annotations in the directory contain point annotations. Then you should replace the original annotations in data/coco/annotations with generated annotations.

NOTES

Here, we sample a point from the mask for all instances. But we split the images into two divisions in :class:PointCocoDataset.

Images with only bbox annotations(well-labeled images): Only be used in training phase. We sample a point from its bbox as point annotations each iteration.
Images with only point annotations(weakly-labeled sets): Only be used to generate bbox annotations from point annotations with trained point to bbox regressor.

Train and Test

8 is the number of gpus.

For slurm

Train

GPUS=8 sh tools/slurm_train.sh partition_name  job_name projects/configs/10_coco/group_rcnn_24e_10_percent_coco_detr_augmentation.py  ./exp/group_rcnn

Evaluate the quality of generated bbox annotations on val dataset with pre-defined point annotations.

GPUS=8 sh tools/slurm_test.sh partition_name  job_name projects/configs/10_coco/group_rcnn_24e_10_percent_coco_detr_augmentation.py ./exp/group_rcnn/latest.pth --eval bbox

Run the inference process on weakly-labeled images with point annotations to get bbox annotations.

GPUS=8 sh tools/slurm_test.sh partition_name  job_name  projects/configs/10_coco/group_rcnn_50e_10_percent_coco_detr_augmentation.py   path_to_checkpoint  --format-only --options  "jsonfile_prefix=./generated"

For Pytorch distributed

Train

sh tools/dist_train.sh projects/configs/10_coco/group_rcnn_24e_10_percent_coco_detr_augmentation.py 8 --work-dir ./exp/group_rcnn

Evaluate the quality of generated bbox annotations on val dataset with pre-defined point annotations.

sh tools/dist_test.sh  projects/configs/10_coco/group_rcnn_24e_10_percent_coco_detr_augmentation.py  path_to_checkpoint 8 --eval bbox

Run the inference process on weakly-labeled images with point annotations to get bbox annotations.

sh tools/dist_test.sh  projects/configs/10_coco/group_rcnn_50e_10_percent_coco_detr_augmentation.py   path_to_checkpoint 8 --format-only --options  "jsonfile_prefix=./data/coco/annotations/generated"

Then you can train the student model focs.

sh tools/dist_train.sh projects/configs/10_coco/01_student_fcos.py 8 --work-dir ./exp/01_student_fcos

Results & Checkpoints

We find that the performance of teacher is unstable under 24e setting and may fluctuate by about 0.2 mAP. We report the average.

Model	Backbone	Lr schd	Augmentation	box AP	Config	Model	log	Generated Annotations
Teacher(Group R-CNN)	R-50-FPN	24e	DETR Aug	39.2	config	ckpt	log	-
Teacher(Group R-CNN)	R-50-FPN	50e	DETR Aug	39.9	config	ckpt	log	generated.bbox.json
Student(FCOS)	R-50-FPN	12e	Normal 1x Aug	33.1	config	ckpt	log	-

Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)

Related tags

Overview

Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)

Abstract:

Install

Prepare the dataset

NOTES

Train and Test

For slurm

For Pytorch distributed

Results & Checkpoints

Owner

Shilong Zhang

PyTorch framework, for reproducing experiments from the paper Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks

The code for our paper "AutoSF: Searching Scoring Functions for Knowledge Graph Embedding"

transfer attack; adversarial examples; black-box attack; unrestricted Adversarial Attacks on ImageNet; CVPR2021 天池黑盒竞赛

The official implementation of Theme Transformer

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI

Source code for CVPR 2021 paper "Riggable 3D Face Reconstruction via In-Network Optimization"

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

Using knowledge-informed machine learning on the PRONOSTIA (FEMTO) and IMS bearing data sets. Predict remaining-useful-life (RUL).

Codes for the paper Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing

Classic Papers for Beginners and Impact Scope for Authors.

WaveFake: A Data Set to Facilitate Audio DeepFake Detection

A little Python application to auto tag your photos with the power of machine learning.

Official Code For TDEER: An Efficient Translating Decoding Schema for Joint Extraction of Entities and Relations (EMNLP2021)

Random Forests for Regression with Missing Entries

Benchmark for the generalization of 3D machine learning models across different remeshing/samplings of a surface.

[ICCV2021] Official code for "Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition"

A small library of 3D related utilities used in my research.

Official code for the paper "Self-Supervised Prototypical Transfer Learning for Few-Shot Classification"

LF-YOLO (Lighter and Faster YOLO) is used to detect defect of X-ray weld image.

Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation （ICCV2021）

Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)

Related tags

Overview

Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)

Abstract:

Install

Prepare the dataset

NOTES

Train and Test

For slurm

For Pytorch distributed

Results & Checkpoints

Owner

Shilong Zhang

PyTorch framework, for reproducing experiments from the paper Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks

The code for our paper "AutoSF: Searching Scoring Functions for Knowledge Graph Embedding"

transfer attack; adversarial examples; black-box attack; unrestricted Adversarial Attacks on ImageNet; CVPR2021 天池黑盒竞赛

The official implementation of Theme Transformer

This is the pytorch implementation for the paper: *Learning Accurate Performance Predictors for Ultrafast Automated Model Compression*, which is in submission to TPAMI

Source code for CVPR 2021 paper "Riggable 3D Face Reconstruction via In-Network Optimization"

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

Using knowledge-informed machine learning on the PRONOSTIA (FEMTO) and IMS bearing data sets. Predict remaining-useful-life (RUL).

Codes for the paper Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing

Classic Papers for Beginners and Impact Scope for Authors.

WaveFake: A Data Set to Facilitate Audio DeepFake Detection

A little Python application to auto tag your photos with the power of machine learning.

Official Code For TDEER: An Efficient Translating Decoding Schema for Joint Extraction of Entities and Relations (EMNLP2021)

Random Forests for Regression with Missing Entries

Benchmark for the generalization of 3D machine learning models across different remeshing/samplings of a surface.

[ICCV2021] Official code for "Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition"

A small library of 3D related utilities used in my research.

Official code for the paper "Self-Supervised Prototypical Transfer Learning for Few-Shot Classification"

LF-YOLO (Lighter and Faster YOLO) is used to detect defect of X-ray weld image.

Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation （ICCV2021）

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI