LBBA-boosted WSOD

Last update: Sep 19, 2022

Summary

Our code is based on ruotianluo/pytorch-faster-rcnn and WSCDN.

We sincerely thank the authors for these resources.

A newer version of our code (based on Detectron2) is a work in progress.

Hardware

We train and evaluate our method on a single RTX 2080Ti GPU (11GB). A GPU with more memory (e.g., a TITAN RTX with 24GB) is preferable.

Requirements

Python 3.6 or higher
CUDA 10.1 with cuDNN 7.6.2
PyTorch 1.2.0
numpy 1.18.1
opencv 3.4.2

We provide a full requirements file, lbba_requirements.txt, in the workspace (the lbba_boosted_wsod directory).
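
As a quick sanity check, the pinned versions above can be compared against what is actually installed. The following is a minimal sketch and not part of the repository: the helper names installed_version and find_mismatches are hypothetical, the distribution names (torch, numpy, opencv-python) are assumptions about how the packages were installed (conda may register them differently), and importlib.metadata requires Python 3.8 or newer.

```python
from importlib import metadata  # standard library, Python 3.8+

# Versions pinned in the Requirements list; distribution names are assumptions.
REQUIRED = {"torch": "1.2.0", "numpy": "1.18.1", "opencv-python": "3.4.2"}


def installed_version(dist_name):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(required, installed):
    """Return {name: (wanted, found)} for packages whose version differs.

    A found version matches when it equals the wanted one or extends it
    with a build suffix (e.g. "3.4.2.17" or "1.2.0+cu101").
    """
    mismatches = {}
    for name, wanted in required.items():
        found = installed.get(name)
        matches = found is not None and (
            found == wanted or found.startswith((wanted + ".", wanted + "+"))
        )
        if not matches:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    installed = {name: installed_version(name) for name in REQUIRED}
    for name, (wanted, found) in find_mismatches(REQUIRED, installed).items():
        print(f"{name}: expected {wanted}, found {found}")
```

The CUDA and cuDNN versions are not covered here, since they are not Python packages.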

Additional resources

Google Drive

Description

selective_search_data: precomputed proposals for VOC 2007/2012
pretrained_models/imagenet_pretrain: ImageNet-pretrained models for the WSOD backbone and the LBBA backbone
pretrained_models/pretrained_on_wsddn: pretrained WSOD networks for VOC 2007/2012; training from these checkpoints usually converges faster and more stably
models/voc07: our pretrained WSOD model
models/lbba: our pretrained LBBA checkpoints
codes_zip: our code template for the LBBA training procedure and the LBBA-boosted WSOD training procedure

Prepare

Environment

We use Anaconda to construct our experimental environment.

Install all required packages (or simply follow lbba_requirements.txt).

Essential Data

We have initialized all directories with gitkeep files.

First, cd lbba_boosted_wsod

Then download selective_search_data/* into data/selective_search_data

Download pretrained_models/imagenet_pretrain/* into data/imagenet_weights

Download pretrained_models/pretrained_on_wsddn/* into data/wsddn_weights
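
Before training, it can help to confirm that the three download targets above are populated. The following is a minimal sketch, assuming it is run from the lbba_boosted_wsod directory. The function name missing_data_dirs is hypothetical, and the placeholder file name .gitkeep is an assumption; a directory containing only that placeholder is treated as empty.

```python
from pathlib import Path

# Download targets named in the steps above, relative to lbba_boosted_wsod/.
DATA_DIRS = (
    "data/selective_search_data",
    "data/imagenet_weights",
    "data/wsddn_weights",
)


def missing_data_dirs(root=".", dirs=DATA_DIRS):
    """Return the directories that are absent or hold nothing but .gitkeep."""
    missing = []
    for rel in dirs:
        path = Path(root) / rel
        has_files = path.is_dir() and any(
            entry.name != ".gitkeep" for entry in path.iterdir()
        )
        if not has_files:
            missing.append(rel)
    return missing


if __name__ == "__main__":
    for rel in missing_data_dirs():
        print(f"nothing downloaded yet in {rel}")
```

The check only looks for the presence of files; it does not verify that each download is complete.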

Datasets

Dataset setup is the same as in rbgirshick/py-faster-rcnn.

For example, for the PASCAL VOC 2007 dataset:

Download the training, validation, and test data and the VOCdevkit

wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCdevkit_08-Jun-2007.tar

Extract all of these tars into one directory named VOCdevkit

tar xvf VOCtrainval_06-Nov-2007.tar
tar xvf VOCtest_06-Nov-2007.tar
tar xvf VOCdevkit_08-Jun-2007.tar

It should have this basic structure

$VOCdevkit/                           # development kit
$VOCdevkit/VOCcode/                   # VOC utility code
$VOCdevkit/VOC2007                    # image sets, annotations, etc.
# ... and several other directories ...

Create symlinks for the PASCAL VOC dataset

cd $FRCN_ROOT/data
ln -s $VOCdevkit VOCdevkit2007
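
The layout and symlink steps above can also be verified programmatically. The following is a minimal sketch, assuming the symlink lives at data/VOCdevkit2007 as created above. The function name check_voc_link is hypothetical, and it inspects only the two subdirectories listed in the basic structure.

```python
from pathlib import Path

# Subdirectories named in the basic structure above.
EXPECTED_SUBDIRS = ("VOCcode", "VOC2007")


def check_voc_link(data_dir="data", link_name="VOCdevkit2007"):
    """Return a list of problems found with the VOCdevkit symlink (empty if none)."""
    link = Path(data_dir) / link_name
    if not link.exists():  # also False for a dangling symlink
        return [f"{link} does not exist or points to a missing target"]
    problems = []
    for sub in EXPECTED_SUBDIRS:
        if not (link / sub).is_dir():
            problems.append(f"{link / sub} is missing")
    return problems


if __name__ == "__main__":
    for problem in check_voc_link():
        print(problem)
```

An empty result means the link resolves and both expected subdirectories are present.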

Evaluate our WSOD

Download models/voc07/voc07_55.8.pth to lbba_boosted_wsod/

./test_voc07.sh 0 pascal_voc vgg16 voc07_55.8.pth

Note that different environments may yield slightly different results. For example, we obtain 55.8 mAP with CUDA 10.1 but 55.5 mAP using the same code with CUDA 11.

Train WSOD

Download models/lbba/lbba_final.pth (or lbba_init.pth) to lbba_boosted_wsod/

bash train_wsod.sh 1 pascal_voc vgg16 voc07_wsddn_pre lbba_final.pth

Note that we provide several LBBA checkpoints: the initialization stage, the final stage, and even the one-class adjuster mentioned in the supplementary material.

Citation

@InProceedings{Dong_2021_ICCV,
    author    = {Dong, Bowen and Huang, Zitong and Guo, Yuelin and Wang, Qilong and Niu, Zhenxing and Zuo, Wangmeng},
    title     = {Boosting Weakly Supervised Object Detection via Learning Bounding Box Adjusters},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {2876-2885}
}

Owner

Martin Dong
