Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation (CVPR 2022)

Last update: Dec 27, 2022

Related tags

Deep Learning image-segmentation

Overview

CCAM (Unsupervised)

Code repository for our paper "CCAM: Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation" in CVPR 2022.

The repository includes full training, evaluation, and visualization codes on CUB-200-2011, ILSVRC2012, and PASCAL VOC2012 datasets.

We provide the extracted class-agnostic bounding boxes (on CUB-200-2011 and ILSVRC2012) and background cues (on PASCAL VOC12) at here.

Dependencies

Python 3
PyTorch 1.7.1
OpenCV-Python
Numpy
Scipy
MatplotLib
Yaml
Easydict

Dataset

CUB-200-2011

You will need to download the images (JPEG format) in CUB-200-2011 dataset at here. Make sure your data/CUB_200_2011 folder is structured as follows:

├── CUB_200_2011/
|   ├── images
|   ├── images.txt
|   ├── bounding_boxes.txt
|   ...
|   └── train_test_split.txt

You will need to download the images (JPEG format) in ILSVRC2012 dataset at here. Make sure your data/ILSVRC2012 folder is structured as follows:

ILSVRC2012

├── ILSVRC2012/ 
|   ├── train
|   ├── val
|   ├── val_boxes
|   |   ├——val
|   |   |   ├—— ILSVRC2012_val_00050000.xml
|   |   |   ├—— ...
|   ├── train.txt
|   └── val.txt

PASCAL VOC2012

You will need to download the images (JPEG format) in PASCAL VOC2012 dataset at here. Make sure your data/VOC2012 folder is structured as follows:

├── VOC2012/
|   ├── Annotations
|   ├── ImageSets
|   ├── SegmentationClass
|   ├── SegmentationClassAug
|   └── SegmentationObject

For WSOL task

please refer to the directory of './WSOL'

cd WSOL

For WSSS task

please refer to the directory of './WSSS'

cd WSSS

Comparison with CAM

CUSTOM DATASET

As CCAM is an unsupervised method, it can be applied to various scenarios, like ReID, Saliency detection, or skin lesion detection. We provide an example to apply CCAM on your custom dataset like 'Market-1501'.

cd CUSTOM

Reference

If you are using our code, please consider citing our paper.

@article{xie2022contrastive,
  title={Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation},
  author={Xie, Jinheng and Xiang, Jianfeng and Chen, Junliang and Hou, Xianxu and Zhao, Xiaodong and Shen, Linlin},
  journal={arXiv preprint arXiv:2203.13505},
  year={2022}
}

Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation (CVPR 2022)

Related tags

Overview

CCAM (Unsupervised)

Dependencies

Dataset

CUB-200-2011

ILSVRC2012

PASCAL VOC2012

For WSOL task

For WSSS task

Comparison with CAM

CUSTOM DATASET

Reference

Owner

Computer Vision Insitute, SZU

Repo for FUZE project. I will also publish some Linux kernel LPE exploits for various real world kernel vulnerabilities here. the samples are uploaded for education purposes for red and blue teams.

ncnn is a high-performance neural network inference framework optimized for the mobile platform

ObjDetApp deploys a pytorch model for object detection

ToFFi - Toolbox for Frequency-based Fingerprinting of Brain Signals

A multi-scale unsupervised learning for deformable image registration

Multi-Task Learning as a Bargaining Game

Modular Probabilistic Programming on MXNet

Source codes of CenterTrack++ in 2021 ICME Workshop on Big Surveillance Data Processing and Analysis

Tensorflow implementation of "BEGAN: Boundary Equilibrium Generative Adversarial Networks"

[CVPR2021] Invertible Image Signal Processing

Submanifold sparse convolutional networks

RETRO-pytorch - Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

TensorFlow (v2.7.0) benchmark results on an M1 Macbook Air 2020 laptop (macOS Monterey v12.1).

An efficient framework for reinforcement learning.

Large-Scale Unsupervised Object Discovery

A High-Quality Real Time Upscaler for Anime Video

Python Assignments for the Deep Learning lectures by Andrew NG on coursera with complete submission for grading capability.

This package proposes simplified exporting pytorch models to ONNX and TensorRT, and also gives some base interface for model inference.

Code for AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network (ICCV 2021).

Answering Open-Domain Questions of Varying Reasoning Steps from Text

Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation (CVPR 2022)

Related tags

Overview

CCAM (Unsupervised)

Dependencies

Dataset

CUB-200-2011

ILSVRC2012

PASCAL VOC2012

For WSOL task

For WSSS task

Comparison with CAM

CUSTOM DATASET

Reference

Owner

Computer Vision Insitute, SZU

Repo for FUZE project. I will also publish some Linux kernel LPE exploits for various real world kernel vulnerabilities here. the samples are uploaded for education purposes for red and blue teams.

ncnn is a high-performance neural network inference framework optimized for the mobile platform

*ObjDetApp* deploys a pytorch model for object detection

ToFFi - Toolbox for Frequency-based Fingerprinting of Brain Signals

A multi-scale unsupervised learning for deformable image registration

Multi-Task Learning as a Bargaining Game

Modular Probabilistic Programming on MXNet

Source codes of CenterTrack++ in 2021 ICME Workshop on Big Surveillance Data Processing and Analysis

Tensorflow implementation of "BEGAN: Boundary Equilibrium Generative Adversarial Networks"

[CVPR2021] Invertible Image Signal Processing

Submanifold sparse convolutional networks

RETRO-pytorch - Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

TensorFlow (v2.7.0) benchmark results on an M1 Macbook Air 2020 laptop (macOS Monterey v12.1).

An efficient framework for reinforcement learning.

Large-Scale Unsupervised Object Discovery

A High-Quality Real Time Upscaler for Anime Video

Python Assignments for the Deep Learning lectures by Andrew NG on coursera with complete submission for grading capability.

This package proposes simplified exporting pytorch models to ONNX and TensorRT, and also gives some base interface for model inference.

Code for AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network (ICCV 2021).

Answering Open-Domain Questions of Varying Reasoning Steps from Text

ObjDetApp deploys a pytorch model for object detection