Codes for "Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier"

Last update: May 26, 2022

Related tags

Overview

Deep-RTC [project page]

This repository contains the source code accompanying our ECCV 2020 paper.

Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier
Tz-Ying Wu, Pedro Morgado, Pei Wang, Chih-Hui Ho, Nuno Vasconcelos

@inproceedings{Wu20DeepRTC,
	title={Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier},
	author={Tz-Ying Wu and Pedro Morgado and Pei Wang and Chih-Hui Ho and Nuno Vasconcelos},
	booktitle={European Conference on Computer Vision (ECCV)},
	year={2020}
}

Dependencies

Python (3.5.6)
PyTorch (1.2.0)
torchvision (0.4.0)
NumPy (1.15.2)
Pillow (5.2.0)
PyYaml (5.1.2)
tensorboardX (1.8)

Data preparation

CIFAR100 [Raw images] [Long-tail version]
AWA2 [Raw images]
ImageNet [Raw images] [Long-tail version]
iNaturalist [Raw images]

These datasets can be downloaded from the above links. Please organize the images in the hierarchical folders that represent the dataset hierarchy, and put the root folder under prepro/raw. For example,

prepro/raw/imagenet
--abstraction
----bubble
------ILSVRC2012_val_00014026.JPEG
------ILSVRC2012_val_00000697.JPEG
...
--physical_entity
----object
...

While CIFAR100 and iNaturalist have released taxonomies, we built the tree-type taxonomy of AWA2 and ImageNet with WordNet. All the taxonomies are provided in prepro/data/{dataset}/tree.npy, and the data splits are provided in prepro/splits/{dataset}/{split}.json. Please refer to prepro/README.md for more details. After the raw images are managed hierarchically, run

$ ./prepare_data.sh {dataset}

where {dataset}=awa2/cifar100/imagenet/inaturalist. This will automatically generate the data lists for all splits, and build the codeword matrices needed for training Deep-RTC. Note that our codes can be applied to other datasets once they are organized hierarchically.

Training and evaluation

To train and evaluate Deep-RTC, run

$ export PYTHONPATH=${PWD}/prepro:${PYTHONPATH}
$ ./run.sh {dataset}

where {dataset}=awa2/cifar100/imagenet/inaturalist. Our pretrained models can be downloaded here.

Codes for "Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier"

Related tags

Overview

Deep-RTC [project page]

Dependencies

Data preparation

Training and evaluation

Owner

Gina Wu

Lua-parser-lark - An out-of-box Lua parser written in Lark

Evaluating saliency methods on artificial data with different background types

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

Moon-patrol - A faithful recreation of the 1983 hit classic Moon Patrol for the Atari 2600 created using the Pygame library for Python

Compute execution plan: A DAG representation of work that you want to get done. Individual nodes of the DAG could be simple python or shell tasks or complex deeply nested parallel branches or embedded DAGs themselves.

Generating retro pixel game characters with Generative Adversarial Networks. Dataset "TinyHero" included.

A curated list of awesome Deep Learning tutorials, projects and communities.

DeepLabv3+：Encoder-Decoder with Atrous Separable Convolution语义分割模型在tensorflow2当中的实现

Python implementation of MULTIseq barcode alignment using fuzzy string matching and GMM barcode assignment

A simple editor for captions in .SRT file extension

QMagFace: Simple and Accurate Quality-Aware Face Recognition

Deep learning-based approach to discovering Granger causality networks in multivariate time series

Source Code and data for my paper titled Linguistic Knowledge in Data Augmentation for Natural Language Processing: An Example on Chinese Question Matching

The official repository for "Score Transformer: Generating Musical Scores from Note-level Representation" (MMAsia '21)

Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning using 🤗 transformers

A Real-World Benchmark for Reinforcement Learning based Recommender System

Liver segmentation using MONAI and pytorch

Visual odometry package based on hardware-accelerated NVIDIA Elbrus library with world class quality and performance.

Code of TIP2021 Paper《SFace: Sigmoid-Constrained Hypersphere Loss for Robust Face Recognition》. We provide both MxNet and Pytorch versions.

Use stochastic processes to generate samples and use them to train a fully-connected neural network based on Keras