A Novel Plug-in Module for Fine-grained Visual Classification

Last update: Dec 20, 2022

Overview

A Novel Plug-in Module for Fine-grained Visual Classification

paper url: https://arxiv.org/abs/2202.03822

We propose a novel plug-in module that can be integrated to many common backbones, including CNN-based or Transformer-based networks to provide strongly discriminative regions. The plugin module can output pixel-level feature maps and fuse filtered features to enhance fine-grained visual classification. Experimental results show that the proposed plugin module outperforms state-ofthe-art approaches and significantly improves the accuracy to 92.77% and 92.83% on CUB200-2011 and NABirds, respectively.

1. Environment setting

install requirements
replace folder timm/ to our timm/ folder (for ViT or Swin-T)

Prepare dataset

In this paper, we use 2 large bird's datasets:

Our pretrained model

Download the pretrained model from this url: https://drive.google.com/drive/folders/1ivMJl4_EgE-EVU_5T8giQTwcNQ6RPtAo?usp=sharing

backup/ is our pretrained model path.
resnet50_miil_21k.pth and vit_base_patch16_224_miil_21k.pth are imagenet21k pretrained model (place these file under models/), thanks to https://github.com/Alibaba-MIIL/ImageNet21K/blob/main/MODEL_ZOO.md !!

OS

Windows10
Ubuntu20.04
macOS

2. Train

configuration file: config.py

python train.py --train_root "./CUB200-2011/train/" --val_root "./CUB200-2011/test/"

3. Evaluation

configuration file: config_eval.py

python eval.py --pretrained_path "./backup/CUB200/best.pth" --val_root "./CUB200-2011/test/"

4. Visualization

configuration file: config_plot.py

python plot_heat.py --pretrained_path "./backup/CUB200/best.pth" --img_path "./img/001.png/"

Acknowledgment

Thanks to timm for Pytorch implementation.
This work was financially supported by the National Taiwan Normal University (NTNU) within the framework of the Higher Education Sprout Project by the Ministry of Education(MOE) in Taiwan, sponsored by Ministry of Science and Technology, Taiwan, R.O.C. under Grant no. MOST 110- 2221-E-003-026, 110-2634-F-003 -007, and 110-2634-F-003 -006. In addition, we thank to National Center for Highperformance Computing (NCHC) for providing computational and storage resources.

A Novel Plug-in Module for Fine-grained Visual Classification

Related tags

Overview

A Novel Plug-in Module for Fine-grained Visual Classification

1. Environment setting

Prepare dataset

Our pretrained model

OS

2. Train

3. Evaluation

4. Visualization

Acknowledgment

Owner

ChouPoYung

Deep learning library for solving differential equations and more

Unofficial implementation of Fast-SCNN: Fast Semantic Segmentation Network

Portfolio analytics for quants, written in Python

HarDNeXt: Official HarDNeXt repository

Artificial intelligence technology inferring issues and logically supporting facts from raw text

The official code of Anisotropic Stroke Control for Multiple Artists Style Transfer

PyTorch Implementation of PIXOR: Real-time 3D Object Detection from Point Clouds

Image-retrieval-baseline - MUGE Multimodal Retrieval Baseline

Bachelor's Thesis in Computer Science: Privacy-Preserving Federated Learning Applied to Decentralized Data

Official implementation of Deep Reparametrization of Multi-Frame Super-Resolution and Denoising

Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning

Yoloxkeypointsegment - An anchor-free version of YOLO, with a simpler design but better performance

CIFAR-10_train-test - training and testing codes for dataset CIFAR-10

A library for performing coverage guided fuzzing of neural networks

Repositorio de los Laboratorios de Análisis Numérico / Análisis Numérico I de FAMAF, UNC.

Keras like implementation of Deep Learning architectures from scratch using numpy.

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

Unsupervised Domain Adaptation for Nighttime Aerial Tracking (CVPR2022)

LOFO (Leave One Feature Out) Importance calculates the importances of a set of features based on a metric of choice,

Implementation for Shape from Polarization for Complex Scenes in the Wild