This is the code repository for the paper OODformer: Out-Of-Distribution Detection Transformer.

Overview

OODformer: Out-Of-Distribution Detection Transformer

This repo is the official PyTorch implementation of OODformer: Out-Of-Distribution Detection Transformer, using CIFAR as an illustrative example.
Getting Started

First, install all the dependencies: pip install -r requirement.txt

Datasets

Please download all the in-distribution (CIFAR-10, CIFAR-100, ImageNet-30) and out-of-distribution (LSUN_resize, ImageNet_resize, Places-365, DTD, Stanford Dogs, Food-101, Caltech-256, CUB-200) datasets to the data folder under the root directory.
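For the CIFAR in-distribution sets, torchvision can fetch the data into the expected folder; a minimal sketch (the remaining datasets must be downloaded manually from their respective sources):

from torchvision import datasets

# Download the CIFAR in-distribution sets into data/ (the OOD sets, e.g.
# LSUN_resize or Places-365, must be placed under data/ by hand).
datasets.CIFAR10(root="data", train=True, download=True)   # data/cifar-10-batches-py
datasets.CIFAR100(root="data", train=True, download=True)  # data/cifar-100-python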

Training

To train the Vision Transformer (ViT) and its data-efficient variant (DeiT), first download the corresponding pre-trained weights from the ViT and DeiT repositories.

To fine-tune a Vision Transformer on any in-distribution dataset in a multi-GPU setting:

srun --gres=gpu:4  python vit/src/train.py --exp-name name_of_the_experiment --tensorboard --model-arch b16 --checkpoint-path path/to/checkpoint --image-size 224 --data-dir data/ImageNet30 --dataset ImageNet --num-classes 30 --train-steps 4590 --lr 0.01 --wd 1e-5 --n-gpu 4 --num-workers 16 --batch-size 512 --method SupCE
  • model-arch : specifies the ViT or DeiT variant to use (see vit/src/config.py)
  • method : currently only supervised cross-entropy (SupCE) is supported
  • train-steps : a cyclic learning-rate scheduler is used; the number of training epochs can be calculated as (train_steps × batch_size) / number of training samples, as worked through in the example after this list
  • checkpoint-path : path to the pre-trained Vision Transformer weights for the chosen model variant
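As a quick worked example of the epoch calculation above, assuming the CIFAR-10 training set of 50,000 images:

# epochs = (train_steps * batch_size) / num_training_samples
train_steps, batch_size = 4590, 512
num_train_samples = 50_000   # CIFAR-10 training set; adjust for your dataset
epochs = train_steps * batch_size / num_train_samples
print(f"~{epochs:.0f} epochs")   # ~47 epochs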

Training Support

OODformer can also be trained with various supervised and self-supervised losses, such as the supervised and self-supervised contrastive losses from SupContrast (see Acknowledgments); a minimal sketch of the supervised contrastive loss follows.
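For illustration, here is a minimal sketch of the supervised contrastive (SupCon) loss of Khosla et al., which SupContrast implements; this is a simplification (single view per sample), not the repo's exact training code:

import torch
import torch.nn.functional as F

def supcon_loss(features, labels, temperature=0.1):
    # features: (N, D) embeddings; labels: (N,) class ids.
    z = F.normalize(features, dim=1)                    # work in cosine space
    sim = z @ z.T / temperature                         # (N, N) pairwise logits
    self_mask = torch.eye(len(z), dtype=torch.bool, device=z.device)
    sim = sim.masked_fill(self_mask, float("-inf"))     # exclude self-pairs
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    log_prob = log_prob.masked_fill(self_mask, 0.0)     # avoid -inf * 0 = nan
    pos = (labels[:, None] == labels[None, :]) & ~self_mask
    # per-anchor mean log-probability of its positives
    loss = -(log_prob * pos.float()).sum(1) / pos.sum(1).clamp(min=1)
    return loss.mean()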

Training Base ResNet model

To train ResNet variants (e.g., ResNet-50, Wide-ResNet) as the base model on an in-distribution dataset:

srun --gres=gpu:4  python main_ce.py --batch_size 512 --epochs 500 --model resnet34 --learning_rate 0.8  --cosine --warm --dataset cifar10
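The --warm and --cosine flags request a linear warmup followed by cosine learning-rate decay; a minimal PyTorch approximation of such a schedule (a sketch under assumed hyperparameters, not the repo's exact code):

import math
import torch

model = torch.nn.Linear(8, 8)   # stand-in for the ResNet
optimizer = torch.optim.SGD(model.parameters(), lr=0.8, momentum=0.9)

warmup_epochs, total_epochs = 10, 500   # assumed warmup length
def lr_factor(epoch):
    if epoch < warmup_epochs:
        return (epoch + 1) / warmup_epochs               # linear warmup
    progress = (epoch - warmup_epochs) / (total_epochs - warmup_epochs)
    return 0.5 * (1.0 + math.cos(math.pi * progress))    # cosine decay to 0
scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_factor)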

Evaluation

To evaluate OODformer, the similarity distance of a sample from the mean embedding of each in-distribution (e.g., CIFAR-10) class can be computed with any of several distance metrics (Mahalanobis, cosine, Euclidean, or softmax):

srun --gres=gpu:1 python OOD_Distance.py --ckpt checkpoint_path --model vit --model_arch b16 --distance Mahalanobis --dataset id_dataset --out_dataset ood_dataset
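For intuition, here is a minimal NumPy sketch of Mahalanobis scoring from class-conditional statistics (class means with a shared covariance, scoring each sample by its distance to the nearest class mean); OOD_Distance.py is the authoritative implementation:

import numpy as np

def fit_class_gaussians(feats, labels):
    # feats: (N, D) in-distribution embeddings; labels: (N,) ids in 0..C-1.
    means = np.stack([feats[labels == c].mean(0) for c in range(labels.max() + 1)])
    centered = feats - means[labels]                  # shared (tied) covariance
    precision = np.linalg.pinv(centered.T @ centered / len(feats))
    return means, precision

def mahalanobis_score(x, means, precision):
    # x: (M, D) test embeddings; returns squared distance to the nearest
    # class mean, so larger scores suggest out-of-distribution samples.
    diffs = x[:, None, :] - means[None, :, :]         # (M, C, D)
    d2 = np.einsum("mcd,de,mce->mc", diffs, precision, diffs)
    return d2.min(axis=1)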

Visualization

Various embedding visualizations can be generated using generate_tsne.py:

(1) UMAP of in-distribution embeddings

(2) UMAP of combined in-distribution and out-of-distribution embeddings
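A minimal sketch of such a UMAP projection with umap-learn, assuming feats and labels hold embeddings and class ids extracted from the trained model (hypothetical names, not the script's API):

import umap                          # pip install umap-learn
import matplotlib.pyplot as plt

def plot_umap(feats, labels, title="umap"):
    # Project (N, D) embeddings to 2-D and colour the points by class id.
    emb = umap.UMAP(n_neighbors=15, min_dist=0.1).fit_transform(feats)
    plt.figure(figsize=(6, 6))
    plt.scatter(emb[:, 0], emb[:, 1], c=labels, s=2, cmap="tab20")
    plt.title(title)
    plt.savefig(f"{title}.png", dpi=200)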

Reference

@article{koner2021oodformer,
  title={OODformer: Out-Of-Distribution Detection Transformer},
  author={Koner, Rajat and Sinhamahapatra, Poulami and Roscher, Karsten and G{\"u}nnemann, Stephan and Tresp, Volker},
  journal={arXiv preprint arXiv:2107.08976},
  year={2021}
}

Acknowledgments

Part of this code is inspired by HobbitLong/SupContrast.
