RVT: Robust Vision Transformers

This repository contains PyTorch code for Robust Vision Transformers.

For details, see "Rethinking the Design Principles of Robust Vision Transformer" by Xiaofeng Mao, Gege Qi, Yuefeng Chen, Yuan He, and Hui Xue.

Usage

First, clone the repository locally:

git clone https://github.com/vtddggg/Robust-Vision-Transformer.git

Then, install PyTorch 1.7.0+, torchvision 0.8.1+, and pytorch-image-models (timm) 0.3.2:

conda install -c pytorch pytorch torchvision
pip install timm==0.3.2
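
After installing, it can help to confirm that the environment matches the versions listed above. The following is a minimal sketch, not part of this repository: the helper names (`parse_version`, `meets_requirement`, `installed_version`) are hypothetical, and it assumes the packages are registered under the distribution names `torch`, `torchvision` and `timm`.

```python
import re
from importlib import metadata

# Versions stated in this README: minimums for torch/torchvision, exact pin for timm.
REQUIREMENTS = {
    "torch": ("min", "1.7.0"),
    "torchvision": ("min", "0.8.1"),
    "timm": ("exact", "0.3.2"),
}


def parse_version(text):
    """Turn a version string such as '1.7.0+cu110' into a tuple like (1, 7, 0)."""
    core = text.split("+")[0]  # drop local build suffixes such as '+cu110'
    parts = []
    for piece in core.split(".")[:3]:
        match = re.match(r"\d+", piece)  # keep only the leading digits
        parts.append(int(match.group()) if match else 0)
    while len(parts) < 3:  # pad '1.7' to (1, 7, 0)
        parts.append(0)
    return tuple(parts)


def meets_requirement(name, installed):
    """Return True if the installed version satisfies the README requirement."""
    if installed is None:
        return False
    kind, wanted = REQUIREMENTS[name]
    have, want = parse_version(installed), parse_version(wanted)
    return have == want if kind == "exact" else have >= want


def installed_version(name):
    """Look up the installed version, or None if the package is missing."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    for name in REQUIREMENTS:
        found = installed_version(name)
        status = "ok" if meets_requirement(name, found) else "check this"
        print(f"{name}: {found} ({status})")
```

The exact-match rule for timm mirrors the `timm==0.3.2` pin in the install command; whether newer timm releases also work is not stated here.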

We use 4 nodes with 8 GPUs each to train RVT-Ti, RVT-S and RVT-B:

Training RVT-Ti

python -m torch.distributed.launch --nproc_per_node=8 --nnodes=4 main.py --model rvt_tiny --data-path /path/to/imagenet --output_dir output --dist-eval

Training RVT-S

python -m torch.distributed.launch --nproc_per_node=8 --nnodes=4 main.py --model rvt_small --data-path /path/to/imagenet --output_dir output --dist-eval

Training RVT-B

python -m torch.distributed.launch --nproc_per_node=8 --nnodes=4 main.py --model rvt_base --data-path /path/to/imagenet --output_dir output --batch-size 32 --dist-eval

To train RVT-Ti*, RVT-S* or RVT-B*, add --use_mask and --use_patch_aug to the command to enable position-aware attention scaling and patch-wise augmentation.
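
The training commands above are written once, but on a 4-node setup each node generally needs its own invocation. torch.distributed.launch accepts --node_rank, --master_addr and --master_port for this purpose, and defaults to rank 0 and 127.0.0.1 when they are omitted. The sketch below builds the per-node command strings. The helper `launch_command` is hypothetical and not part of this repository, and the address 10.0.0.1 and port 29500 are placeholders to be replaced with your cluster's values.

```python
import shlex


def launch_command(model, node_rank, master_addr, master_port=29500,
                   nnodes=4, gpus_per_node=8,
                   data_path="/path/to/imagenet", output_dir="output",
                   batch_size=None, starred=False):
    """Build the launch command for one node (illustrative helper only)."""
    cmd = [
        "python", "-m", "torch.distributed.launch",
        f"--nproc_per_node={gpus_per_node}",
        f"--nnodes={nnodes}",
        f"--node_rank={node_rank}",      # differs on every node: 0 .. nnodes-1
        f"--master_addr={master_addr}",  # address of the rank-0 node (placeholder)
        f"--master_port={master_port}",  # any free port on the rank-0 node
        "main.py",
        "--model", model,
        "--data-path", data_path,
        "--output_dir", output_dir,
    ]
    if batch_size is not None:
        # The README passes --batch-size 32 only for rvt_base.
        cmd += ["--batch-size", str(batch_size)]
    if starred:
        # Starred variants: position-aware attention scaling + patch-wise augmentation.
        cmd += ["--use_mask", "--use_patch_aug"]
    cmd.append("--dist-eval")
    return " ".join(shlex.quote(part) for part in cmd)


if __name__ == "__main__":
    # Print the command to run on each of the 4 nodes for RVT-B*.
    for rank in range(4):
        print(launch_command("rvt_base", rank, "10.0.0.1",
                             batch_size=32, starred=True))
```

Run the printed command for rank 0 on the node whose address was given as the master, and the remaining commands on the other nodes.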
