This project uses ViT to perform image classification tasks on DATA set CIFAR10.

Last update: Jun 03, 2022

Overview

Vision-Transformer-Multiprocess-DistributedDataParallel-Apex

Introduction

This project uses ViT to perform image classification tasks on DATA set CIFAR10. The implement of Vit and pretrained weight are from https://github.com/asyml/vision-transformer-pytorch. Different from https://github.com/Kaicheng-Yang0828/Vision-Transformer-ViT, this project use multi-process distributed training and it also use Apex to reduce GPU resource consumption.

Requirments

pytorch 1.7.1
python 3.7.3

Install Apex

1、 git clone https://github.com/NVIDIA/apex.git
2、 cd apex
3、 python setup.py install

Datasets

Download the CIFAR10 from http://www.cs.toronto.edu/~kriz/cifar.html or you can get it from https://pan.baidu.com/s/1ogAFopdVzswge2Aaru_lvw (code: k5v8), creat data floder and unzip the cifar-10-python.tar.gz under './data'

Pre_trained model

You can download the pretrained file from https://pan.baidu.com/s/1CuUj-XIXwecxWMEcLoJzPg (code: ox9n), creat Vit_weights floder and pretrained file under ./Vit_weights

Train

python main.py

Result

Base on the pretrained weight, after one epoch, I get 98.1 Accuracy (I didn't adjust the parameters carefully, you can get better results by adjusting the parameters)

model	dataset	acc
ViT-B_16	CIFAR10	98.1

Attention

1、Multi-process parallel training reduces the training time by one-fifth
2、Apex reduce about 30% GPU resources under the premise of ensuring the same accuracy rate

This project uses ViT to perform image classification tasks on DATA set CIFAR10.

Related tags

Overview

Vision-Transformer-Multiprocess-DistributedDataParallel-Apex

Introduction

Requirments

Install Apex

Datasets

Pre_trained model

Train

Result

Attention

Owner

Kaicheng Yang

Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation

This script runs neural style transfer against the provided content image.

PyTorch implementation of paper A Fast Knowledge Distillation Framework for Visual Recognition.

CNN designed for pansharpening

dyld_shared_cache processing / Single-Image loading for BinaryNinja

QMagFace: Simple and Accurate Quality-Aware Face Recognition

PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference

Optimizaciones incrementales al problema N-Body con el fin de evaluar y comparar las prestaciones de los traductores de Python en el ámbito de HPC.

KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch

AI assistant built in python.the features are it can display time,say weather,open-google,youtube,instagram.

Code for the paper 'A High Performance CRF Model for Clothes Parsing'.

Pytorch implementation of Cut-Thumbnail in the paper Cut-Thumbnail:A Novel Data Augmentation for Convolutional Neural Network.

Vehicle direction identification consists of three module detection , tracking and direction recognization.

MATLAB codes of the book "Digital Image Processing Fourth Edition" converted to Python

CaFM-pytorch ICCV ACCEPT Introduction of dataset VSD4K

BT-Unet: A-Self-supervised-learning-framework-for-biomedical-image-segmentation-using-Barlow-Twins

Benchmarking the robustness of Spatial-Temporal Models

We evaluate our method on different datasets (including ShapeNet, CUB-200-2011, and Pascal3D+) and achieve state-of-the-art results, outperforming all the other supervised and unsupervised methods and 3D representations, all in terms of performance, accuracy, and training time.

Computer Vision is an elective course of MSAI, SCSE, NTU, Singapore

Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.