Accuracy Aligned. Concise Implementation of Swin Transformer

Last update: Dec 16, 2022

Related tags

Overview

Accuracy Aligned. Concise Implementation of Swin Transformer

This repository contains the implementation of Swin Transformer, and the training codes on ImageNet datasets. We have aligned the output of our network with the official one, that is, using the same input and random seed, the output is identical to the official one.

Our implementation is highly based on einops, which makes the implementation more concise, and easy to be understand. (Intuitively, we use only 200 lines of codes compared with ~600 lines of official codes.) Besides, our implementation keeps the same training speed.

Model	Epoch	[email protected](our)	[email protected](our)	[email protected](official)	[email protected](official)	pretrained model
Swin-T	300	81.3	95.5	81.2	95.5	here

Usage

Train on ImageNet:

Train Swin-T

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_T \
--batch-size 128 --drop-path 0.2 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinT/

Train Swin-S

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_S \
--batch-size 128 --drop-path 0.3 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinS/

Train Swin-B

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_B \
--batch-size 128 --drop-path 0.5 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinB/

Reference

The training process involves many training and augmentation tricks, such as stochastic depth, mixup, cutmix and random erasing. I borrow large from Deit (https://github.com/facebookresearch/deit).

Citations

@misc{liu2021swin,
      title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows}, 
      author={Ze Liu and Yutong Lin and Yue Cao and Han Hu and Yixuan Wei and Zheng Zhang and Stephen Lin and Baining Guo},
      year={2021},
      eprint={2103.14030},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Accuracy Aligned. Concise Implementation of Swin Transformer

Related tags

Overview

Accuracy Aligned. Concise Implementation of Swin Transformer

Usage

Reference

Citations

Owner

FengWang

Use Python, OpenCV, and MediaPipe to control a keyboard with facial gestures

PyTorch implementation of hand mesh reconstruction described in CMR and MobRecon.

Python library containing BART query generation and BERT-based Siamese models for neural retrieval.

A universal memory dumper using Frida

Job-Recommend-Competition - Vectorwise Interpretable Attentions for Multimodal Tabular Data

Localized representation learning from Vision and Text (LoVT)

This is an (re-)implementation of DeepLab-ResNet in TensorFlow for semantic image segmentation on the PASCAL VOC dataset.

Repository for the paper "From global to local MDI variable importances for random forests and when they are Shapley values"

OneFlow is a performance-centered and open-source deep learning framework.

Implementation of ICCV19 Paper "Learning Two-View Correspondences and Geometry Using Order-Aware Network"

✨风纪委员会自动投票脚本，利用Github Action帮你进行裁决操作（为了让其他风纪委员有案件可判，本程序从中午12点才开始运行，有需要请自己修改运行时间）

Sequence Modeling with Structured State Spaces

Multi-angle c(q)uestion answering

Text Summarization - WCN — Weighted Contextual N-gram method for evaluation of Text Summarization

Simple SN-GAN to generate CryptoPunks

SCALoss: Side and Corner Aligned Loss for Bounding Box Regression (AAAI2022).

Learning from History: Modeling Temporal Knowledge Graphs with Sequential Copy-Generation Networks

Advanced Signal Processing Notebooks and Tutorials

Hl classification bc - A Network-Based High-Level Data Classification Algorithm Using Betweenness Centrality

StarGAN-ZSVC: Unofficial PyTorch Implementation