Implementation of Convolutional enhanced image Transformer

Last update: Dec 13, 2022

Overview

CeiT : Convolutional enhanced image Transformer

This is an unofficial PyTorch implementation of Incorporating Convolution Designs into Visual Transformers .

Training :

python train.py -c configs/default.yaml --name "name_of_exp"

Usage :

import torch
from ceit import CeiT

img = torch.ones([1, 3, 224, 224])
    
model = CeiT(image_size = 224, patch_size = 4, num_classes = 100)
out = model(img)

print("Shape of out :", out.shape)      # [B, num_classes]

model = CeiT(image_size = 224, patch_size = 4, num_classes = 100, with_lca = True)
out = model(img)

print("Shape of out :", out.shape)      # [B, num_classes]

Note :

LCA might not be properly implemented.

Citation :

@misc{yuan2021incorporating,
      title={Incorporating Convolution Designs into Visual Transformers}, 
      author={Kun Yuan and Shaopeng Guo and Ziwei Liu and Aojun Zhou and Fengwei Yu and Wei Wu},
      year={2021},
      eprint={2103.11816},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledgement :

Base ViT code is borrowed from @lucidrains repo : https://github.com/lucidrains/vit-pytorch
Training and dataloader code is borrowed from @jeonsworld repo : https://github.com/jeonsworld/ViT-pytorch

Implementation of Convolutional enhanced image Transformer

Related tags

Overview

CeiT : Convolutional enhanced image Transformer

Training :

Usage :

Note :

Citation :

Acknowledgement :

Owner

Rishikesh (ऋषिकेश)

My 1st place solution at Kaggle Hotel-ID 2021

This is the offical website for paper ''Category-consistent deep network learning for accurate vehicle logo recognition''

Most popular metrics used to evaluate object detection algorithms.

Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm

[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime

Mahadi-Now - This Is Pakistani Just Now Login Tools

Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening

Flower classification model that classifies flowers in 10 classes made using transfer learning (~85% accuracy).

Dense Prediction Transformers

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Neighborhood Contrastive Learning for Novel Class Discovery

[WWW 2021] Source code for "Graph Contrastive Learning with Adaptive Augmentation"

This is the source code for our ICLR2021 paper: Adaptive Universal Generalized PageRank Graph Neural Network.

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement

[ICCV'21] Pri3D: Can 3D Priors Help 2D Representation Learning?

Highly comparative time-series analysis

Official implementation of the PICASO: Permutation-Invariant Cascaded Attentional Set Operator

Easy-to-use library to boost AI inference leveraging state-of-the-art optimization techniques.

AAAI 2022 paper - Unifying Model Explainability and Robustness for Joint Text Classification and Rationale Extraction