Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

Last update: Sep 09, 2022

Overview

Pytorch Implementation of Augmenting Convolutional networks with attention-based aggregation

This is the unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

reference: https://arxiv.org/pdf/2112.13692.pdf

Prerequisites

PyTorch
PyTorch Lightning
timm
torchmetrics
torchvision
python3
CUDA

Comments

Due to computation limits, CIFAR100 dataset was used in contrast to ImageNet in the original paper.
Since the official code is not released yet, there may be differences in structures and hyperparameters.
- Most of the hidden dimensions were chosen based on guesswork.
MADGRAD was used instead of LAMB optimizer.
(I thought it would be inefficient to use LAMB for small batchsizes in my local machine)
LayerScale will be added soon

Citations

@misc{touvron2021augmenting,
      title={Augmenting Convolutional networks with attention-based aggregation}, 
      author={Hugo Touvron and Matthieu Cord and Alaaeldin El-Nouby and Piotr Bojanowski and Armand Joulin and Gabriel Synnaeve and Hervé Jégou},
      year={2021},
      eprint={2112.13692},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

Related tags

Overview

Pytorch Implementation of Augmenting Convolutional networks with attention-based aggregation

Prerequisites

Comments

Citations

Owner

DK

Transformer in Computer Vision

An self sufficient AI that crawls the web to learn how to generate art from keywords

CenterNet:Objects as Points目标检测模型在Pytorch当中的实现

PyTorch Implementation of Small Lesion Segmentation in Brain MRIs with Subpixel Embedding (ORAL, MICCAIW 2021)

VOLO: Vision Outlooker for Visual Recognition

Deep Learning as a Cloud API Service.

Privacy-Preserving Portrait Matting [ACM MM-21]

Global Rhythm Style Transfer Without Text Transcriptions

Implementation of CVAE. Trained CVAE on faces from UTKFace Dataset to produce synthetic faces with a given degree of happiness/smileyness.

Revealing and Protecting Labels in Distributed Training

Code for reproducing our analysis in the paper titled: Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency

Syntax-Aware Action Targeting for Video Captioning

Project Aquarium is a SUSE-sponsored open source project aiming at becoming an easy to use, rock solid storage appliance based on Ceph.

FinGAT: A Financial Graph Attention Networkto Recommend Top-K Profitable Stocks

Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.

FSL-Mate: A collection of resources for few-shot learning (FSL).

Anime Face Detector using mmdet and mmpose

Moving Object Segmentation in 3D LiDAR Data: A Learning-based Approach Exploiting Sequential Data

Veri Setinizi Yolov5 Formatına Dönüştürün

This program writes christmas wish programmatically. It is using turtle as a pen pointer draw christmas trees and stars.