A PyTorch library for Vision Transformers

Last update: Nov 28, 2022

Related tags

Deep Learning vformer

Overview

VFormer

A PyTorch library for Vision Transformers

Getting Started

Read the contributing guidelines in CONTRIBUTING.rst to learn how to start contributing.

Comments

Add attention visualization methods
This article details different ways of visualizing a transformer's attention. It also discusses how such visualizations can aid model explainability.

They also provide their code here.

We would like to have such visualization methods in the viz module.
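
For illustration only, here is a minimal sketch of one well-known attention visualization technique, attention rollout. It is not necessarily one of the methods covered in the article. The function names and the list-of-matrices input format are assumptions made for this example and are not part of the viz module. It is written in plain Python to stay dependency-free; a library version would operate on torch tensors.

```python
def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def attention_rollout(attentions):
    """Combine per-layer attention maps into one token-to-token map.

    attentions: list of head-averaged attention matrices, first layer
    first, each n x n with rows summing to 1 (hypothetical input format).
    """
    n = len(attentions[0])
    # Start from the identity: before any layer, each token only "sees" itself.
    rollout = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for attn in attentions:
        mixed = []
        for i in range(n):
            # Average with the identity to account for the residual connection.
            row = [0.5 * attn[i][j] + (0.5 if i == j else 0.0) for j in range(n)]
            total = sum(row)
            mixed.append([v / total for v in row])  # keep rows normalised
        # Propagate attention through this layer.
        rollout = matmul(mixed, rollout)
    return rollout


# Usage: two layers of uniform attention over three tokens.
uniform = [[1 / 3] * 3 for _ in range(3)]
relevance = attention_rollout([uniform, uniform])
```

Row i of the result estimates how much each input token contributes to token i at the last layer. For a ViT, the row of the class token is typically reshaped to the patch grid and shown as a heatmap.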

good first issue
opened by NeelayS 7
Remove _Projection class

We can replace the _Projection class with a one-line if-else expression.

Should we replace it with if-else or should we keep the current implementation?

cc: @NeelayS @aditya-agrawal-30502 @alvanli

opened by abhi-glitchhg 6
Enhanced docstring

I had to revert the last PR (#45) because of compatibility issues.

In this PR I have added some docstrings and made minor changes, such as renaming variables.

This PR is the same as #48, with an edited title :)

@NeelayS

opened by abhi-glitchhg 3
Restructuring AbsolutePositionEmbedding class

The AbsolutePositionEmbedding class was structured specifically for PVT, but we can use it in other models too if we restructure it properly. It should also support sinusoidal position embeddings; alternatively, a separate class for sinusoidal embeddings would also work.
enhancement
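
As a reference point for the sinusoidal variant, here is a minimal sketch of the fixed sine/cosine position table from the original Transformer paper. The function name and its parameters are hypothetical and do not reflect the current AbsolutePositionEmbedding API. Plain Python is used to keep the example dependency-free; in the library the table would be a non-trainable torch tensor.

```python
import math


def sinusoidal_position_embedding(num_positions, dim, base=10000.0):
    """Return a num_positions x dim table of fixed position embeddings.

    Even columns hold sin(pos / base**(i / dim)) and the following odd
    columns hold the matching cosine, where i is the even column index.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    table = []
    for pos in range(num_positions):
        row = []
        for i in range(0, dim, 2):
            angle = pos / (base ** (i / dim))
            row.append(math.sin(angle))  # column i
            row.append(math.cos(angle))  # column i + 1
        table.append(row)
    return table


# Usage: embeddings for 196 patch positions with embedding size 64.
pos_embed = sinusoidal_position_embedding(196, 64)
```

Because the table has no learnable parameters, it can be computed once and added to the patch embeddings, which is one argument for keeping it in a separate class from the learned embedding.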

opened by abhi-glitchhg 2
Add sharpness-aware optimizer

This paper describes how promoting smoothness with a recently proposed sharpness-aware optimizer substantially improves the performance of ViTs.

It would be good to have an implementation of this optimizer in our library. It would fit in the functional module.

A couple of PyTorch implementations are here and here.
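
To make the idea concrete, here is a minimal sketch of the two-step sharpness-aware update, assuming the standard formulation: first move to a nearby point of higher loss, then descend using the gradient measured there. The function name, the toy loss, and the hyperparameter values are assumptions for illustration. This is not one of the linked implementations, and a library version would wrap a torch optimizer instead of working on plain lists.

```python
import math


def sam_step(params, grad_fn, lr=0.1, rho=0.05):
    """Apply one sharpness-aware update to a list of float parameters.

    grad_fn(params) must return the loss gradient as a list of floats.
    """
    grads = grad_fn(params)
    norm = math.sqrt(sum(g * g for g in grads))
    if norm == 0.0:
        return list(params)  # already at a stationary point
    # Step 1: climb to the approximate worst-case point within radius rho.
    perturbed = [p + rho * g / norm for p, g in zip(params, grads)]
    # Step 2: descend from the ORIGINAL parameters using the gradient
    # evaluated at the perturbed point.
    sharp_grads = grad_fn(perturbed)
    return [p - lr * g for p, g in zip(params, sharp_grads)]


def toy_grad(params):
    """Gradient of the toy loss sum(w**2)."""
    return [2.0 * w for w in params]


# Usage: a few updates on the toy quadratic loss.
weights = [1.0, 0.0]
for _ in range(50):
    weights = sam_step(weights, toy_grad)
```

Each update needs two gradient evaluations, so a training step costs roughly twice as much as with a plain first-order optimizer.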

opened by NeelayS 2
Documentation related to visualization methods

I have added some fixes for page breaks in #86.

Still, we need to enhance the docs for visualization methods.
We can include the license/copyright disclaimer for visualization methods in our license or have a separate file.

Additionally, we can add the sample outputs from these methods into the doc.

CC : @NeelayS @aditya-agrawal-30502 @alvanli
documentation enhancement good first issue

opened by abhi-glitchhg 1
[Paper] Visual Attention Network

Paper: https://arxiv.org/abs/2202.09741
Code: https://github.com/Visual-Attention-Network/VAN-Classification and https://github.com/Visual-Attention-Network/VAN-Segmentation
Paper implementation

opened by abhi-glitchhg 0

Releases (v0.1.3)

v0.1.3 (Jul 3, 2022)

v0.1.2 (Apr 7, 2022)

v0.1.0 (Feb 9, 2022)

First release of VFormer!

Owner

Society for Artificial Intelligence and Deep Learning

GitHub Repository

Deep deconfounded recommender (Deep-Deconf) for paper "Deep causal reasoning for recommendations"

Deep Causal Reasoning for Recommender Systems The codes are associated with the following paper: Deep Causal Reasoning for Recommendations, Yaochen Zh

22 Oct 15, 2022

Full-featured Decision Trees and Random Forests learner.

CID3 This is a full-featured Decision Trees and Random Forests learner. It can save trees or forests to disk for later use. It is possible to query tr

3 Aug 15, 2022

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021) The implementation of Reducing Information Bottleneck for W

81 Dec 16, 2022

A PyTorch implementation of the baseline method in Panoptic Narrative Grounding (ICCV 2021 Oral)

52 Dec 19, 2022

[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.

LBYL-Net This repo implements paper Look Before You Leap: Learning Landmark Features For One-Stage Visual Grounding CVPR 2021. Getting Started Prerequ

45 Dec 12, 2022

NeuralForecast is a Python library for time series forecasting with deep learning models

NeuralForecast is a Python library for time series forecasting with deep learning models. It includes benchmark datasets, data-loading utilities, evaluation functions, statistical tests, univariate m

1.1k Jan 03, 2023

existing and custom freqtrade strategies supporting the new hyperstrategy format.

freqtrade-strategies Description Existing and self-developed strategies, rewritten to support the new HyperStrategy format from the freqtrade-develop

39 Aug 20, 2021

CharacterGAN: Few-Shot Keypoint Character Animation and Reposing

CharacterGAN Implementation of the paper "CharacterGAN: Few-Shot Keypoint Character Animation and Reposing" by Tobias Hinz, Matthew Fisher, Oliver Wan

181 Dec 27, 2022

This repository contains the PyTorch implementation of the paper STaCK: Sentence Ordering with Temporal Commonsense Knowledge appearing at EMNLP 2021.

STaCK: Sentence Ordering with Temporal Commonsense Knowledge This repository contains the pytorch implementation of the paper STaCK: Sentence Ordering

23 Dec 16, 2022

A vision library for performing sliced inference on large images/small objects

SAHI: Slicing Aided Hyper Inference A vision library for performing sliced inference on large images/small objects Overview Object detection and insta

2.3k Jan 04, 2023

Repo for code associated with Modeling the Mitral Valve.

Project Title Mitral Valve Getting Started Repo for code associated with Modeling the Mitral Valve. See https://arxiv.org/abs/1902.00018 for preprint,

1 May 17, 2022

CVPRW 2021: How to calibrate your event camera

E2Calib: How to Calibrate Your Event Camera This repository contains code that implements video reconstruction from event data for calibration as desc

104 Nov 16, 2022

LinkNet - This repository contains our Torch7 implementation of the network developed by us at e-Lab.

LinkNet This repository contains our Torch7 implementation of the network developed by us at e-Lab. You can go to our blogpost or read the article Lin

158 Nov 11, 2022

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

CvT: Introducing Convolutions to Vision Transformers Pytorch implementation of CvT: Introducing Convolutions to Vision Transformers Usage: img = torch

193 Jan 03, 2023

This code is part of the reproducibility package for the SANER 2022 paper "Generating Clarifying Questions for Query Refinement in Source Code Search".

Clarifying Questions for Query Refinement in Source Code Search This code is part of the reproducibility package for the SANER 2022 paper "Generating

0 Dec 04, 2021

Collection of Docker images for ML/DL and video processing projects

“Robust Lightweight Facial Expression Recognition Network with Label Distribution Training”, AAAI 2021.

Naszilla is a Python library for neural architecture search (NAS)

An Efficient Training Approach for Very Large Scale Face Recognition or F²C for simplicity.

A simplistic and efficient pure-python neural network library from Phys Whiz with CPU and GPU support.