VFormer
A PyTorch library for Vision Transformers
Getting Started
Read the contributing guidelines in CONTRIBUTING.rst to learn how to start contributing.
Read the contributing guidelines in CONTRIBUTING.rst to learn how to start contributing.
viz module.We can replace _Projection class with a one-liner if-else statement.
Should we replace it with if-else or should we keep the current implementation?
cc: @NeelayS @aditya-agrawal-30502 @alvanli
During the last PR (#45), I had to revert back because of compatibility issues
In this PR I have added some docstrings and Minor changes like changing variable names
this PR is the same as - #48 with edited title :)
@NeelayS
AbsolutePositionEmbedding class was structured specifically for the PVT, but we can use it in other models too if we re-structure it properly, it should also support sinusoidal position embedding or a separate class for Sinusoidal embedding also works.
enhancementThis paper describes how promoting smoothness with a recently proposed sharpness-aware optimizer substantially improves the performance of ViTs.
It would be good to have an implementation of this optimizer in our library. It would fit in the functional module.
I have added some fixes for page breaks in #86.
Still, we need to enhance the docs for visualization methods.
We can include the license/copyright disclaimer for visualization methods in our license or have a separate file.
Additionally, we can add the sample outputs from these methods into the doc.
CC : @NeelayS @aditya-agrawal-30502 @alvanli
documentation enhancement good first issuepaper - https://arxiv.org/abs/2202.09741 code- https://github.com/Visual-Attention-Network/VAN-Classification https://github.com/Visual-Attention-Network/VAN-Segmentation
Paper implementationFirst release of VFormer!
UFPR-ADMR-v2 Dataset The UFPR-ADMRv2 dataset contains 5,000 dial meter images obtained on-site by employees of the Energy Company of Paraná (Copel), w
DIML Created by Wenliang Zhao*, Yongming Rao*, Ziyi Wang, Jiwen Lu, Jie Zhou This repository contains PyTorch implementation for paper Towards Interpr
CIFS This repository provides codes for CIFS (ICML 2021). CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Sel
Building Shazam from scratch In this repository we tried to implement a simplified copy of the Shazam application able to tell you the name of a song
WSDEC This is the official repo for our NeurIPS paper Weakly Supervised Dense Event Captioning in Videos. Description Repo directories ./: global conf
Strongly local p-norm-cut algorithms for semi-supervised learning and local graph clustering
YOLOv5-Lite:lighter, faster and easier to deploy Perform a series of ablation experiments on yolov5 to make it lighter (smaller Flops, lower memory, a
GeoMol: Torsional Geometric Generation of Molecular 3D Conformer Ensembles This repository contains a method to generate 3D conformer ensembles direct
Proof of Concept GnuCash Webinterface This may one day be a something truly great. Milestones [ ] Browse accounts and view transactions [ ] Record sim
mcmeta Processed, version controlled history of Minecraft's generated data and assets Repository structure Each of the following branches has a commit
PreSumm This code is for EMNLP 2019 paper Text Summarization with Pretrained Encoders Updates Jan 22 2020: Now you can Summarize Raw Text Input!. Swit
AlexNet training on ImageNet LSVRC 2012 This repository contains an implementation of AlexNet convolutional neural network and its training and testin
VQGAN-CLIP-Docker About Zero-Shot Text-to-Image Generation VQGAN+CLIP Dockerized This is a stripped and minimal dependency repository for running loca
ACME-compatible implementation of Arm26 from Mujoco This repository contains a customized implementation of Mujoco's Arm26 model, that can be used wit
Massive-scale Decoding for Text Generation using Lattices Jiacheng Xu, Greg Durrett TL;DR: a new search algorithm to construct lattices encoding many
Machine-in-the-Loop Rewriting for Creative Image Captioning Data Annotated sources of data used in the paper: Data Source URL Mohammed et al. Link Gor
yellowstone-camping UPDATE: yellowstone-camping is being expanded and renamed to camply. The updated tool now interfaces with the Recreation.gov API a
Description This script helps the user to update Linux and Mac systems through the terminal. All the user has to install some requirements and then ru
Joint Entity and Relation Extraction with Set Prediction Networks Source code for Joint Entity and Relation Extraction with Set Prediction Networks. W
Depth Clustering This is a fast and robust algorithm to segment point clouds taken with Velodyne sensor into objects. It works with all available Velo