VFormer
A PyTorch library for Vision Transformers
Getting Started
Read the contributing guidelines in CONTRIBUTING.rst to learn how to start contributing.
Read the contributing guidelines in CONTRIBUTING.rst to learn how to start contributing.
viz module.We can replace _Projection class with a one-liner if-else statement.
Should we replace it with if-else or should we keep the current implementation?
cc: @NeelayS @aditya-agrawal-30502 @alvanli
During the last PR (#45), I had to revert back because of compatibility issues
In this PR I have added some docstrings and Minor changes like changing variable names
this PR is the same as - #48 with edited title :)
@NeelayS
AbsolutePositionEmbedding class was structured specifically for the PVT, but we can use it in other models too if we re-structure it properly, it should also support sinusoidal position embedding or a separate class for Sinusoidal embedding also works.
enhancementThis paper describes how promoting smoothness with a recently proposed sharpness-aware optimizer substantially improves the performance of ViTs.
It would be good to have an implementation of this optimizer in our library. It would fit in the functional module.
I have added some fixes for page breaks in #86.
Still, we need to enhance the docs for visualization methods.
We can include the license/copyright disclaimer for visualization methods in our license or have a separate file.
Additionally, we can add the sample outputs from these methods into the doc.
CC : @NeelayS @aditya-agrawal-30502 @alvanli
documentation enhancement good first issuepaper - https://arxiv.org/abs/2202.09741 code- https://github.com/Visual-Attention-Network/VAN-Classification https://github.com/Visual-Attention-Network/VAN-Segmentation
Paper implementationFirst release of VFormer!
Sokoban Bot Introduction to AI assignment 1 HCM University of Technology, term 211 Abstract This is basically a solver for Sokoban game using Breadth-
SelfText Beyond Polygon: Unconstrained Text Detection with Box Supervisionand Dynamic Self-Training Introduction This is a PyTorch implementation of "
Object Detection using YOLO from PyImageSearch By applying object detection, youโll not only be able to determine what is in an image, but also where
Lidar-Segementation An implementation on "Curved-Voxel Clustering for Accurate Segmentation of 3D LiDAR Point Clouds with Real-Time Performance" from
topography ๐ Contains different approaches to modeling terrain and topographic-style maps in python Features Inverse Distance Weighting (IDW) A given
MDM A Tensorflow implementation of the Mnemonic Descent Method. Mnemonic Descent Method: A recurrent process applied for end-to-end face alignment G.
In this project, two programs can help you take full agvantage of time on the model training with a remote server, which can push notification to your phone about the information during model trainin
TorchRL Disclaimer This library is not officially released yet and is subject to change. The features are available before an official release so that
UMPNet: Universal Manipulation Policy Network for Articulated Objects Zhenjia Xu, Zhanpeng He, Shuran Song Columbia University Robotics and Automation
iGibson: A Simulation Environment to train Robots in Large Realistic Interactive Scenes iGibson is a simulation environment providing fast visual rend
PEDPF Parameter Efficient Deep Probabilistic Forecasting (PEDPF) is a repository containing code to run experiments for several deep learning based pr
Code and Experiments for ACL-IJCNLP 2021 Paper Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering.
TBSG: Tree-based Search Graph for Approximate Nearest Neighbor Search. TBSG is a graph-based algorithm for ANNS based on Cover Tree, which is also an
Cross-Document Coreference Resolution This repository contains code and models for end-to-end cross-document coreference resolution, as decribed in ou
Open nsfw model This repo contains code for running Not Suitable for Work (NSFW) classification deep neural network Caffe models. Please refer our blo
Introduction This repository is for X-Linear Attention Networks for Image Captioning (CVPR 2020). The original paper can be found here. Please cite wi
ๅฎๆถ้ขๆฟไธ็็ญพๅฐ็ ไธไธช่ฟ่กๅจ ๐๐ฅ๐๐๐๐๐ ๆ ๐ช๐ข๐ง๐ ๐ฅ๐จ๐ง๐ ็ญๅฎๆถ้ขๆฟ็็ญพๅฐ้กน็ฎ ๐๐ฅ๐๐๐๐๐ ๐ช๐ข๐ง๐ ๐ฅ๐จ๐ง๐ ็นๅซๅฃฐๆ ๆฌไปๅบๅๅธ็่ๆฌๅๅ ถไธญๆถๅ็ไปปไฝ่งฃ้ๅ่งฃๅฏๅๆ่ๆฌ๏ผไป ็จไบๆต่ฏๅๅญฆไน ็ ็ฉถ๏ผ็ฆๆญข็จไบๅไธ็จ้๏ผไธ่ฝไฟ่ฏๅ ถๅ
AICITY2021_Track2_DMT The 1st place solution of track2 (Vehicle Re-Identification) in the NVIDIA AI City Challenge at CVPR 2021 Workshop. Introduction
JEREX: "Joint Entity-Level Relation Extractor" PyTorch code for JEREX: "Joint Entity-Level Relation Extractor". For a description of the model and exp
Network Compression via Central Filter Environments The code has been tested in the following environments: Python 3.8 PyTorch 1.8.1 cuda 10.2 torchsu