OadTR

Code for our ICCV2021 paper: "OadTR: Online Action Detection with Transformers" ["Paper"]

Update

July 28, 2021: Our Paper "OadTR: Online Action Detection with Transformers" was accepted by ICCV2021. At the same time, we released THUMOS14-Kinetics feature.

Dependencies

pytorch==1.6.0
json
numpy
tensorboard-logger
torchvision==0.7.0

Prepare

Unzip the anno file "./data/anno_thumos.zip"
Download the feature THUMOS14-Anet feature (Note: HDD and TVSeries are available by contacting the authors of the datasets and signing agreements due to the copyrights. You can use this Repo to extract features.)

Training

python main.py --num_layers 3 --decoder_layers 5 --enc_layers 64 --output_dir models/en_3_decoder_5_lr_drop_1

Validation

python main.py --num_layers 3 --decoder_layers 5 --enc_layers 64 --output_dir models/en_3_decoder_5_lr_drop_1 --eval --resume models/en_3_decoder_5_lr_drop_1/checkpoint000{}.pth

Citing OadTR

Please cite our paper in your publications if it helps your research:

@article{wang2021oadtr,
  title={OadTR: Online Action Detection with Transformers},
  author={Wang, Xiang and Zhang, Shiwei and Qing, Zhiwu and Shao, Yuanjie and Zuo, Zhengrong and Gao, Changxin and Sang, Nong},
  journal={arXiv preprint arXiv:2106.11149},
  year={2021}
}

Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers".

Related tags

Overview

OadTR

Update

Dependencies

Prepare

Training

Validation

Citing OadTR

Owner

UAV-Networks-Routing is a Python simulator for experimenting routing algorithms and mac protocols on unmanned aerial vehicle networks.

Control-Raspberry-Pi-Robot-using-Hand-Gestures - A 4WD Robot car based on Raspberry Pi that controlled by hand gestures(using openCV and mediapipe)

Meta Representation Transformation for Low-resource Cross-lingual Learning

Deploy pytorch classification model using Flask and Streamlit

PyTorch implementation of our ICCV 2021 paper, Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents.

Real-Time Multi-Contact Model Predictive Control via ADMM

Computer Vision is an elective course of MSAI, SCSE, NTU, Singapore

OBG-FCN - implementation of 'Object Boundary Guided Semantic Segmentation'

Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021

Analysis of rationale selection in neural rationale models

Neural Logic Inductive Learning

prior-based-losses-for-medical-image-segmentation

Code repo for "Cross-Scale Internal Graph Neural Network for Image Super-Resolution" (NeurIPS'20)

Learning hierarchical attention for weakly-supervised chest X-ray abnormality localization and diagnosis

DSL for matching Python ASTs

Code for the preprint "Well-classified Examples are Underestimated in Classification with Deep Neural Networks"

Revisting Open World Object Detection

Trainable Bilateral Filter Layer (PyTorch)

This git repo contains the implementation of my ML project on Heart Disease Prediction

Implementation of the algorithm shown in the article "Modelo de Predicción de Éxito de Canciones Basado en Descriptores de Audio"