Colar: Effective and Efficient Online Action Detection by Consulting Exemplars, CVPR 2022.

Last update: Dec 13, 2022

This repository is the official implementation of Colar. In this work, we study online action detection and develop an effective and efficient exemplar-consultation mechanism. The paper is available on arXiv.

Requirements

To install requirements:

conda env create -n env_name -f environment.yaml

Before running the code, activate this conda environment (conda activate env_name).

Data Preparation

a. Download pre-extracted features from baiduyun (code:cola)

Make sure the data is organized as follows:

data
└── thumos14
    ├── Exemplar_Kinetics
    ├── thumos_all_feature_test_Kinetics.pickle
    ├── thumos_all_feature_val_Kinetics.pickle
    ├── thumos_test_anno.pickle
    ├── thumos_val_anno.pickle
    └── data_info.json
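
Before training, it can help to confirm that the downloaded files match the layout above. The following is a minimal sketch, not part of the repository: the function name missing_entries and the default root data/thumos14 (relative to the repository root) are assumptions made for illustration, while the entry names are copied from the tree above.

```python
from pathlib import Path

# Entries copied from the directory tree above, relative to data/thumos14.
EXPECTED = [
    "Exemplar_Kinetics",
    "thumos_all_feature_test_Kinetics.pickle",
    "thumos_all_feature_val_Kinetics.pickle",
    "thumos_test_anno.pickle",
    "thumos_val_anno.pickle",
    "data_info.json",
]


def missing_entries(root="data/thumos14"):
    """Return the expected files or folders that do not exist under root.

    Hypothetical helper: the default root assumes the script is run
    from the repository root.
    """
    base = Path(root)
    return [name for name in EXPECTED if not (base / name).exists()]


if __name__ == "__main__":
    missing = missing_entries()
    if missing:
        print("Missing under data/thumos14: " + ", ".join(missing))
    else:
        print("Data layout looks complete.")
```

The check only looks at names, not at file contents, so a truncated download would still pass.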

Train

a. Config

Adjust the configuration in ./misc/init.py to match your machine.

b. Train

python main.py

Inference

a. Download the pre-trained model from baiduyun (code:cola) and put the weight file in the checkpoint folder.

The pre-trained model achieves 66.9% mAP on THUMOS14.

b. Test

python inference.py
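
For readers unfamiliar with the metric, the mAP quoted in this section is presumably the frame-level mean average precision commonly used for online action detection. The sketch below illustrates that general metric under this assumption. It is not the repository's evaluation code, the function names are hypothetical, and dataset-specific details such as which classes or frames are excluded are not modeled.

```python
def average_precision(scores, labels):
    """Average precision for one class.

    scores: per-frame confidence for the class.
    labels: per-frame ground truth (1 if the class is present, else 0).
    Frames are ranked by score; AP is the mean of the precision values
    measured at each positive frame.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    precisions = []
    for rank, i in enumerate(order, start=1):
        if labels[i]:
            hits += 1
            precisions.append(hits / rank)
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(scores_per_class, labels_per_class):
    """Mean of the per-class average precisions."""
    aps = [
        average_precision(s, l)
        for s, l in zip(scores_per_class, labels_per_class)
    ]
    return sum(aps) / len(aps)


if __name__ == "__main__":
    # Toy example: two classes scored on four frames each.
    scores = [[0.9, 0.8, 0.7, 0.6], [0.9, 0.8, 0.2, 0.1]]
    labels = [[1, 0, 1, 0], [1, 1, 0, 0]]
    print(mean_average_precision(scores, labels))
```

In the toy example the first class has positives at ranks 1 and 3, giving an AP of (1/1 + 2/3) / 2, and the second class is ranked perfectly, giving an AP of 1.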

Citation

@inproceedings{yang2022colar,
  title={Colar: Effective and Efficient Online Action Detection by Consulting Exemplars},
  author={Yang, Le and Han, Junwei and Zhang, Dingwen},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2022}
}

Related Projects

BackTAL: Background-Click Supervision for Temporal Action Localization.

Contact

For any discussions, please contact [email protected].

Owner

LeYang
