Official implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21).

Last update: Dec 26, 2022

Related tags

Deep Learning ACTION-Net

Overview

ACTION-Net

Official implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21).

Getting Started

EgoGesture data folder structure

|-frames
|---Subject01
|------Scene1
|---------Color1
|------------rgb1
|---------------000001.jpg
......
|-labels
|---Subject01
|------Scene1
|---------Group1.csv
......

Something-Something V2

|-frames
|---1
|------000001.jpg
|------000002.jpg
|------000003.jpg
......

Jester

|-frames
|---1
|------000001.jpg
|------000002.jpg
|------000003.jpg
......

Requirements

Provided in the action.Dockerfile

Annotation files

Annotation files are at this link. Please follow the annotation files to construct the frame path.

Usage

sh train_ego_8f.sh 0,1,2,3 if you use four gpus

Acknowledgment

Our codes are built based on previous repos TSN, TSM and TEA

Pretrained models

Currently, we do not provide the pretrained models since we reconstruct the structure and rename our modules of ACTION for public release. It should be able to get the similar performance indicated in the paper using the codes provided above.

Official implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21).

Related tags

Overview

ACTION-Net

Getting Started

Requirements

Annotation files

Usage

Acknowledgment

Pretrained models

Owner

V-Sense

Live training loss plot in Jupyter Notebook for Keras, PyTorch and others

Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

A library for uncertainty quantification based on PyTorch

LSSY量化交易系统

「PyTorch Implementation of AnimeGANv2」を用いて、生成した顔画像を元の画像に上書きするデモ

Official implementation of Rethinking Graph Neural Architecture Search from Message-passing (CVPR2021)

Mind the Trade-off: Debiasing NLU Models without Degrading the In-distribution Performance

MAg: a simple learning-based patient-level aggregation method for detecting microsatellite instability from whole-slide images

Physics-informed convolutional-recurrent neural networks for solving spatiotemporal PDEs

Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021

Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data

LSTM Neural Networks for Spectroscopic Studies of Type Ia Supernovae

Lucid Sonic Dreams syncs GAN-generated visuals to music.

A 10000+ hours dataset for Chinese speech recognition

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding

This project uses ViT to perform image classification tasks on DATA set CIFAR10.

AdaFocus (ICCV 2021) Adaptive Focus for Efficient Video Recognition

基于tensorflow 2.x的图片识别工具集

Trading Gym is an open source project for the development of reinforcement learning algorithms in the context of trading.