A PyTorch implementation of SlowFast based on the ICCV 2019 paper "SlowFast Networks for Video Recognition"

Last update: Dec 23, 2022

Overview

SlowFast

A PyTorch implementation of SlowFast based on the ICCV 2019 paper "SlowFast Networks for Video Recognition".

Requirements

conda install pytorch=1.9.1 torchvision cudatoolkit -c pytorch

PyTorchVideo

pip install pytorchvideo
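
After installing, a quick check that the packages are visible to the interpreter can save a failed training run. The snippet below is a minimal sketch, not part of this repo: the helper name `missing_packages` is hypothetical, and it only checks that the top-level modules `torch`, `torchvision` and `pytorchvideo` can be found, not that their versions match.

```python
from importlib.util import find_spec

# Top-level import names of the packages installed above.
REQUIRED = ("torch", "torchvision", "pytorchvideo")


def missing_packages(names=REQUIRED):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    print("all requirements found" if not missing else f"missing: {missing}")
```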

Dataset

The Kinetics-400 dataset is used in this repo; you can download it from the official website. The data directory structure is as follows:

data
├── train
│   ├── abseiling
│   │   ├── _4YTwq0-73Y_000044_000054.mp4
│   │   └── ...
│   ├── archery
│   │   └── same structure as abseiling
│   └── ...
└── test
    └── same structure as train
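
To make the layout concrete, here is a minimal sketch of how the tree above could be turned into (video, label) pairs. It is an illustration under the assumption that every class is a sub-directory holding `.mp4` files; the helper names `list_videos` and `class_to_index` are hypothetical and are not taken from this repo's code.

```python
from pathlib import Path


def list_videos(split_dir):
    """Return sorted (video_path, class_name) pairs for one split, e.g. data/train."""
    samples = []
    for class_dir in sorted(Path(split_dir).iterdir()):
        if not class_dir.is_dir():
            continue  # skip stray files next to the class directories
        for video in sorted(class_dir.glob("*.mp4")):
            samples.append((video, class_dir.name))
    return samples


def class_to_index(samples):
    """Map each class name to an integer label, in alphabetical order."""
    classes = sorted({label for _, label in samples})
    return {name: idx for idx, name in enumerate(classes)}
```

Calling `list_videos("data/train")` on a correctly prepared dataset should yield one entry per video, with the parent directory name as its class.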

Usage

Train Model

python train.py --batch_size 16
optional arguments:
--data_root                   Datasets root path [default value is 'data']
--batch_size                  Number of videos in each mini-batch [default value is 8]
--epochs                      Number of epochs to train the model [default value is 10]
--save_root                   Result saved root path [default value is 'result']
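
For readers who want to see how these options fit together, the following is a rough sketch of an argument parser with the flags and defaults listed above. The actual `train.py` is not shown here, so the function name `build_train_parser` and the argument types (`str`, `int`) are assumptions.

```python
import argparse


def build_train_parser():
    """Parser mirroring the documented training options (types are assumed)."""
    parser = argparse.ArgumentParser(description="Train Model")
    parser.add_argument("--data_root", type=str, default="data",
                        help="Datasets root path")
    parser.add_argument("--batch_size", type=int, default=8,
                        help="Number of videos in each mini-batch")
    parser.add_argument("--epochs", type=int, default=10,
                        help="Number of epochs to train the model")
    parser.add_argument("--save_root", type=str, default="result",
                        help="Result saved root path")
    return parser


# Equivalent to: python train.py --batch_size 16
args = build_train_parser().parse_args(["--batch_size", "16"])
print(args.batch_size, args.epochs)  # 16 10
```

Options that are not passed on the command line keep their defaults, so the command above trains for 10 epochs on `data` and saves to `result`.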

Test Model

python test.py --video_path data/test/beatboxing/5s_gFWie1Ys_000069_000079.mp4
optional arguments:
--model_path                  Model path [default value is 'result/slow_fast.pth']
--video_path                  Video path [default value is 'data/test/applauding/_V-dzjftmCQ_000023_000033.mp4']
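
Because the example video paths follow the directory layout from the Dataset section, the expected class can be read straight from the path, which is handy for checking a prediction by eye. This is a small sketch under that assumption; `label_from_path` is a hypothetical helper, not part of `test.py`.

```python
from pathlib import PurePosixPath


def label_from_path(video_path):
    """Return the class name, i.e. the directory that directly contains the video."""
    return PurePosixPath(video_path).parent.name


print(label_from_path("data/test/beatboxing/5s_gFWie1Ys_000069_000079.mp4"))  # beatboxing
```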

Owner

Hao Ren
