3D ResNet Video Classification accelerated by TensorRT

Last update: Nov 21, 2022

Overview

Activity Recognition TensorRT

Perform video classification using 3D ResNets trained on Kinetics-400 dataset and accelerated with TensorRT

P.S Click on the gif to watch the full-length video!

Index

Activity Recogntion TensorRT
Index
TensorRT Installation
Python Dependencies
Clone the repository
Download Pretrained Models
Running the code
Citations

TensorRT Installation

Assuming you have CUDA already installed, go ahead and download TensorRT from here.

Follow instructions of installing the system binaries and python package for tensorrt here.

Python dependencies

Install the necessary python dependencies by running the following command -

pip3 install -r requirements.txt

Clone the repository

This is a straightforward step, however, if you are new to git recommend glancing threw the steps.

First, install git

sudo apt install git

Next, clone the repository

# Using HTTPS
https://github.com/aj-ames/Activity-Recognition-TensorRT.git
# Using SSH
[email protected]:aj-ames/Activity-Recognition-TensorRT.git

Download Pretrained Models

Download models from google-drive and place them in the current directory.

Running the code

The code supports a number of command line arguments. Use help to see all supported arguments

➜ python3 action_recognition_tensorrt.py --help
usage: action_recognition_tensorrt.py [-h] [--stream STREAM] [--model MODEL] [--fp16] [--frameskip FRAMESKIP]

Object Detection using YOLOv4 and OpenCV4

optional arguments:
  -h, --help            show this help message and exit
  --stream STREAM       Path to use video stream
  --model MODEL         Path to model to use
  --fp16                To enable fp16 precision
  --frameskip FRAMESKIP
                        Number of frames to skip

Run the script this way:

# Video
python3 action_recognition_tensorrt.py --stream /path/to/video --model resnext-101-kinetics.onnx --fp16 --frameskip 3

# Webcam
python3 action_recognition_tensorrt.py --stream webcam --model resnext-101-kinetics.onnx --fp16 --frameskip 3

Citations

@article{hara3dcnns,
  author={Kensho Hara and Hirokatsu Kataoka and Yutaka Satoh},
  title={Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?},
  journal={arXiv preprint},
  volume={arXiv:1711.09577},
  year={2017},
}

3D ResNet Video Classification accelerated by TensorRT

Related tags

Overview

Activity Recognition TensorRT

Index

TensorRT Installation

Python dependencies

Clone the repository

Download Pretrained Models

Running the code

Citations

Owner

Akash James

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

Code for Blind Image Decomposition (BID) and Blind Image Decomposition network (BIDeN).

Action Segmentation Evaluation

🗣️ Microsoft Edge TTS for Home Assistant, no need for app_key

[ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning

An intuitive library to extract features from time series

✨风纪委员会自动投票脚本，利用Github Action帮你进行裁决操作（为了让其他风纪委员有案件可判，本程序从中午12点才开始运行，有需要请自己修改运行时间）

PassAPI is a password generator in hash format and fully developed in Python, with the aim of teaching how to handle and build

Personals scripts using ageitgey/face_recognition

Stacked Generative Adversarial Networks

Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxiang Wang, Han Zhao, Bo Li.

Code for the Paper "Diffusion Models for Handwriting Generation"

Hierarchical User Intent Graph Network for Multimedia Recommendation

Hybrid Neural Fusion for Full-frame Video Stabilization

Official implementation of the paper 'High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network' in CVPR 2021

Log4j JNDI inj. vuln scanner

Using VideoBERT to tackle video prediction

PyTorch implementation of DeepLab v2 on COCO-Stuff / PASCAL VOC

Solution of Kaggle competition: Sartorius - Cell Instance Segmentation

A Nim frontend for pytorch, aiming to be mostly auto-generated and internally using ATen.