Code for paper: Towards Tokenized Human Dynamics Representation

Last update: May 31, 2022

Overview

Video Tokneization

Codebase for video tokenization, based on our paper Towards Tokenized Human Dynamics Representation.

Prerequisites (tested under Python 3.8 and CUDA 11.1)

apt-get install ffmpeg  
pip install torch==1.8  
pip install torchvision  
pip install pytorch-lightning  
pip install pytorch-lightning-bolts  
pip install aniposelib wandb gym test-tube ffmpeg-python matplotlib easydict scikit-learn

Data Preparation

Make a directory besides this repo and name it aistplusplus
Download from AIST++ website until it looks like

├── annotations
│   ├── cameras
│   ├── ignore_list.txt
│   ├── keypoints2d
│   ├── keypoints3d
│   ├── motions
│   └── splits
└── video_list.txt

How to run

Write one configuration file, e.g., configs/tan.yaml.
Run python pretrain.py --cfg configs/tan.yaml with GPU, which will create a folder under logs for this run. Folder name specified by the NAME in configuration file. Then run python cluster.py --cfg configs/tan.yaml (CPU-only) and check results in demo.ipynb.
Or you can download and unzip my training result into logs folder from here.

Code for paper: Towards Tokenized Human Dynamics Representation

Related tags

Overview

Video Tokneization

Prerequisites (tested under Python 3.8 and CUDA 11.1)

Data Preparation

How to run

Owner

Kenneth Li

Systemic Evolutionary Chemical Space Exploration for Drug Discovery

Hand Gesture Volume Control is AIML based project which uses image processing to control the volume of your Computer.

PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Code release for "Conditional Adversarial Domain Adaptation" (NIPS 2018)

Patch Rotation: A Self-Supervised Auxiliary Task for Robustness and Accuracy of Supervised Models

MRQy is a quality assurance and checking tool for quantitative assessment of magnetic resonance imaging (MRI) data.

Official NumPy Implementation of Deep Networks from the Principle of Rate Reduction (2021)

A machine learning library for spiking neural networks. Supports training with both torch and jax pipelines, and deployment to neuromorphic hardware.

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Cross-Image Region Mining with Region Prototypical Network for Weakly Supervised Segmentation

Learning High-Speed Flight in the Wild

A program to recognize fruits on pictures or videos using yolov5

[CVPR 2021] Released code for Counterfactual Zero-Shot and Open-Set Visual Recognition

An implementation of the efficient attention module.

PyTorch implementation of DeepUME: Learning the Universal Manifold Embedding for Robust Point Cloud Registration (BMVC 2021)

Optical machine for senses sensing using speckle and deep learning

Curved Projection Reformation

Visualizer using audio and semantic analysis to explore BigGAN (Brock et al., 2018) latent space.

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

Gluon CV Toolkit