BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Last update: Dec 28, 2022

Related tags

Overview

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Abhinanda R. Punnakkal*, Arjun Chandrasekaran*, Nikos Athanasiou, Alejandra Quiros-Ramirez, Michael J. Black. * denotes equal contribution

Project Website | Paper | Video | Poster

BABEL is a large dataset with language labels describing the actions being performed in mocap sequences. BABEL labels about 43 hours of mocap sequences from AMASS [1] with action labels. Sequences have action labels at two possible levels of abstraction:

Sequence labels which describe the overall action in the sequence
Frame labels which describe all actions in every frame of the sequence. Each frame label is precisely aligned with the duration of the corresponding action in the mocap sequence, and multiple actions can overlap.

To download the BABEL action labels, visit our 'Data' page. You can download the mocap sequences from AMASS.

Tutorials

We release some helper code in Jupyter notebooks to load the BABEL dataset, visualize mocap sequences and their action labels, search BABEL for sequences containing specific actions, etc.

See notebooks/ for more details.

Action Recognition

We provide features, training and inference code, and pre-trained checkpoints for 3D skeleton-based action recognition.

Please see action_recognition/ for more details.

Acknowledgements

The notebooks in this repo are inspired by the those provided by AMASS. The Action Recognition code is based on the 2s-AGCN implementation.

References

[1] Mahmood, Naureen, et al. "AMASS: Archive of motion capture as surface shapes." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019.

License

Software Copyright License for non-commercial scientific research purposes. Please read carefully the terms and conditions and any accompanying documentation before you download and/or use the AMASS dataset, and software, (the "Model & Software"). By downloading and/or using the Model & Software (including downloading, cloning, installing, and any other use of this GitHub repository), you acknowledge that you have read these terms and conditions, understand them, and agree to be bound by them. If you do not agree with these terms and conditions, you must not download and/or use the Model & Software. Any infringement of the terms of this agreement will automatically terminate your rights under this License.

Contact

The code in this repository is developed by Abhinanda Punnakkal and Arjun Chandrasekaran.

If you have any questions you can contact us at [email protected].

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Related tags

Overview

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Tutorials

Action Recognition

Acknowledgements

References

License

Contact

Owner

Code to accompany our paper "Continual Learning Through Synaptic Intelligence" ICML 2017

Distilled coarse part of LoFTR adapted for compatibility with TensorRT and embedded divices

[CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

Realtime micro-expression recognition using OpenCV and PyTorch

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

PyTorch Code for "Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning"

LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference

Adversarial-autoencoders - Tensorflow implementation of Adversarial Autoencoders

🍅🍅🍅YOLOv5-Lite: lighter, faster and easier to deploy. Evolved from yolov5 and the size of model is only 1.7M (int8) and 3.3M (fp16). It can reach 10+ FPS on the Raspberry Pi 4B when the input size is 320×320~

A code implementation of AC-GC: Activation Compression with Guaranteed Convergence, in NeurIPS 2021.

MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts (ICLR 2022)

Deep Illuminator is a data augmentation tool designed for image relighting. It can be used to easily and efficiently generate a wide range of illumination variants of a single image.

Lama-cleaner: Image inpainting tool powered by LaMa

Attentional Focus Modulates Automatic Finger‑tapping Movements

YOLOX-RMPOLY

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Generative Art Using Neural Visual Grammars and Dual Encoders

Pytorch ImageNet1k Loader with Bounding Boxes.

A Haskell kernel for IPython.

Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data based on Pytorch Framework