Efficient Two-Step Networks for Temporal Action Segmentation (Neurocomputing 2021)

Last update: Apr 16, 2022

Overview

Efficient Two-Step Networks for Temporal Action Segmentation

This repository provides a PyTorch implementation of the paper Efficient Two-Step Networks for Temporal Action Segmentation.

Requirements

* Python 3.8.5
* PyTorch 1.8.1

You can install the required packages with requirements.txt:
pip install -r requirements.txt

Datasets

Download the data provided by MS-TCN, which contains the I3D features (without fine-tuning) and the ground truth labels for the 3 datasets (~30GB).
Extract it so that the data folder is in the same directory as train.py. Note that the directory tree and the commands below refer to this folder as dataset.

Directory structure

├── config
│   ├── 50salads
│   ├── breakfast
│   └── gtea
├── csv
│   ├── 50salads
│   ├── breakfast
│   └── gtea
├── dataset
│   ├── 50salads/...
│   ├── breakfast/...
│   └── gtea
│       ├── features/
│       ├── groundTruth/
│       ├── splits/
│       └── mapping.txt
├── libs
├── result
├── utils
├── requirements.txt
├── train.py
├── eval.py
└── README.md
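
Before running the setup scripts, it can help to check that the extracted data matches the tree above. The following is a minimal sketch, not part of the repository: the helper name missing_entries is hypothetical, and it assumes that 50salads and breakfast contain the same four entries that the tree lists for gtea.

```python
from pathlib import Path

# Names taken from the directory tree above. Assumption: every dataset
# has the same four entries that the tree shows for gtea.
DATASETS = ("50salads", "breakfast", "gtea")
PER_DATASET = ("features", "groundTruth", "splits", "mapping.txt")


def missing_entries(root="."):
    """Return the expected dataset paths that do not exist under root."""
    root = Path(root)
    expected = [
        root / "dataset" / name / entry
        for name in DATASETS
        for entry in PER_DATASET
    ]
    return [str(path) for path in expected if not path.exists()]


if __name__ == "__main__":
    # Run from the directory that contains train.py.
    for path in missing_entries():
        print("missing:", path)
```

If the script prints nothing, every expected folder and mapping file was found.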

Training and Testing of ETSN

Setting

First, convert the ground truth files into NumPy arrays.

python utils/generate_gt_array.py ./dataset
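
To illustrate what this conversion involves, here is a rough sketch. It is not the code in utils/generate_gt_array.py, which may differ. It assumes that each ground truth file holds one action label per frame and that mapping.txt holds one "index label" pair per line. The function names load_mapping and labels_to_array are hypothetical.

```python
import numpy as np


def load_mapping(mapping_lines):
    """Parse 'index label' lines into a {label: index} dict (assumed format)."""
    mapping = {}
    for line in mapping_lines:
        if line.strip():
            index, label = line.split()
            mapping[label] = int(index)
    return mapping


def labels_to_array(label_lines, mapping):
    """Convert one label per frame into an int64 array of class ids."""
    ids = [mapping[line.strip()] for line in label_lines if line.strip()]
    return np.array(ids, dtype=np.int64)


if __name__ == "__main__":
    # Toy labels for illustration only; real names come from mapping.txt.
    mapping = load_mapping(["0 take", "1 open", "2 background"])
    print(labels_to_array(["take", "take", "open"], mapping))
```

Storing the labels as integer arrays avoids re-parsing the text files every time the data loader reads a video.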

Then run the script below to generate the CSV files for the data loader.

python utils/builda_dataset.py ./dataset

Training

You can train a model by changing the settings of the configuration file.

python train.py ./config/xxx/xxx/config.yaml

Evaluation

After training, you can evaluate the performance of the resulting model.

python eval.py ./result/xxx/xxx/config.yaml test

We also provide a trained ETSN model on Google Drive. Extract it so that the result folder is in the same directory as train.py.

Average cross-validation results

python utils/average_cv_results.py [result_dir]
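
Conceptually, this step takes the mean of each metric over the cross-validation splits. The sketch below shows that idea only. It does not reproduce utils/average_cv_results.py, and both the function name and the metric key "acc" are hypothetical.

```python
from statistics import mean


def average_cv_results(split_results):
    """Average each metric over a list of per-split result dicts."""
    if not split_results:
        raise ValueError("no split results given")
    metrics = split_results[0].keys()
    return {m: mean(result[m] for result in split_results) for m in metrics}


if __name__ == "__main__":
    # Hypothetical per-split scores; real values come from eval.py output.
    print(average_cv_results([{"acc": 80.0}, {"acc": 90.0}]))
```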

Citation

If you find our code useful, please cite our paper.

@article{LI2021373,
author = {Yunheng Li and Zhuben Dong and Kaiyuan Liu and Lin Feng and Lianyu Hu and Jie Zhu and Li Xu and Yuhan Wang and Shenglan Liu},
journal = {Neurocomputing},
title = {Efficient Two-Step Networks for Temporal Action Segmentation},
year = {2021},
volume = {454},
pages = {373-381},
issn = {0925-2312},
doi = {https://doi.org/10.1016/j.neucom.2021.04.121},
url = {https://www.sciencedirect.com/science/article/pii/S0925231221006998},
}

Contact

For any questions, please raise an issue or contact the authors.

Acknowledgement

We thank the MS-TCN authors for the extracted I3D features, backbone network, and evaluation code.

We also thank Yuchi Ishikawa for sharing the re-implementation of MS-TCN in PyTorch.
