PyTorch implementation for Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Last update: Dec 19, 2022

Overview

Stochastic CSLR

This is the PyTorch implementation for the ECCV 2020 paper: Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Quick Start

1. Installation

pip install git+https://github.com/zheniu/stochastic-cslr

You also need to install sclite for evaluation; see step 2 for instructions.

2. Prepare the dataset

Download the RWTH-PHOENIX-2014 dataset.
Unzip it and obtain the path to phoenix-2014-multisigner/ folder for later use.
Install sclite for evaluation. See phoenix-2014-multisigner/evaluation/NIST-sclite_sctk-2.4.0-20091110-0958.tar.bz2 for details.
After installing sclite, add it to your PATH.
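Before moving on, it can help to confirm that the pieces from steps 1 and 2 are in place. The following is a minimal sketch, not part of the repository: the function name check_setup is hypothetical, and it assumes the installed package is importable as stochastic_cslr (the module name used under stochastic_cslr/runner.py) and that the sclite executable is named sclite.

```python
import importlib.util
import shutil
from pathlib import Path


def check_setup(data_root):
    """Return simple yes/no checks for the installation and dataset steps.

    `check_setup` is a hypothetical helper, not a function from the repository.
    """
    root = Path(data_root)
    return {
        # Assumes the package installs under the module name `stochastic_cslr`.
        "package_installed": importlib.util.find_spec("stochastic_cslr") is not None,
        # Assumes the evaluation binary is called `sclite` and must be on PATH.
        "sclite_on_path": shutil.which("sclite") is not None,
        "data_root_exists": root.is_dir(),
        # The evaluation/ subfolder is where the sclite archive is shipped.
        "evaluation_dir_exists": (root / "evaluation").is_dir(),
    }


if __name__ == "__main__":
    checks = check_setup("your_path_to/phoenix-2014-multisigner")
    for name, ok in checks.items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```

Any entry reported as MISSING points back to the corresponding step above.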

3. Run a quick test

You can use the script quick_test.py for a quick test.

python3 quick_test.py --data-root your_path_to/phoenix-2014-multisigner

By specifying the model type (--model sfl/dfl), the data split (--split dev/test), and whether to use a language model (--use-lm), you can get the following results:

Model	WER (dev)	sub/del/ins (dev)	WER (test)	sub/del/ins (test)
DFL	27.1	12.7/7.4/7.0	27.7	13.8/7.3/6.6
SFL	26.2	12.7/6.9/6.7	26.6	13.7/6.5/6.4
DFL + LM	25.6	11.5/9.2/4.9	26.4	12.4/9.3/4.7
SFL + LM	24.3	11.4/8.5/4.4	25.3	12.4/8.5/4.3

Note that these results differ slightly from those in the paper because a different random seed was used.
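For readers unfamiliar with the columns above: the word error rate (WER) is the number of substitutions, deletions and insertions divided by the number of reference glosses, so each WER entry is roughly the sum of its sub/del/ins entries (for example, 12.7 + 7.4 + 7.0 = 27.1 for DFL on dev). The sketch below illustrates the metric with a plain minimum edit-distance alignment. It is not the repository's evaluation code; the reported numbers come from sclite, whose alignment may differ, and the gloss sequences used here are made up for illustration.

```python
def wer_breakdown(reference, hypothesis):
    """Return (substitutions, deletions, insertions) of a minimum edit-distance alignment."""
    n, m = len(reference), len(hypothesis)
    # cost[i][j] = (total_edits, subs, dels, ins) for reference[:i] vs hypothesis[:j]
    cost = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = (0, 0, 0, 0)
    for i in range(1, n + 1):
        cost[i][0] = (i, 0, i, 0)  # delete every reference token
    for j in range(1, m + 1):
        cost[0][j] = (j, 0, 0, j)  # insert every hypothesis token
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            t, s, d, k = cost[i - 1][j - 1]
            if reference[i - 1] == hypothesis[j - 1]:
                diagonal = (t, s, d, k)  # match, no edit
            else:
                diagonal = (t + 1, s + 1, d, k)  # substitution
            t, s, d, k = cost[i - 1][j]
            deletion = (t + 1, s, d + 1, k)
            t, s, d, k = cost[i][j - 1]
            insertion = (t + 1, s, d, k + 1)
            # Tuples compare by total edits first, so the total is always minimal.
            cost[i][j] = min(diagonal, deletion, insertion)
    _, subs, dels, ins = cost[n][m]
    return subs, dels, ins


def wer_percent(reference, hypothesis):
    """WER in percent: (sub + del + ins) / number of reference tokens."""
    subs, dels, ins = wer_breakdown(reference, hypothesis)
    return 100.0 * (subs + dels + ins) / len(reference)


if __name__ == "__main__":
    # Hypothetical gloss sequences, for illustration only.
    ref = ["MORGEN", "REGEN", "NORD", "WIND"]
    hyp = ["MORGEN", "SONNE", "NORD"]
    subs, dels, ins = wer_breakdown(ref, hyp)
    print(f"sub={subs} del={dels} ins={ins} WER={wer_percent(ref, hyp):.1f}%")
    # prints: sub=1 del=1 ins=0 WER=50.0%
```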

You may also take a look at quick_test.py as it shows how to use the pretrained models.
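To reproduce every row of the table in one go, the options described above can be enumerated programmatically. This is a minimal sketch under two assumptions: that --use-lm is a boolean switch taking no value, and that quick_test.py is run from the repository root. The helper name build_quick_test_commands is hypothetical.

```python
import itertools
import shlex


def build_quick_test_commands(data_root):
    """Build one quick_test.py command line per (model, split, language model) combination."""
    commands = []
    for model, split, use_lm in itertools.product(["dfl", "sfl"], ["dev", "test"], [False, True]):
        cmd = [
            "python3", "quick_test.py",
            "--data-root", data_root,
            "--model", model,
            "--split", split,
        ]
        if use_lm:
            # Assumption: --use-lm is a flag without an argument.
            cmd.append("--use-lm")
        commands.append(cmd)
    return commands


if __name__ == "__main__":
    for cmd in build_quick_test_commands("your_path_to/phoenix-2014-multisigner"):
        # Print the commands; pass each list to subprocess.run(cmd, check=True) to execute it.
        print(shlex.join(cmd))
```

Printing the commands first makes it easy to inspect them before running anything that takes GPU time.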

4. Train your own model

The configuration files for deterministic and stochastic fine-grained labeling are located under config/. Training is driven by torchzq, a PyTorch experiment runner, which automatically reads the hyperparameters from the YAML file and passes them to stochastic_cslr/runner.py.

Before running, set data_root in the YAML configurations to the path of your phoenix-2014-multisigner/ folder.
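If you prefer to script this change rather than edit the files by hand, a small text substitution is enough. The sketch below assumes that data_root appears on its own line as a plain `data_root: value` entry; the actual layout of the files under config/ is not shown here, so check the result before training. The helper name set_data_root and the sample YAML text are hypothetical.

```python
import re

# Matches a line such as "data_root: some/path", keeping the key and its indentation.
_DATA_ROOT_LINE = re.compile(r"^([ \t]*data_root[ \t]*:[ \t]*).*$", flags=re.MULTILINE)


def set_data_root(yaml_text, new_root):
    """Return yaml_text with the value of every data_root entry replaced by new_root."""
    # A function replacement avoids backslash escapes in new_root being interpreted.
    new_text, count = _DATA_ROOT_LINE.subn(lambda m: m.group(1) + new_root, yaml_text)
    if count == 0:
        raise ValueError("no data_root entry found")
    return new_text


if __name__ == "__main__":
    # Hypothetical configuration text, for illustration only.
    sample = "data_root: data/phoenix\nbatch_size: 4\n"
    print(set_data_root(sample, "/datasets/phoenix-2014-multisigner"))
```

To apply it to a real file, read the file with pathlib.Path.read_text, pass the text through set_data_root, and write it back with write_text.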

Train (for instance, dfl):

tzq config/dfl-fp16.yml train

Test the trained model:

tzq config/dfl-fp16.yml test

Citation

You may cite this work as:

@inproceedings{niu2020stochastic,
  title={Stochastic Fine-Grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition},
  author={Niu, Zhe and Mak, Brian},
  booktitle={European Conference on Computer Vision},
  pages={172--186},
  year={2020},
  organization={Springer}
}

Owner

Zhe Niu
