Sign Language Transformers (CVPR'20)

This repo contains the training and evaluation code for the paper Sign Language Transformers: Sign Language Transformers: Joint End-to-end Sign Language Recognition and Translation.

This code is based on Joey NMT but modified to realize joint continuous sign language recognition and translation. For text-to-text translation experiments, you can use the original Joey NMT framework.

Requirements

Download the feature files using the data/download.sh script.
[Optional] Create a conda or python virtual environment.
Install required packages using the requirements.txt file.

pip install -r requirements.txt

Usage

python -m signjoey train configs/sign.yaml

! Note that the default data directory is ./data. If you download them to somewhere else, you need to update the data_path parameters in your config file.

ToDo:

Initial code release.
Release image features for Phoenix2014T.
Share extensive qualitative and quantitative results & config files to generate them.
(Nice to have) - Guide to set up conda environment and docker image.

Reference

Please cite the paper below if you use this code in your research:

@inproceedings{camgoz2020sign,
  author = {Necati Cihan Camgoz and Oscar Koller and Simon Hadfield and Richard Bowden},
  title = {Sign Language Transformers: Joint End-to-end Sign Language Recognition and Translation},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2020}
}

Acknowledgements

_{This work was funded by the SNSF Sinergia project "Scalable Multimodal Sign Language Technology for Sign Language Learning and Assessment" (SMILE) grant agreement number CRSII2 160811 and the European Union’s Horizon2020 research and innovation programme under grant agreement no. 762021 (Content4All). This work reflects only the author’s view and the Commission is not responsible for any use that may be made of the information it contains. We would also like to thank NVIDIA Corporation for their GPU grant.}

Sign Language Transformers (CVPR'20)

Related tags

Overview

Sign Language Transformers (CVPR'20)

Requirements

Usage

ToDo:

Reference

Acknowledgements

Owner

Necati Cihan Camgoz

BMN: Boundary-Matching Network

This is the pytorch code for the paper Curious Representation Learning for Embodied Intelligence.

Video Swin Transformer - PyTorch

ANEA: Distant Supervision for Low-Resource Named Entity Recognition

PyImpetus is a Markov Blanket based feature subset selection algorithm that considers features both separately and together as a group in order to provide not just the best set of features but also the best combination of features

Scribble-Supervised LiDAR Semantic Segmentation, CVPR 2022 (ORAL)

PyTorch implementation of "Image-to-Image Translation Using Conditional Adversarial Networks".

Defending against Model Stealing via Verifying Embedded External Features

Parsing, analyzing, and comparing source code across many languages

Code for ICML 2021 paper: How could Neural Networks understand Programs?

This is a demo app to be used in the video streaming applications

SalFBNet: Learning Pseudo-Saliency Distribution via Feedback Convolutional Networks

The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".

Swapping face using Face Mesh with TensorFlow Lite

The official codes for the ICCV2021 presentation "Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting"

Bridging Composite and Real: Towards End-to-end Deep Image Matting

Luminaire is a python package that provides ML driven solutions for monitoring time series data.

Auto-Encoding Score Distribution Regression for Action Quality Assessment

WeakVRD-Captioning - Implementation of paper Improving Image Captioning with Better Use of Caption

This repository contains several image-to-image translation models, whcih were tested for RGB to NIR image generation. The models are Pix2Pix, Pix2PixHD, CycleGAN and PointWise.