A sign language detection project using MediaPipe landmark detection and TensorFlow LSTMs

Last update: Feb 06, 2022

Overview

sign-language-detection

A sign language detection project using MediaPipe landmark detection and a TensorFlow LSTM. The project is built for a vocabulary of 3 words, but more can be added by collecting data for the new words.

Vocabulary

Open
to
Work
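
As a rough illustration of how this vocabulary could map to training labels, the sketch below one-hot encodes the three words. The names VOCABULARY, LABEL_MAP, and one_hot are hypothetical and not taken from the notebook. Under this assumption, adding a word would mean appending it to the list and collecting data for it.

```python
# Hypothetical label encoding for the three-word vocabulary (not from the notebook).
VOCABULARY = ["Open", "to", "Work"]

# Map each word to an integer class index.
LABEL_MAP = {word: index for index, word in enumerate(VOCABULARY)}


def one_hot(word, vocabulary=VOCABULARY):
    """Return a one-hot list for `word`, usable as a target for a softmax output."""
    if word not in vocabulary:
        raise ValueError(f"{word!r} is not in the vocabulary")
    return [1.0 if candidate == word else 0.0 for candidate in vocabulary]
```

Passing a longer list as the second argument shows how the encoding grows with the vocabulary.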

Output

Disclaimer

Colab doesn't detect the webcam, so it can't be used for MediaPipe detection or for collecting the dataset through a webcam. Most of that was done locally, and training and inference with TensorFlow were then performed on Colab.

You can uncomment the commented-out part if you wish to do all of that locally. In my case, MediaPipe and TensorFlow clashed on the ARM-based M1 Mac.

The notebook follows Nicholas Renotte's approach to sign language detection, with a number of tweaks to suit my use case 🙂

Tweaks:

Input and output in the form of videos, to work with Colab.
Remove face landmarks, as they end up just being noise.
Use tanh activation, as it works much better with LSTMs than ReLU.
Colors and cosmetics.
Disclaimer at bottom.
Different threshold value for inference.
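
Two of these tweaks can be sketched in a few lines. The code below is an illustration, not the notebook's code. It assumes MediaPipe Holistic-style results (attributes pose_landmarks, left_hand_landmarks, and right_hand_landmarks, each None when nothing is detected), builds a per-frame feature vector that skips the face landmarks, and applies a confidence threshold at inference. The helper names and the 0.8 threshold are hypothetical; the source does not state the value it uses.

```python
from types import SimpleNamespace

# Landmark counts per MediaPipe Holistic component.
POSE_LANDMARKS = 33  # each with x, y, z, visibility
HAND_LANDMARKS = 21  # each with x, y, z
FEATURES_PER_FRAME = POSE_LANDMARKS * 4 + 2 * HAND_LANDMARKS * 3  # 258


def _flatten(landmark_list, count, fields):
    """Flatten one landmark list, or return zeros if it was not detected."""
    if landmark_list is None:
        return [0.0] * (count * len(fields))
    return [getattr(lm, f) for lm in landmark_list.landmark for f in fields]


def extract_keypoints(results):
    """Build one feature vector from pose and hand landmarks, skipping the face."""
    pose = _flatten(results.pose_landmarks, POSE_LANDMARKS,
                    ("x", "y", "z", "visibility"))
    left = _flatten(results.left_hand_landmarks, HAND_LANDMARKS, ("x", "y", "z"))
    right = _flatten(results.right_hand_landmarks, HAND_LANDMARKS, ("x", "y", "z"))
    return pose + left + right


def decode_prediction(probabilities, vocabulary, threshold=0.8):
    """Return the most likely word, or None if its probability is below threshold.

    The default threshold is an assumed value for illustration only.
    """
    best = max(range(len(probabilities)), key=probabilities.__getitem__)
    return vocabulary[best] if probabilities[best] >= threshold else None


# Stand-in for a MediaPipe result in which only the pose was detected.
fake_pose = SimpleNamespace(
    landmark=[SimpleNamespace(x=0.1, y=0.2, z=0.3, visibility=0.9)] * POSE_LANDMARKS
)
fake_results = SimpleNamespace(
    pose_landmarks=fake_pose,
    left_hand_landmarks=None,
    right_hand_landmarks=None,
)
features = extract_keypoints(fake_results)
```

Filling undetected hands with zeros keeps every frame the same length, which a fixed-shape LSTM input needs. Returning None below the threshold lets the caller skip low-confidence frames instead of showing a guess.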


Owner

Hashim
