Shuwa Gesture Toolkit is a framework that detects and classifies arbitrary gestures in short videos

Last update: Dec 22, 2022

Related tags

Overview

Shuwa Gesture Toolkit

Shuwa (手話) is Japanese for "Sign Language"

Shuwa Gesture Toolkit is a framework that detects and classifies arbitrary gestures in short videos. It is particularly useful for recognizing basic words in sign language. We collected thousands of example videos of people signing Japanese Sign Language (JSL) and Hong Kong Sign Language (HKSL) to train the baseline model for recognizing gestures and facial expressions.

The Shuwa Gesture Toolkit also allows you to train new gestures, so it can be trained to recognize any sign from any sign language in the world.

[Web Demo]

How it works

By combining pose, face, and hand detector results over multiple frames we can acquire a fairly requirement for sign language understanding includes body movement, facial movement, and hand gesture. After that we use DD-Net as a recognitor to predict sign features represented in the 832D vector. Finally using use K-Nearest Neighbor classification to output the class prediction.

All related models listed below.

PoseNet: Pose detector model.
FaceMesh : Face keypoints detector model.
HandLandmarks : Hand keypoints detector model.
DD-Net : Skeleton-based action recognition model.

Installation

For MacOS user
Install python 3.7 from official python.org for tkinter support.
Install dependencies
```
pip3 install -r requirements.txt 
```

Run Python Demo

python3 webcam_demo_knn.py

Use record mode to add more sign.
Play mode.

Run Detector demo

You can try each detector individually by using these scripts.

FaceMesh

python3 face_landmark\webcam_demo_face.py

PoseNet

python3 posenet\webcam_demo_pose.py

HandLandmarks

python3 hand_landmark\webcam_demo_hand.py

Deploy on the Web using Tensorflow.js

Instructions here

Train classifier from scratch

You can add a custom sign by using Record mode in the full demo program.
But if you want to train the classifier from scratch you can check out the process here

Shuwa Gesture Toolkit is a framework that detects and classifies arbitrary gestures in short videos

Related tags

Overview

Shuwa Gesture Toolkit

How it works

Installation

Run Python Demo

Run Detector demo

Deploy on the Web using Tensorflow.js

Train classifier from scratch

Owner

Google

[ICLR 2021] Is Attention Better Than Matrix Decomposition?

Tensorflow 2.x implementation of Panoramic BlitzNet for object detection and semantic segmentation on indoor panoramic images.

Distance correlation and related E-statistics in Python

Pytorch implementation for reproducing StackGAN_v2 results in the paper StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021

PyTorch implementation of "VRT: A Video Restoration Transformer"

Implementation for paper MLP-Mixer: An all-MLP Architecture for Vision

Variational autoencoder for anime face reconstruction

K-Means Clustering and Hierarchical Clustering Unsupervised Learning Solution in Python3.

Code for paper PairRE: Knowledge Graph Embeddings via Paired Relation Vectors.

Libraries, tools and tasks created and used at DeepMind Robotics.

Crawl & visualize ICLR papers and reviews

This project intends to use SVM supervised learning to determine whether or not an individual is diabetic given certain attributes.

This repository contains the entire code for our work "Two-Timescale End-to-End Learning for Channel Acquisition and Hybrid Precoding"

Wordplay, an artificial Intelligence based crossword puzzle solver.

DIP-football - A football video analyse system based on Yolov5, alphapose, Qt6

This repository contains the official code of the paper Equivariant Subgraph Aggregation Networks (ICLR 2022)

[CVPR'21] FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space

This repo includes the supplementary of our paper "CEMENT: Incomplete Multi-View Weak-Label Learning with Long-Tailed Labels"

SwinIR: Image Restoration Using Swin Transformer