Implementation of QuickDraw - an online game developed by Google, combined with AirGesture - a simple gesture recognition application

Last update: Dec 18, 2022

Overview

QuickDraw - AirGesture

Introduction

Here is my python source code for QuickDraw - an online game developed by google, combined with AirGesture - a simple gesture recognition application. By using my code, you could:

Run an app which you could draw in front of a camera with your hand (If you use laptop, your webcam will be used by default)
Run an app which you could draw on a canvas

Camera app

In order to use this application, you only need to use your hand to draw in front of a camera/webcam. The middle point of your hand will be detected and highlighted by a red dot. When you are ready for drawing, you need to press space button to start drawing. When you want to stop drawing, press space button again. Below is the demo by running the sript camera_app.py:

Camera app demo

Drawing app

The script and demo will be released soon

Categories:

The table below shows 18 categories my model used:


apple	book	bowtie	candle
cloud	cup	door	envelope
eyeglasses	hammer	hat	ice cream
leaf	scissors	star	t-shirt
pants	tree

Trained models

You could find my trained model at data/trained_models/

Docker

For being convenient, I provide Dockerfile which could be used for running training phase as well as launching application

Assume that docker image's name is qd_ag. You already clone this repository and cd into it.

Build:

sudo docker build --network=host -t qd_ag .

Run:

If you want to launch the application, first you need to run xhost + to turn off access control (if you only want to run the training, you could skip this step). Then you run:

sudo docker run --gpus all -it --rm --volume="path/to/your/data:/workspace/code/data -e DISPLAY=$DISPLAY --env="QT_X11_NO_MITSHM=1" -v /tmp/.X11-unix:/tmp/.X11-unix --device=/dev/video0:/dev/video0 qd_ag

Inside docker container, you could run train.py or camera_app.py scripts for training or launching app respectively. By default, the camera_app.py script will automatically generate a video capturing what you have done during the session, at data/output.mp4

Experiments:

For each class, I split the data to training and test sets with ratio of 8:2. The training/test loss/accuracy curves for the experiment are shown below:

Implementation of QuickDraw - an online game developed by Google, combined with AirGesture - a simple gesture recognition application

Related tags

Overview

QuickDraw - AirGesture

Introduction

Camera app

Drawing app

Categories:

Trained models

Docker

Experiments:

Owner

Viet Nguyen

Official Repsoitory for "Activate or Not: Learning Customized Activation." [CVPR 2021]

Official code for the publication "HyFactor: Hydrogen-count labelled graph-based defactorization Autoencoder".

SpeechBrain is an open-source and all-in-one speech toolkit based on PyTorch.

Source code for "Pack Together: Entity and Relation Extraction with Levitated Marker"

🔥RandLA-Net in Tensorflow (CVPR 2020, Oral & IEEE TPAMI 2021)

Code for the paper "Relation of the Relations: A New Formalization of the Relation Extraction Problem"

Mask-invariant Face Recognition through Template-level Knowledge Distillation

A basic implementation of Layer-wise Relevance Propagation (LRP) in PyTorch.

Background-Click Supervision for Temporal Action Localization

Does MAML Only Work via Feature Re-use? A Data Set Centric Perspective

ARAE-Tensorflow for Discrete Sequences (Adversarially Regularized Autoencoder)

This is the repository of the NeurIPS 2021 paper "Curriculum Disentangled Recommendation withNoisy Multi-feedback"

Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, CVPR 2021

Serving PyTorch 1.0 Models as a Web Server in C++

Using image super resolution models with vapoursynth and speeding them up with TensorRT

New approach to benchmark VQA models

Bilinear attention networks for visual question answering

https://arxiv.org/abs/2102.11005

Teaching end to end workflow of deep learning

Learned image compression