Yoga Pose Identification and Icon Matching

Project Goal

Detect yoga poses performed by a user and overlay a corresponding icon image on the video. Running the main script starts the video stream with automatic pose detection.

Part 1: Pose Detection

I use the 33 body landmarks provided by MediaPipe to measure joint angles, then identify each yoga pose from its key joint angles. For example, in the star pose, the elbow flexion (measured from the shoulder, elbow, and wrist landmarks) is below 20 degrees, and the shoulder flexion (measured from the elbow, shoulder, and opposite shoulder landmarks) is also below 20 degrees.
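The angle check described above can be sketched as follows. This is a minimal illustration rather than the project's actual code: the function names are hypothetical, landmarks are assumed to be (x, y) pairs in image coordinates, and "flexion" is assumed to mean the deviation of the joint from a straight line (180 degrees).

```python
import numpy as np


def joint_angle(a, b, c):
    """Return the angle in degrees at vertex b formed by the points a-b-c."""
    a, b, c = (np.asarray(p, dtype=float) for p in (a, b, c))
    v1, v2 = a - b, c - b
    cos_theta = np.dot(v1, v2) / (np.linalg.norm(v1) * np.linalg.norm(v2))
    # Clip to guard against rounding slightly outside [-1, 1].
    return float(np.degrees(np.arccos(np.clip(cos_theta, -1.0, 1.0))))


def flexion(a, b, c):
    """Assumed definition: how far the joint at b is from fully straight."""
    return 180.0 - joint_angle(a, b, c)


def is_star_arm(shoulder, elbow, wrist, opposite_shoulder, threshold=20.0):
    """Check one arm against the star pose rule: both flexions below threshold."""
    elbow_ok = flexion(shoulder, elbow, wrist) < threshold
    shoulder_ok = flexion(elbow, shoulder, opposite_shoulder) < threshold
    return elbow_ok and shoulder_ok


# Hypothetical normalized landmark positions for an arm held straight out.
straight = is_star_arm((0.6, 0.3), (0.75, 0.3), (0.9, 0.3), (0.4, 0.3))
# Same arm with the forearm bent upward at the elbow.
bent = is_star_arm((0.6, 0.3), (0.75, 0.3), (0.75, 0.1), (0.4, 0.3))
```

A full detector would presumably run this check on both arms and add similar rules for the legs, with a separate set of rules for each pose.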

Part 2: Icon Image Transformation

To transform the icon image that will be overlaid on the user, I first preprocess the icon image, then apply an affine transform. To preprocess the icon, I resize it to roughly the same height as the user, a measurement also calculated from MediaPipe's landmarks. I then add a border to the icon image so that its image array has the same dimensions as the video stream frames. These steps help make the affine transform more effective. I select three key pose landmarks for each pose, then find three key points on the icon that should match these landmarks; three point pairs are enough to define an affine transform. For example, I chose to match the nose and ankles of the person with the top tip and bottom two tips of the star.
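As a rough sketch of the padding and point-matching steps, the example below uses NumPy only. The function names and all coordinates are hypothetical, and the icon is assumed to sit centered on a black canvas. In OpenCV, `cv2.getAffineTransform` computes the same 2x3 matrix from three point pairs and `cv2.warpAffine` applies it to an image.

```python
import numpy as np


def pad_to_frame(icon, frame_h, frame_w):
    """Center the icon on a black canvas with the frame's height and width."""
    h, w = icon.shape[:2]
    if h > frame_h or w > frame_w:
        raise ValueError("icon must be resized to fit inside the frame first")
    top, left = (frame_h - h) // 2, (frame_w - w) // 2
    canvas = np.zeros((frame_h, frame_w) + icon.shape[2:], dtype=icon.dtype)
    canvas[top:top + h, left:left + w] = icon
    return canvas


def affine_from_points(src_pts, dst_pts):
    """Solve for the 2x3 matrix M such that M @ [x, y, 1] = [x', y']."""
    src = np.asarray(src_pts, dtype=float)
    dst = np.asarray(dst_pts, dtype=float)
    a = np.hstack([src, np.ones((3, 1))])  # one row [x, y, 1] per source point
    return np.linalg.solve(a, dst).T       # fails if the points are collinear


# Hypothetical pixel coordinates: star tips on the padded icon ...
icon_pts = [(320, 40), (200, 440), (440, 440)]
# ... and the nose and two ankles detected in the video frame.
body_pts = [(300, 100), (220, 400), (380, 400)]

matrix = affine_from_points(icon_pts, body_pts)
mapped_tip = matrix @ np.array([320.0, 40.0, 1.0])  # should land on the nose

padded = pad_to_frame(np.ones((2, 2, 3), dtype=np.uint8), 4, 6)
```

Padding the icon to the frame size first means the icon points and the body landmarks share one coordinate system, so the warped icon can be combined with the frame pixel by pixel.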

Part 3: Image Overlay

I overlaid just the icon pixels (the icon background is ignored) by summing 0.5 of the icon pixel value with 0.5 of the video frame value, resulting in a semi-transparent overlay of just the icon.
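A minimal sketch of this masked 50/50 blend is shown below. It assumes the icon background is pure black (all channels zero) and that the transformed icon already has the same shape as the frame; the function name `overlay_icon` is hypothetical.

```python
import numpy as np


def overlay_icon(frame, icon, alpha=0.5):
    """Blend the icon into the frame only where the icon is not background."""
    if frame.shape != icon.shape:
        raise ValueError("icon and frame must have the same shape")
    mask = icon.any(axis=2)  # True where any channel is non-zero
    out = frame.copy()
    blended = (alpha * icon[mask].astype(np.float32)
               + (1.0 - alpha) * frame[mask].astype(np.float32))
    out[mask] = blended.astype(frame.dtype)
    return out


# Tiny synthetic example: a gray frame and an icon with a single colored pixel.
frame = np.full((4, 4, 3), 100, dtype=np.uint8)
icon = np.zeros((4, 4, 3), dtype=np.uint8)
icon[1, 1] = (200, 0, 0)

result = overlay_icon(frame, icon)
```

Pixels outside the mask are copied from the frame unchanged, which is what keeps the icon background from darkening the video.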

OpenCV, MediaPipe Pose Estimation, Affine Transform for Icon Overlay

Results

Star Pose

Tree Pose

Chair Pose

Owner

Anna Garverick
