A demo that performs pose estimation with MediaPipe and displays Tokyo 2020 Olympics-style pictograms

Last update: Dec 26, 2022

Overview

Tokyo2020-Pictogram-using-MediaPipe

This demo performs pose estimation with MediaPipe and displays pictograms in the style of the Tokyo 2020 Olympics.

Tokyo2020Pictgram02.mp4
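
To illustrate how pose landmarks can become pictogram limbs, here is a minimal sketch. It relies on two facts about MediaPipe Pose: it returns 33 landmarks, and each landmark's x and y are normalized by the image width and height. The helper names (`to_pixel`, `limb_lines`) and the choice of limb segments are assumptions made for illustration and are not taken from this repository's `main.py`.

```python
# Index pairs follow the MediaPipe Pose landmark numbering
# (11/12 shoulders, 13/14 elbows, 15/16 wrists, 23/24 hips,
# 25/26 knees, 27/28 ankles). Which segments the demo actually
# draws is an assumption.
LIMB_SEGMENTS = [
    (11, 13), (13, 15),  # left upper arm, left forearm
    (12, 14), (14, 16),  # right upper arm, right forearm
    (23, 25), (25, 27),  # left thigh, left shin
    (24, 26), (26, 28),  # right thigh, right shin
]


def to_pixel(landmark_xy, image_width, image_height):
    """Convert a normalized (x, y) pair to pixel coordinates, clamped to the image."""
    x, y = landmark_xy
    px = min(max(int(x * image_width), 0), image_width - 1)
    py = min(max(int(y * image_height), 0), image_height - 1)
    return px, py


def limb_lines(landmarks, image_width, image_height):
    """Return one (start, end) pixel pair per limb segment.

    `landmarks` is a list of 33 normalized (x, y) pairs.
    """
    return [
        (
            to_pixel(landmarks[a], image_width, image_height),
            to_pixel(landmarks[b], image_width, image_height),
        )
        for a, b in LIMB_SEGMENTS
    ]
```

Each returned pair could then be passed to a drawing call such as OpenCV's `cv2.line` with a thick stroke to get the rounded, pictogram-like limbs.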

Requirement

mediapipe 0.8.6 or later
OpenCV 3.4.2 or later
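
The version requirements above can be checked before running the demo. The following is a minimal sketch using only the standard library. The distribution names `mediapipe` and `opencv-python` are assumptions (OpenCV may be installed under a different name, for example `opencv-contrib-python`), and the helper functions are hypothetical, not part of this repository.

```python
from importlib import metadata


def parse_version(text):
    """Turn '0.8.10' into (0, 8, 10), keeping the leading digits of each part."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first part with no leading digits
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Compare numerically, so '0.8.10' counts as newer than '0.8.6'."""
    return parse_version(installed) >= parse_version(minimum)


# Assumed distribution names mapped to the minimum versions listed above.
MINIMUMS = {"mediapipe": "0.8.6", "opencv-python": "3.4.2"}


def check_requirements(minimums=MINIMUMS):
    """Return a {package: status} report for each requirement."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "not installed"
            continue
        ok = meets_minimum(installed, minimum)
        report[name] = f"{installed} ({'ok' if ok else 'older than ' + minimum})"
    return report


if __name__ == "__main__":
    for name, status in check_requirements().items():
        print(f"{name}: {status}")
```

Comparing tuples of integers rather than raw strings matters here, because a plain string comparison would rank `0.8.10` below `0.8.6`.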

Demo

Start the demo with the following command.
Press the ESC key to exit the program.

python main.py

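--device
Camera device number
Default: 0
--width
Width of the camera capture
Default: 640
--height
Height of the camera capture
Default: 360
--static_image_mode
Static image mode
Default: not specified
--model_complexity
Model complexity (0: Lite, 1: Full, 2: Heavy)
Note: see Pose Estimation Quality for the performance differences
Default: 1
--min_detection_confidence
Detection confidence threshold
Default: 0.5
--min_tracking_confidence
Tracking confidence threshold
Default: 0.5
--rev_color
Swap the background color and the pictogram color
Default: not specified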
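
For reference, the options above could be declared with `argparse` roughly as follows. This is a sketch reconstructed from the option list only. The argument types, the `choices` restriction on `--model_complexity`, and the function name `get_args` are assumptions, and the actual `main.py` may differ.

```python
import argparse


def get_args(argv=None):
    """Parse the demo's command-line options (defaults taken from the list above)."""
    parser = argparse.ArgumentParser()

    # Camera settings
    parser.add_argument("--device", type=int, default=0)
    parser.add_argument("--width", type=int, default=640)
    parser.add_argument("--height", type=int, default=360)

    # MediaPipe Pose settings
    parser.add_argument("--static_image_mode", action="store_true")
    parser.add_argument("--model_complexity", type=int, default=1,
                        choices=[0, 1, 2])  # 0: Lite, 1: Full, 2: Heavy
    parser.add_argument("--min_detection_confidence", type=float, default=0.5)
    parser.add_argument("--min_tracking_confidence", type=float, default=0.5)

    # Display settings: flags are off unless given on the command line
    parser.add_argument("--rev_color", action="store_true")

    return parser.parse_args(argv)
```

With the documented flags, an invocation such as `python main.py --device 1 --model_complexity 2 --rev_color` would select the second camera, the Heavy model, and inverted colors.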

Using Docker

On Ubuntu, you can also use Docker + docker-compose instead of installing MediaPipe on the host machine.

First, edit docker-compose.yml to match your environment.
For example, to use video0 as the video device, edit it as follows.

    # Edit here
    devices:
      # - "/dev/video0:/dev/video0"
      # - "/dev/video1:/dev/video0"
-     - "/dev/video2:/dev/video0"
+     - "/dev/video0:/dev/video0"

Next, build the Docker image.

docker-compose build

Finally, start the Docker container.

docker-compose up

Author

高橋かずひと (Kazuhito Takahashi, https://twitter.com/KzhtTkhs)

License

Tokyo2020-Pictogram-using-MediaPipe is licensed under the Apache-2.0 License.
