Yolox-bytetrack-sample - Python sample of MOT (Multiple Object Tracking) using YOLOX and ByteTrack

Last update: Nov 09, 2022

Overview

yolox-bytetrack-sample

YOLOXとByteTrackを用いたMOT(Multiple Object Tracking)のPythonサンプルです。
YOLOXはYOLOX-ONNX-TFLite-Sampleで、ONNXに変換したモデルを使用しています。

08-01.yolox-bytetrack.mp4

Requirement

OpenCV 3.4.2 or later
onnxruntime 1.5.2 or later
cython_bbox 0.1.3 or later

※onnxruntime-gpuはonnxruntimeでも動作しますが、推論時間がかかるためGPUを推奨します
※Windowsでcython_bbox のインストールが失敗する場合は、GitHubからのインストールをお試しください(2022/02/13時点)

pip install -e git+https://github.com/samson-wang/cython_bbox.git#egg=cython-bbox

Demo

デモの実行方法は以下です。

python sample.py

--device
カメラデバイス番号の指定
デフォルト：0
--movie
動画ファイルの指定 ※指定時はカメラデバイスより優先
デフォルト：指定なし
--width
カメラキャプチャ時の横幅
デフォルト：960
--height
カメラキャプチャ時の縦幅
デフォルト：540

YOLOXパラメータ

--yolox_model
ロードするモデルの格納パス
デフォルト：model/yolox_nano.onnx
--input_shape
モデルの入力サイズ
デフォルト：416,416
--score_th
クラス判別の閾値
デフォルト：0.3
--nms_th
NMSの閾値
デフォルト：0.45
--nms_score_th
NMSのスコア閾値
デフォルト：0.1
--with_p6
Large P6モデルを使用するか否か
デフォルト：指定なし

ByteTrackパラメータ

--track_thresh
デフォルト：0.5
--track_buffer
デフォルト：30
--match_thresh
デフォルト：0.8
--min_box_area
デフォルト：10
--mot20
デフォルト：指定なし

※パラメータ詳細はByteTrackを参照ください。

Reference

Author

高橋かずひと(https://twitter.com/KzhtTkhs)

License

yolox-bytetrack-sample is under MIT License.

License(Movie)

サンプル動画はNHKクリエイティブ・ライブラリーのケニア共和国キツイ町並み(4) ふかんショットを使用しています。

Yolox-bytetrack-sample - Python sample of MOT (Multiple Object Tracking) using YOLOX and ByteTrack

Related tags

Overview

yolox-bytetrack-sample

Requirement

Demo

Reference

Author

License

License(Movie)

Owner

KazuhitoTakahashi

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Research on controller area network Intrusion Detection Systems

Rocket-recycling with Reinforcement Learning

Tools for the Cleveland State Human Motion and Control Lab

Pytorch-diffusion - A basic PyTorch implementation of 'Denoising Diffusion Probabilistic Models'

Groceries ARL: Association Rules (Birliktelik Kuralı)

Fast and Simple Neural Vocoder, the Multiband RNNMS

Amazing-Python-Scripts - 🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.

Baseline powergrid model for NY

Grammar Induction using a Template Tree Approach

Video-Music Transformer

Self-Supervised Monocular 3D Face Reconstruction by Occlusion-Aware Multi-view Geometry Consistency[ECCV 2020]

Open source repository for the code accompanying the paper 'Non-Rigid Neural Radiance Fields Reconstruction and Novel View Synthesis of a Deforming Scene from Monocular Video'.

A crash course in six episodes for software developers who want to become machine learning practitioners.

A CNN implementation using only numpy. Supports multidimensional images, stride, etc.

An essential implementation of BYOL in PyTorch + PyTorch Lightning

Synthetic Scene Text from 3D Engines

Repository of continual learning papers

Meta Representation Transformation for Low-resource Cross-lingual Learning

PyTorch code for our ECCV 2018 paper "Image Super-Resolution Using Very Deep Residual Channel Attention Networks"