Libtorch yolov3 deepsort

Last update: Dec 13, 2022

Overview

This project is for my undergraduate thesis at Tsinghua University.

There are four modules in the project:

Detection: YOLOv3
Tracking: SORT and DeepSORT
Processing: Run detection and tracking, then display and save the results (a compressed video, a few snapshots for each target)
GUI: Display the results

YOLOv3

A Libtorch implementation of the YOLO v3 object detection algorithm, written in modern C++.

The code is based on walktree's implementation.

The config file in .\models can be found at Darknet.

SORT

I also integrated SORT for tracking.

A similar Python project is here; it is also rewritten from the most starred version and SORT.
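
SORT associates new detections with existing tracks mainly by bounding-box overlap. As a rough illustration only (this is not the project's actual code), the intersection-over-union (IoU) score between two boxes could be computed as below. The Box struct and the names boxArea and iou are hypothetical and chosen just for this sketch; the Kalman filter prediction and the assignment step that SORT also uses are left out.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical axis-aligned box in pixel coordinates (not a type from this project).
struct Box {
    float x1, y1, x2, y2;  // top-left (x1, y1) and bottom-right (x2, y2) corners
};

// Area of a box; returns 0 when the corners describe an empty region.
inline float boxArea(const Box& b) {
    return std::max(0.0f, b.x2 - b.x1) * std::max(0.0f, b.y2 - b.y1);
}

// Intersection over union of two well-formed boxes, in the range [0, 1].
inline float iou(const Box& a, const Box& b) {
    // Precondition assumed by this sketch: corners are ordered.
    assert(a.x2 >= a.x1 && a.y2 >= a.y1);
    assert(b.x2 >= b.x1 && b.y2 >= b.y1);

    // The overlap region; it is empty when the boxes do not intersect.
    const Box inter{std::max(a.x1, b.x1), std::max(a.y1, b.y1),
                    std::min(a.x2, b.x2), std::min(a.y2, b.y2)};
    const float interArea = boxArea(inter);
    const float unionArea = boxArea(a) + boxArea(b) - interArea;

    // Two degenerate boxes have no union; treat that as no overlap.
    return unionArea > 0.0f ? interArea / unionArea : 0.0f;
}
```

A tracker would typically compute this score for every (track, detection) pair and only accept matches above some threshold; the threshold value is a tuning choice and is not specified here.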

DeepSORT

Recently I reimplemented DeepSORT, which employs another CNN for re-identification (re-id). It seems to give better results, but it also slows the program down a bit. A PyTorch version is also available at ZQPei, thanks!
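
DeepSORT compares appearance feature vectors produced by the re-id CNN, commonly with a cosine distance. The sketch below shows that comparison on plain std::vector<float> embeddings. It is an assumption-laden illustration, not the code used in this project: the function name cosineDistance is hypothetical, and the real implementation presumably works on Libtorch tensors instead.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Cosine distance between two feature vectors: 0 for the same direction,
// 1 for orthogonal vectors, 2 for opposite directions.
inline float cosineDistance(const std::vector<float>& a,
                            const std::vector<float>& b) {
    // Precondition assumed by this sketch: equal, non-zero length.
    assert(a.size() == b.size() && !a.empty());

    float dot = 0.0f, normA = 0.0f, normB = 0.0f;
    for (std::size_t i = 0; i < a.size(); ++i) {
        dot += a[i] * b[i];
        normA += a[i] * a[i];
        normB += b[i] * b[i];
    }

    // A zero vector has no direction; report it as maximally uninformative.
    if (normA == 0.0f || normB == 0.0f) {
        return 1.0f;
    }
    return 1.0f - dot / (std::sqrt(normA) * std::sqrt(normB));
}
```

A smaller distance means the two crops look more alike, so a track can be matched to the detection whose embedding is closest to the ones stored for that track.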

Performance

Currently, on a GTX 1060 6G, it uses about 1 GB of RAM and runs at 37 FPS.

The test video is TownCentreXVID.avi.

GUI

With wxWidgets, I developed the GUI module for visualization of results.

Previously I used Dear ImGui. However, I do not think it suits my purpose.

Pre-trained network

This project uses pre-trained network weights from others.

How to build

This project requires LibTorch, OpenCV, wxWidgets and CMake to build.

LibTorch can be easily integrated with CMake, but there are a lot of strange things...

On Ubuntu 16.04, I use apt install to install the others. Everything is fine. On Windows 10 + Visual Studio 2017, I use the latest stable version of the others from their official websites.

Snapshots

Here is some intermediate output from the detection and tracking modules:

Here is a snapshot of the processing module:

Here is a snapshot of the GUI module:

Owner

Xu Wei