a project for 3D multi-object tracking

Last update: Jan 04, 2023

Related tags

Deep Learning 3D-Multi-Object-Tracker

Overview

3D Multi-Object Tracker

This project is developed for tracking multiple objects in 3D scene. The visualization code is from here.

Features

Fast: currently, the codes can achieve 700 FPS using only CPU (not include detection and data op), can perform tracking on all kitti val sequence in several seconds.
Support both online and global implementation. The overall framework of design is shown below:

Kitti Results

Results on the Kitti tracking val seq [1,6,8,10,12,13,14,15,16,18,19] using second-iou and point-rcnn detections. We followed the HOTA metric, and tuned the parameters in this code by firstly considering the HOTA performance.

Detector	HOTA	DetA	AssA	DetRe	DetPr	AssRe	AssPr	LocA	MOTA
second-iou	78.787	74.482	83.611	80.665	84.72	89.022	88.575	88.63	85.129
point-rcnn	78.91	75.814	82.406	83.489	82.185	87.209	87.586	87.308	88.412

Prepare data

You can download the Kitti tracking pose data from here, and you can find the point-rcnn and second-iou detections from here.

To run this code, you should organize Kitti tracking dataset as below:

# Kitti Tracking Dataset       
└── kitti_tracking
       ├── testing 
       |      ├──calib
       |      |    ├──0000.txt
       |      |    ├──....txt
       |      |    └──0028.txt
       |      ├──image_02
       |      |    ├──0000
       |      |    ├──....
       |      |    └──0028
       |      ├──pose
       |      |    ├──0000
       |      |    |    └──pose.txt
       |      |    ├──....
       |      |    └──0028
       |      |         └──pose.txt
       |      ├──label_02
       |      |    ├──0000.txt
       |      |    ├──....txt
       |      |    └──0028.txt
       |      └──velodyne
       |           ├──0000
       |           ├──....
       |           └──0028      
       └── training # the structure is same as testing set
              ├──calib
              ├──image_02
              ├──pose
              ├──label_02
              └──velodyne

Detections

└── point-rcnn
       ├── training
       |      ├──0000
       |      |    ├──000001.txt
       |      |    ├──....txt
       |      |    └──000153.txt
       |      ├──...
       |      └──0020
       └──testing

Requirements

python3
numpy
opencv
yaml

Quick start

Please modify the dataset path and detections path in the yaml file to your own path.
Then run python3 kitti_3DMOT.py config/point_rcnn_mot.yaml
The results are automatically saved to evaluation\results\sha_key\data, and evaluated by HOTA metrics.

Notes

The evaluation codes are copied from Kitti.

a project for 3D multi-object tracking

Related tags

Overview

3D Multi-Object Tracker

Features

Kitti Results

Prepare data

Requirements

Quick start

Notes

Owner

Model of an AI powered sign language interpreter.

CoMoGAN: continuous model-guided image-to-image translation. CVPR 2021 oral.

AlphaBot2 Pi Core software for interfacing with the various components.

Code for "OctField: Hierarchical Implicit Functions for 3D Modeling (NeurIPS 2021)"

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,

[ICCV 2021] Deep Hough Voting for Robust Global Registration

The code for our paper submitted to RAL/IROS 2022: OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition.

Github Traffic Insights as Prometheus metrics.

Calibrated Hyperspectral Image Reconstruction via Graph-based Self-Tuning Network.

Final project code: Implementing MAE with downscaled encoders and datasets, for ESE546 FA21 at University of Pennsylvania

Official code repository for "Exploring Neural Models for Query-Focused Summarization"

Tweesent-back - Tweesent backend uses fastAPI as the web framework

Task Transformer Network for Joint MRI Reconstruction and Super-Resolution (MICCAI 2021)

Pytorch implementation of few-shot semantic image synthesis

CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View

NeuroGen: activation optimized image synthesis for discovery neuroscience

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

Attentive Implicit Representation Networks (AIR-Nets)

HeartRate detector with ArduinoandPython - Use Arduino and Python create a heartrate detector.

Reproduction process of AlexNet

a project for 3D multi-object tracking

Related tags

Overview

3D Multi-Object Tracker

Features

Kitti Results

Prepare data

Requirements

Quick start

Notes

Owner

Model of an AI powered sign language interpreter.

CoMoGAN: continuous model-guided image-to-image translation. CVPR 2021 oral.

AlphaBot2 Pi Core software for interfacing with the various components.

Code for "OctField: Hierarchical Implicit Functions for 3D Modeling (NeurIPS 2021)"

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,

[ICCV 2021] Deep Hough Voting for Robust Global Registration

The code for our paper submitted to RAL/IROS 2022: OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition.

Github Traffic Insights as Prometheus metrics.

Calibrated Hyperspectral Image Reconstruction via Graph-based Self-Tuning Network.

Final project code: Implementing MAE with downscaled encoders and datasets, for ESE546 FA21 at University of Pennsylvania

Official code repository for "Exploring Neural Models for Query-Focused Summarization"

Tweesent-back - Tweesent backend uses fastAPI as the web framework

Task Transformer Network for Joint MRI Reconstruction and Super-Resolution (MICCAI 2021)

Pytorch implementation of few-shot semantic image synthesis

CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View

NeuroGen: activation optimized image synthesis for discovery neuroscience

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

Attentive Implicit Representation Networks (AIR-Nets)

HeartRate detector with ArduinoandPython - Use Arduino and Python create a heartrate detector.

Reproduction process of AlexNet

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,