Det3D

A general 3D Object Detection codebase in PyTorch.

1. Introduction

Det3D is the first 3D object detection toolbox that provides out-of-the-box implementations of many 3D object detection algorithms, such as PointPillars, SECOND, and PIXOR, as well as state-of-the-art methods on major benchmarks such as KITTI (ViP) and nuScenes (CBGS). Key features of Det3D include:

Multi-dataset support: KITTI, nuScenes, Lyft
Point-based and Voxel-based model zoo
State-of-the-art performance
DDP & SyncBN

2. Installation

Please refer to INSTALLATION.md.

3. Quick Start

Please refer to GETTING_STARTED.md.

4. Model Zoo

4.1 nuScenes

Model	mAP	mATE	mASE	mAOE	mAVE	mAAE	NDS	ckpt
CBGS	49.9	0.335	0.256	0.323	0.251	0.197	61.3	link
PointPillars	41.8	0.363	0.264	0.377	0.288	0.198	56.0	link

The original model and prediction files are available in the CBGS README.
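
For readers unfamiliar with the NDS column, the following is a minimal sketch of how the nuScenes Detection Score combines mAP with the five true-positive error metrics (mATE, mASE, mAOE, mAVE, mAAE), following the nuScenes benchmark definition NDS = (5 * mAP + sum(1 - min(1, error))) / 10. The function name `nuscenes_detection_score` is hypothetical and is not part of Det3D; the inputs are the values from the nuScenes results above, with mAP rescaled to the range [0, 1].

```python
def nuscenes_detection_score(m_ap, tp_errors):
    """Combine mAP (in [0, 1]) with the five TP error metrics into NDS.

    Hypothetical helper for illustration only; not a Det3D function.
    Each error is clipped at 1 and converted to a score (1 - error).
    """
    tp_scores = [1.0 - min(1.0, err) for err in tp_errors]
    return (5.0 * m_ap + sum(tp_scores)) / 10.0


# Values copied from the nuScenes results in this section.
# Error order: mATE, mASE, mAOE, mAVE, mAAE.
cbgs = nuscenes_detection_score(0.499, [0.335, 0.256, 0.323, 0.251, 0.197])
pointpillars = nuscenes_detection_score(0.418, [0.363, 0.264, 0.377, 0.288, 0.198])

print(f"CBGS NDS: {100 * cbgs:.1f}")                  # CBGS NDS: 61.3
print(f"PointPillars NDS: {100 * pointpillars:.1f}")  # PointPillars NDS: 56.0
```

Both results agree with the NDS values reported for the two models, which is a quick consistency check on the reported metrics.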

4.2 KITTI

SECOND on KITTI (val) Dataset

car  AP @0.70, 0.70,  0.70:
bbox AP:90.54, 89.35, 88.43
bev  AP:89.89, 87.75, 86.81
3d   AP:87.96, 78.28, 76.99
aos  AP:90.34, 88.81, 87.66

PointPillars on KITTI(val) Dataset

car  AP@0.70,  0.70,  0.70:
bbox AP:90.63, 88.86, 87.35
bev  AP:89.75, 86.15, 83.00
3d   AP:85.75, 75.68, 68.93
aos  AP:90.48, 88.36, 86.58
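
The result blocks above follow the usual KITTI evaluation printout, where the three values on each row correspond to the easy, moderate, and hard difficulty levels. As a minimal sketch, assuming that ordering, the printout can be parsed into a dictionary for comparison between models. The names `parse_kitti_block` and `SECOND_VAL` are hypothetical and are not part of Det3D.

```python
import re

DIFFICULTIES = ("easy", "moderate", "hard")

# SECOND results on KITTI (val), copied from this section.
SECOND_VAL = """\
car  AP @0.70, 0.70,  0.70:
bbox AP:90.54, 89.35, 88.43
bev  AP:89.89, 87.75, 86.81
3d   AP:87.96, 78.28, 76.99
aos  AP:90.34, 88.81, 87.66
"""


def parse_kitti_block(text):
    """Parse rows such as 'bbox AP:90.54, 89.35, 88.43' into a nested dict.

    Hypothetical helper for illustration only; not a Det3D function.
    The header row (class name and thresholds) is skipped because it has
    no 'AP:' field.
    """
    results = {}
    for line in text.splitlines():
        match = re.match(r"\s*(\w+)\s+AP:\s*(.+)", line)
        if not match:
            continue
        metric, values = match.groups()
        scores = [float(v) for v in values.split(",")]
        results[metric] = dict(zip(DIFFICULTIES, scores))
    return results


second = parse_kitti_block(SECOND_VAL)
print(second["3d"]["moderate"])  # 78.28
```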

4.3 Lyft

Lyft Config

4.4 Waymo

5. Functionality

Models
- VoxelNet
- SECOND
- PointPillars
Features
- Multi-task learning
- Distributed Training and Validation
- SyncBN
- Flexible anchor dimensions
- TensorboardX
- Checkpointer & resuming from a breakpoint
- Self-contained visualization
- Finetune
- Multiscale Training & Validation
- Rotated RoI Align

6. TODO List

To Be Released
- CBGS on Lyft (val) Dataset
Models
- PointRCNN
- PIXOR

7. Call for Contributions

Support Waymo Dataset.
Add other 3D detection / segmentation models, such as VoteNet, STD, etc.

8. Developers

Benjin Zhu, Bingqi Ma

9. License

Det3D is released under the Apache license.

10. Citation

Det3D is a derivative codebase of CBGS. If you find this work useful in your research, please consider citing:

@article{zhu2019class,
  title={Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection},
  author={Zhu, Benjin and Jiang, Zhengkai and Zhou, Xiangxin and Li, Zeming and Yu, Gang},
  journal={arXiv preprint arXiv:1908.09492},
  year={2019}
}

