Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Last update: Dec 19, 2022

Related tags

Deep Learning BtcDet

Overview

Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Acknowledgement

We implement our model, BtcDet, based on [OpenPcdet 0.3.0].

Installation

Requirements

All the codes are tested in the following environment:

Linux (tested on Ubuntu 14.04/16.04)
Python 3.6+
PyTorch 1.1 or higher (tested on PyTorch 1.7, 1.8.1, 1.9, 1.10)
CUDA 9.0 or higher (PyTorch 1.3+ needs CUDA 9.2+, test on CUDA 10.2)
spconv v1.2.1 (commit fad3000249d27ca918f2655ff73c41f39b0f3127)

Install

b. Install the dependent libraries as follows:

Install the dependent python libraries:

pip install -r requirements.txt

Install the SparseConv library, we use the implementation from [spconv].
- If you use PyTorch 1.1, then make sure you install the spconv v1.0 with (commit 8da6f96) instead of the latest one.
- If you use PyTorch 1.3+, then you need to install the spconv v1.2. As mentioned by the author of spconv
```
git clone -b v1.2.1  https://github.com/traveller59/spconv.git --recursive

cd spconv

sudo apt-get install libboost-all-dev

python setup.py bdist_wheel

cd ./dist 
```
then use pip to install generated whl file.
```
pip install spconv-1.2.1-{your system info}.whl
```
After that, you should first get out of the spconv directory, then do python import spconv to see if you installed it correctly.

c. Install this btcdet library by running the following command:

cd btcdet
python setup.py develop

Preparation

KITTI Dataset

Please download the official KITTI 3D object detection dataset and organize the downloaded files as follows (the road planes could be downloaded from [road plane], which are optional for data augmentation in the training):

BtcDet
├── data
│   ├── kitti
    │   │   │──detection3d  │── ImageSets
                    │   │   │── training
                    │   │   │   ├──calib & velodyne & label_2 & image_2 & (optional: planes) & (optional: depth_2)
                    │   │   │── testing
                    │   │   │   ├──calib & velodyne & image_2

Generate the data infos by running the following command:

python -m btcdet.datasets.kitti.kitti_dataset create_kitti_infos tools/cfgs/dataset_configs/kitti_dataset.yaml

Generate Approximated complete object points:

(at btcdet directory, execute:)

python -m btcdet.datasets.multifindbestfit

Alternatively,

you can use our generated kitti's data including the generated complete object points, download it [here (about 31GBs)] and put the zip file inside data/kitti/ and unzip it as detection3d directory.

Run training:

cd tools/

Single gpu training

mkdir output

mkdir output/kitti_car

python train.py --cfg_file ./cfgs/model_configs/btcdet_kitti_car.yaml --output_dir ../output/kitti_car/ --batch_size 2

Multi gpu training

bash scripts/dist_train.sh 4  --batch_size 8 --gpu_str "0,1,2,3" --cfg_file ./cfgs/model_configs/btcdet_kitti_car.yaml --output_dir ../output/kitti_car/

Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Related tags

Overview

Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Acknowledgement

Installation

Requirements

Install

Preparation

KITTI Dataset

Generate Approximated complete object points:

Alternatively,

Run training:

Owner

Qiangeng Xu

Code implementing "Improving Deep Learning Interpretability by Saliency Guided Training"

A python implementation of Physics-informed Spline Learning for nonlinear dynamics discovery

DTCN SMP Challenge - Sequential prediction learning framework and algorithm

For medical image segmentation

Official implementation of VaxNeRF (Voxel-Accelearated NeRF).

RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation

Resources for the Ki testnet challenge

A demo of how to use JAX to create a simple gravity simulation

Official repository of the paper Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision

Equivariant Imaging: Learning Beyond the Range Space

K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (EMNLP Founding 2021)

A PyTorch implementation of EfficientNet and EfficientNetV2 (coming soon!)

Github project for Attention-guided Temporal Coherent Video Object Matting.

Official repo for SemanticGAN https://nv-tlabs.github.io/semanticGAN/

Bayesian Generative Adversarial Networks in Tensorflow

Official implementation for: Blended Diffusion for Text-driven Editing of Natural Images.

DumpSMBShare - A script to dump files and folders remotely from a Windows SMB share

Safe Model-Based Reinforcement Learning using Robust Control Barrier Functions

TrackFormer: Multi-Object Tracking with Transformers

PyKaldi GOP-DNN on Epa-DB