MonoRCNN is a monocular 3D object detection method for automonous driving

Last update: Dec 27, 2022

Related tags

Overview

MonoRCNN

MonoRCNN is a monocular 3D object detection method for automonous driving, published at ICCV 2021. This project is an implementation of MonoRCNN.

Visualization

Methodology

Installation

Python 3.6
PyTorch 1.5.0
Detectron2 0.1.3

Please use the Detectron2 included in this project. To ignore fully occluded objects during training, build.py, rpn.py, and roi_heads.py have been modified.

Dataset Preparation

KITTI

Model & Log

KITTI val1 split

Organize the downloaded files as follows:

├── projects
│   ├── MonoRCNN
│   │   ├── output
│   │   │   ├── model
│   │   │   ├── log.txt
│   │   │   ├── ...

Test

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1 --resume --eval-only

Set VISUALIZE as True to visualize 3D object detection results (saved in output/evaluation/test/visualization).

Training

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1

Citation

If you find this project useful in your research, please cite:

@inproceedings{MonoRCNN_ICCV21,
    title = {Geometry-based Distance Decomposition for Monocular 3D Object Detection},
    author = {Xuepeng Shi and Qi Ye and 
              Xiaozhi Chen and Chuangrong Chen and 
              Zhixiang Chen and Tae-Kyun Kim},
    booktitle = {ICCV},
    year = {2021},
}

Contact

[email protected]

MonoRCNN is a monocular 3D object detection method for automonous driving

Related tags

Overview

MonoRCNN

Visualization

Methodology

Related Link

Installation

Dataset Preparation

Model & Log

Test

Training

Citation

Contact

Acknowledgement

Owner

Keras Model Implementation Walkthrough

The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Nature Gradient.

E-RAFT: Dense Optical Flow from Event Cameras

A tf.keras implementation of Facebook AI's MadGrad optimization algorithm

Official repository for Few-shot Image Generation via Cross-domain Correspondence (CVPR '21)

The code for our CVPR paper PISE: Person Image Synthesis and Editing with Decoupled GAN, Project Page, supp.

StyleGAN2 - Official TensorFlow Implementation

Direct design of biquad filter cascades with deep learning by sampling random polynomials.

[NeurIPS-2021] Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation

Companion repository to the paper accepted at the 4th ACM SIGSPATIAL International Workshop on Advances in Resilient and Intelligent Cities

BOOKSUM: A Collection of Datasets for Long-form Narrative Summarization

A Broad Study on the Transferability of Visual Representations with Contrastive Learning

An official implementation of "Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation" (ICCV 2021) in PyTorch.

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

这是一个deeplabv3-plus-pytorch的源码，可以用于训练自己的模型。

Learning Temporal Consistency for Low Light Video Enhancement from Single Images (CVPR2021)

IOT: Instance-wise Layer Reordering for Transformer Structures

A repo with study material, exercises, examples, etc for Devnet SPAUTO

Official repository for: Continuous Control With Ensemble DeepDeterministic Policy Gradients

Source code for our paper "Empathetic Response Generation with State Management"