SoK: Vehicle Orientation Representations for Deep Rotation Estimation

Last update: Oct 07, 2022

Overview

SoK: Vehicle Orientation Representations for Deep Rotation Estimation

Raymond H. Tu, Siyuan Peng, Valdimir Leung, Richard Gao, Jerry Lan

This is the official implementation for the paper SoK: Vehicle Orientation Representations for Deep Rotation Estimation

Envrionment Setup

Install required packages via conda

# create conda environment based on yml file
conda env update --file environment.yml
# activate conda environment
conda activate KITTI-Orientation

Clone git repo:

git clone [email protected]:umd-fire-coml/KITTI-orientation-learning.git

Training

Check training.sh for example training script

Training Parameter setup:

Training parameters can be configured using cmd arguments

--predict: Specify prediction target. Options are rot-y, alpha
--converter: Specify prediction method. Options are alpha, rot-y, tricosine, multibin, voting-bin, single-bin
--kitti_dir: path to kitti dataset directory. Its subdirectory should have training/ and testing/ Default path is dataset/
--training_record: root directory of all training record, parent of weights and logs directory. Default path is training_record
--resume: Resume from previous training under training_record directory
--add_pos_enc: Add positional encoding to input
--add_depth_map: Add depth map information to input

For all the training parameter setup, please using

python3 model/training.py -h

Training Result

Exp ID	Target	Loss Functions	Additional Inputs	Accuracy (%)
E1	rot-y	L2 Loss	-	90.490
E2	rot-y	Angle Loss	-	89.052
E3	alpha	L2 Loss	-	90.132
E4	Single Bin	L2 Loss	-	94.815
E5	Single Bin	L2 Loss	Pos Enc	94.277
E6	Single Bin	L2 Loss	Dep Map	93.952
E7	Voting Bins (4-Bin)	L2 Loss	-	93.609
E8	Tricosine	L2 Loss	-	94.249
E9	Tricosine	L2 Loss	Pos Enc	94.351
E10	Tricosine	L2 Loss	Dep Map	94.384
E11	2 Conf Bins	L2(Bins,Confs)	-	83.304
E12	4 Conf Bins	L2(Bins,Confs)	-	88.071

SoK: Vehicle Orientation Representations for Deep Rotation Estimation

Related tags

Overview

SoK: Vehicle Orientation Representations for Deep Rotation Estimation

Raymond H. Tu, Siyuan Peng, Valdimir Leung, Richard Gao, Jerry Lan

Table of Conents

Envrionment Setup

Training

Training Parameter setup:

Training Result

Owner

FIRE Capital One Machine Learning of the University of Maryland

ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration

Plenoxels: Radiance Fields without Neural Networks, Code release WIP

A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items

A 2D Visual Localization Framework based on Essential Matrices [ICRA2020]

Use unsupervised and supervised learning to predict stocks

Official code for 'Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning' [ICCV 2021]

Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet.

An implementation of a discriminant function over a normal distribution to help classify datasets.

The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021

Pseudo-Visual Speech Denoising

Realtime YOLO Monster Detection With Non Maximum Supression

Official Implement of CVPR 2021 paper “Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting”

A containerized REST API around OpenAI's CLIP model.

Informal Persian Universal Dependency Treebank

Official implementation of "SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers"

Code for the RA-L (ICRA) 2021 paper "SeqNet: Learning Descriptors for Sequence-Based Hierarchical Place Recognition"

Implementations of paper Controlling Directions Orthogonal to a Classifier

On Effective Scheduling of Model-based Reinforcement Learning

Neural style transfer in PyTorch.

This is an easy python software which allows to sort images with faces by gender and after by age.