A novel benchmark dataset for Monocular Layout prediction

Last update: Apr 26, 2022

Related tags

Deep Learning AutoLay

Overview

AutoLay

AutoLay: Benchmarking Monocular Layout Estimation

Kaustubh Mani, N. Sai Shankar, J. Krishna Murthy, and K. Madhava Krishna

Abstract

In this paper, we tackle the problem of estimating the layout of a scene in bird’s eye view from monocular imagery. Specifically, we target amodal layout estimation, i.e., we estimate semantic labels for parts of the scene that do not even project to the visible regime of the image. While prior approaches to amodal layout estimation focused on coarse attributes of a scene(roads, sidewalks), we shift our attention to generate amodal estimation for fine-grained atrributes such as lanes, crosswalks, vehicles, etc. To this end, we introduce AutoLay, a new dataset for amodal layout estimation in bird’s eye view. AutoLay includes precise annotations for (amodal) layouts for 32 sequences from the KITTI dataset. In addition to fine-grained attributes such as lanes, sidewalks, and vehicles, we also provide detailed semantic annotations for 3D pointclouds. To foster reproducibility and further research in this nascent area, we open-source implementations for several baselines and current art. Further, we propose VideoLayout, a real-time neural net architecture that leverages temporal information from monocular video, to produce more accurate and consistent layouts. VideoLayout achieves state-of-the-art performance on AutoLay, while running in real-time (18 fps).

Dataset

We use 32 video sequences from the KITTI Raw dataset in AutoLay. We provide per-frame annotations in perspective, orthographic (bird’s eye view), as well as in 3D. Of the 32 annotated sequences, 17 sequences-containing 7414 images—are used for training. The other 15 sequences—comprising 4438 images—form the test set. This makes for nearly 12K annotated images, across a distance of 9.5 Km, and a variety of urban scenarios (residential, urban, road). The semantic classes considered in this dataset are road, sidewalk, vehicle, crosswalk, and lane. Each lane segment is provided a unique id, which we classify further. The lane class is further classified as ego-lane and other lane. We also have an other road class for road areas that do not fall under any of the above categories.

Sample dataset can be downloaded from here.

Benchmark

We provide a comprehensive benchmark of all the state-of-the-art methods for layout estimation on Autolay.

A novel benchmark dataset for Monocular Layout prediction

Related tags

Overview

AutoLay

AutoLay: Benchmarking Monocular Layout Estimation

Kaustubh Mani, N. Sai Shankar, J. Krishna Murthy, and K. Madhava Krishna

Abstract

Dataset

Benchmark

Results

Road Layout Estimation

Vehicle Layout Estimation

Lane Layout Estimation

Owner

Kaustubh Mani

darija <-> english dictionary

Software & Hardware to do multi color printing with Sharpies

A curated list of automated deep learning (including neural architecture search and hyper-parameter optimization) resources.

Implementation for Learning to Track with Object Permanence

Research on controller area network Intrusion Detection Systems

On Evaluation Metrics for Graph Generative Models

Pseudo-Visual Speech Denoising

Mining-the-Social-Web-3rd-Edition - The official online compendium for Mining the Social Web, 3rd Edition (O'Reilly, 2018)

Deep learning for spiking neural networks

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

Codebase for the self-supervised goal reaching benchmark introduced in the LEXA paper

A script helps the user to update Linux and Mac systems through the terminal

Simulation environments for the CrazyFlie quadrotor: Used for Reinforcement Learning and Sim-to-Real Transfer

CNN visualization tool in TensorFlow

Anomaly detection in multi-agent trajectories: Code for training, evaluation and the OpenAI highway simulation.

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI

A certifiable defense against adversarial examples by training neural networks to be provably robust

Official PyTorch implementation of UACANet: Uncertainty Aware Context Attention for Polyp Segmentation

API for RL algorithm design & testing of BCA (Building Control Agent) HVAC on EnergyPlus building energy simulator by wrapping their EMS Python API

A New Open-Source Off-road Environment for Benchmark Generalization of Autonomous Driving

A novel benchmark dataset for Monocular Layout prediction

Related tags

Overview

AutoLay

AutoLay: Benchmarking Monocular Layout Estimation

Kaustubh Mani, N. Sai Shankar, J. Krishna Murthy, and K. Madhava Krishna

Abstract

Dataset

Benchmark

Results

Road Layout Estimation

Vehicle Layout Estimation

Lane Layout Estimation

Owner

Kaustubh Mani

darija <-> english dictionary

Software & Hardware to do multi color printing with Sharpies

A curated list of automated deep learning (including neural architecture search and hyper-parameter optimization) resources.

Implementation for Learning to Track with Object Permanence

Research on controller area network Intrusion Detection Systems

On Evaluation Metrics for Graph Generative Models

Pseudo-Visual Speech Denoising

Mining-the-Social-Web-3rd-Edition - The official online compendium for Mining the Social Web, 3rd Edition (O'Reilly, 2018)

Deep learning for spiking neural networks

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

Codebase for the self-supervised goal reaching benchmark introduced in the LEXA paper

A script helps the user to update Linux and Mac systems through the terminal

Simulation environments for the CrazyFlie quadrotor: Used for Reinforcement Learning and Sim-to-Real Transfer

CNN visualization tool in TensorFlow

Anomaly detection in multi-agent trajectories: Code for training, evaluation and the OpenAI highway simulation.

This is the pytorch implementation for the paper: *Learning Accurate Performance Predictors for Ultrafast Automated Model Compression*, which is in submission to TPAMI

A certifiable defense against adversarial examples by training neural networks to be provably robust

Official PyTorch implementation of UACANet: Uncertainty Aware Context Attention for Polyp Segmentation

API for RL algorithm design & testing of BCA (Building Control Agent) HVAC on EnergyPlus building energy simulator by wrapping their EMS Python API

A New Open-Source Off-road Environment for Benchmark Generalization of Autonomous Driving

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI