Official Datasets and Implementation from our Paper "Video Class Agnostic Segmentation in Autonomous Driving".

Last update: Oct 24, 2022

Overview

Video Class Agnostic Segmentation

[Method Paper] [Benchmark Paper] [Project] [Demo]

Official datasets and implementation from our paper "Video Class Agnostic Segmentation Benchmark for Autonomous Driving", presented at the Workshop on Autonomous Driving, CVPR 2021.

Installation

This repo is tested under Python 3.6 and PyTorch 1.4.
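
To confirm that a local environment matches the tested versions before installing, a small check along these lines may help. This is only a sketch: the helper names `major_minor` and `matches_tested` are hypothetical and not part of this repo, and other versions may well work even though they are untested.

```python
import sys

# Versions this README reports as tested.
TESTED_PYTHON = (3, 6)
TESTED_TORCH = (1, 4)


def major_minor(version):
    """Return (major, minor) from a version string such as '1.4.0+cu100'."""
    parts = version.split("+")[0].split(".")
    return int(parts[0]), int(parts[1])


def matches_tested(python_version, torch_version):
    """True if both versions share major.minor with the tested setup."""
    return (major_minor(python_version) == TESTED_PYTHON
            and major_minor(torch_version) == TESTED_TORCH)


if __name__ == "__main__":
    py = "{}.{}".format(*sys.version_info[:2])
    try:
        import torch  # only needed for the live check
    except ImportError:
        print("PyTorch is not installed; Python is", py)
    else:
        ok = matches_tested(py, torch.__version__)
        print("Python", py, "PyTorch", torch.__version__, "tested setup:", ok)
```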

Download Required Packages

pip install -r requirements.txt
pip install "git+https://github.com/cocodataset/panopticapi.git"

Set up mmdet

python setup.py develop

Motion Segmentation Track

Dataset Preparation

Follow the Dataset Preparation Instructions.

Inference

Download the Ego Flow Suppressed weights, trained on Cityscapes and KITTI-MOTS.
Modify the configs to match your dataset path and the image, annotation, and flow prefixes:

configs/data/kittimots_motion_supp.py
configs/data/cscapesvps_motion_supp.py
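
After editing the configs, it can be useful to check that every prefix points at a directory that actually exists before launching inference. The sketch below assumes the prefixes are paths relative to the dataset root. The key names (`img_prefix`, `ann_prefix`, `flow_prefix`) and the folder names are hypothetical placeholders, so substitute whatever the config files in this repo actually use.

```python
import os


def missing_dirs(data_root, prefixes):
    """Return the names of prefixes whose directory is absent under data_root."""
    missing = []
    for name, relative_path in prefixes.items():
        if not os.path.isdir(os.path.join(data_root, relative_path)):
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Hypothetical layout; replace with the values from your edited config.
    example_prefixes = {
        "img_prefix": "images",
        "ann_prefix": "annotations",
        "flow_prefix": "flow",
    }
    print("Missing:", missing_dirs("/path/to/dataset", example_prefixes))
```

An empty list means every prefix resolved to an existing directory.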

Evaluate CAQ:

python tools/test_eval_caq.py CONFIG_FILE WEIGHTS_FILE

CONFIG_FILE: configs/infer_kittimots.py or configs/infer_cscapesvps.py
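
To avoid retyping the config path for each dataset, the command can be assembled from a short dataset name. This is a convenience sketch, not a script shipped with the repo: the dataset keys and the function `caq_command` are hypothetical, while the script and config paths are the ones listed above. It assumes it is run from the repository root.

```python
import subprocess
import sys

# Config files named in this README, keyed by a hypothetical short name.
CAQ_CONFIGS = {
    "kittimots": "configs/infer_kittimots.py",
    "cscapesvps": "configs/infer_cscapesvps.py",
}


def caq_command(dataset, weights_file):
    """Build the argument list for tools/test_eval_caq.py."""
    if dataset not in CAQ_CONFIGS:
        raise ValueError("unknown dataset: {!r}".format(dataset))
    return [sys.executable, "tools/test_eval_caq.py",
            CAQ_CONFIGS[dataset], weights_file]


if __name__ == "__main__":
    # Usage: python run_caq.py kittimots WEIGHTS_FILE
    dataset_name, weights = sys.argv[1], sys.argv[2]
    subprocess.run(caq_command(dataset_name, weights), check=True)
```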

Qualitative Results

python tools/test_vis.py CONFIG_FILE WEIGHTS_FILE --vis_unknown --save_dir OUTS_DIR

Evaluate Image Panoptic Quality (note: evaluated on 1024x2048 images):

python tools/test_eval_ipq.py configs/infer_cscapesvps_pq.py WEIGHTS_FILE --out PKL_FILE
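
For readers unfamiliar with the metric, the sketch below illustrates the standard panoptic quality definition for a single class: the sum of IoUs over matched segment pairs, divided by TP + 0.5 FP + 0.5 FN, where a predicted and a ground-truth segment match only if their IoU exceeds 0.5. It is an illustration of that general formula, not the code in `tools/test_eval_ipq.py`, and the function name `panoptic_quality` is hypothetical.

```python
def panoptic_quality(matched_ious, num_false_positives, num_false_negatives):
    """Single-class PQ from the IoUs of matched pairs and the unmatched counts."""
    for iou in matched_ious:
        if not 0.5 < iou <= 1.0:
            raise ValueError("matched pairs must have IoU in (0.5, 1.0]")
    num_true_positives = len(matched_ious)
    denominator = (num_true_positives
                   + 0.5 * num_false_positives
                   + 0.5 * num_false_negatives)
    if denominator == 0:
        return 0.0  # no segments at all; conventions for this case vary
    return sum(matched_ious) / denominator


if __name__ == "__main__":
    # Two matched segments, one spurious prediction, one missed segment.
    print(panoptic_quality([0.8, 0.6], 1, 1))
```

In the standard benchmark setting this per-class value is averaged over classes.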

Training

Coming Soon ...

Open-set Segmentation Track

Coming soon ...

Acknowledgements

The dataset and repository rely on these sources:

Voigtlaender, Paul, et al. "MOTS: Multi-object tracking and segmentation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.
Kim, Dahun, et al. "Video panoptic segmentation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020.
Wang, Xinlong, et al. "SOLO: Segmenting objects by locations." European Conference on Computer Vision. Springer, Cham, 2020.
This repository is built upon the SOLO code.

Citation

@article{siam2021video,
      title={Video Class Agnostic Segmentation Benchmark for Autonomous Driving}, 
      author={Mennatullah Siam and Alex Kendall and Martin Jagersand},
      year={2021},
      eprint={2103.11015},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Contact

If you have any questions regarding the dataset or repository, please contact [email protected].

