Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth

Last update: Dec 07, 2022

Overview

Instance segmentation by jointly optimizing spatial embeddings and clustering bandwidth

This codebase implements the loss function described in:

Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth. Davy Neven, Bert De Brabandere, Marc Proesmans, and Luc Van Gool. Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.

Our network architecture is a multi-branched version of ERFNet and uses the Lovasz-hinge loss for maximizing the IoU of each instance.
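As a rough illustration of how a clustering bandwidth interacts with per-instance IoU, the sketch below scores pixel embeddings with a Gaussian around an instance center and thresholds the score at 0.5 to form a mask. It is a plain-Python toy, not the repository's PyTorch loss: the function names (gaussian_score, assign_to_instance, mask_iou), the sample embeddings, and the fixed center are all assumptions made for this example, and the Lovasz-hinge loss itself is not implemented here.

```python
import math

def gaussian_score(embedding, center, sigma):
    """Score in (0, 1]: 1.0 at the center, decaying with distance at a rate set by sigma."""
    dx = embedding[0] - center[0]
    dy = embedding[1] - center[1]
    return math.exp(-(dx * dx + dy * dy) / (2.0 * sigma * sigma))

def assign_to_instance(embeddings, center, sigma, threshold=0.5):
    """Mark each embedding as belonging to the instance if its score exceeds the threshold."""
    return [gaussian_score(e, center, sigma) > threshold for e in embeddings]

def mask_iou(pred, target):
    """Intersection over union of two boolean masks given as flat lists."""
    inter = sum(1 for p, t in zip(pred, target) if p and t)
    union = sum(1 for p, t in zip(pred, target) if p or t)
    return inter / union if union else 0.0

# Hypothetical data: three embeddings near the instance center, one far away.
embeddings = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.2), (1.0, 1.0)]
target = [True, True, True, False]
center = (0.0, 0.0)

for sigma in (0.1, 0.2, 2.0):
    pred = assign_to_instance(embeddings, center, sigma)
    print(sigma, mask_iou(pred, target))
```

In this toy setup a bandwidth that is too small drops a pixel that belongs to the instance, while one that is too large absorbs the distant pixel; both lower the IoU, which is why the bandwidth is worth optimizing together with the embeddings.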

License

This software is released under a Creative Commons license which allows for personal and research use only. For a commercial license, please contact the authors. You can view a license summary here.

Getting started

This codebase showcases the proposed loss function on car instance segmentation using the Cityscapes dataset.

Prerequisites

Dependencies:

PyTorch 1.1
Python 3.6.8 (or higher)
Cityscapes + scripts (if you want to evaluate the model)

Training

Training consists of two steps. We first train on 512x512 crops around each object, which avoids spending computation on background patches. Afterwards, we fine-tune on larger 1024x1024 patches to account for bigger objects and for background features that are not present in the smaller crops.

To generate these crops do the following:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python utils/generate_crops.py
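The idea of cropping around each object can be sketched as follows. This is only an illustration under stated assumptions, not the logic of utils/generate_crops.py: the helper names (mask_centroid, crop_window), the choice to center the crop on the mask centroid, and the choice to shift the window so it stays inside the image are all hypothetical.

```python
def mask_centroid(mask):
    """Integer (x, y) centroid of the truthy pixels in a 2D list, or None if the mask is empty."""
    sum_x = sum_y = count = 0
    for y, row in enumerate(mask):
        for x, value in enumerate(row):
            if value:
                sum_x += x
                sum_y += y
                count += 1
    if count == 0:
        return None
    return sum_x // count, sum_y // count

def crop_window(center_x, center_y, image_w, image_h, crop_size=512):
    """Return (left, top, right, bottom) of a square crop around the given center.

    The window is shifted, not shrunk, when the center lies close to an image
    border, so the crop keeps its full size whenever the image is large enough.
    """
    half = crop_size // 2
    left = min(max(center_x - half, 0), max(image_w - crop_size, 0))
    top = min(max(center_y - half, 0), max(image_h - crop_size, 0))
    right = min(left + crop_size, image_w)
    bottom = min(top + crop_size, image_h)
    return left, top, right, bottom

# Example on a 2048x1024 image (the Cityscapes resolution) with a made-up object center.
print(crop_window(1000, 500, 2048, 1024))
```

Shifting the window near the borders keeps every crop at the same size, which makes it simpler to batch crops during training.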

Afterwards start training:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python train.py

Training options can be changed in train_config.py; for example, set display=True to enable visualization.

Testing

You can download a pretrained model here. Save the file in src/pretrained_models/, or adapt test_config.py to point to its location.

To test the model on the Cityscapes validation set run:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python test.py

The pretrained model gets 56.4 AP on the car validation set.

Acknowledgement

This work was supported by Toyota, and was carried out at the TRACE Lab at KU Leuven (Toyota Research on Automated Cars in Europe - Leuven).
