Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Last update: Jun 27, 2022

Abstract

Analyzing complex scenes with deep neural networks (DNNs) is a challenging task, particularly when images contain multiple objects that partially occlude each other. Existing approaches to image analysis mostly process objects independently and do not take into account the relative occlusion of nearby objects. We propose a deep network for multi-object instance segmentation that is robust to occlusion and can be trained from bounding box supervision only.

We also introduce an Occlusion Challenge dataset generated from real-world segmented objects with accurate annotations and propose a taxonomy of occlusion scenarios that pose a particular challenge for computer vision.

NOTICE

Dataset links and the model will be released in a few days. (Update: 18 June)

Requirements

The code uses Python 3.6 and has been tested with the GPU build of PyTorch 1.2, CUDA 10.0, and cuDNN 7.5.
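
To compare a local environment against the tested versions listed above, a small check along these lines may help. It is only a sketch: the helper names `version_matches` and `collect_versions` are made up for this example and are not part of the repository, and a mismatch does not necessarily mean the code will fail.

```python
import sys


def version_matches(found, expected):
    """Return True if `found` begins with the dotted `expected` prefix."""
    if found is None:
        return False
    found_parts = str(found).split(".")
    expected_parts = expected.split(".")
    return found_parts[: len(expected_parts)] == expected_parts


def collect_versions():
    """Gather versions of the components named in the requirements."""
    versions = {"python": "{}.{}".format(*sys.version_info[:2])}
    try:
        import torch  # imported lazily so the check still runs without PyTorch
    except ImportError:
        versions["torch"] = None
        return versions
    versions["torch"] = torch.__version__
    versions["cuda"] = torch.version.cuda  # None on CPU-only builds
    versions["cudnn"] = torch.backends.cudnn.version()  # integer code or None
    versions["gpu_available"] = torch.cuda.is_available()
    return versions


if __name__ == "__main__":
    # Versions the README reports as tested.
    tested = {"python": "3.6", "torch": "1.2", "cuda": "10.0"}
    found = collect_versions()
    for name, expected in tested.items():
        status = "ok" if version_matches(found.get(name), expected) else "differs"
        print(f"{name}: found {found.get(name)}, tested with {expected} ({status})")
    print("cudnn:", found.get("cudnn"), "| gpu available:", found.get("gpu_available"))
```

cuDNN is printed rather than compared because PyTorch reports it as a single integer instead of a dotted version string.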

Installation

Clone the repository with:

git clone https://github.com/XD7479/Multi-Object-Occlusion.git
cd Multi-Object-Occlusion

Install the requirements:

pip install -r requirements.txt

Datasets

Download the KINS dataset here and the Occlusion Challenge dataset here.
Enter the project folder and make links for the datasets:

ln -s  kins
ln -s  occ_challenge

Download the pre-trained model here.
Make links for the pre-trained model:

ln -s  models
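
After creating the links, it can be useful to confirm that each one resolves before running anything. The sketch below assumes the three link names used above (`kins`, `occ_challenge`, `models`) sit directly in the project folder; `check_links` is a hypothetical helper, not something shipped with the repository.

```python
import os


def check_links(project_dir, names=("kins", "occ_challenge", "models")):
    """Map each expected link name to 'ok', 'broken link', or 'missing'."""
    report = {}
    for name in names:
        path = os.path.join(project_dir, name)
        if os.path.exists(path):  # follows symlinks through to the target
            report[name] = "ok"
        elif os.path.islink(path):  # the link is there but its target is not
            report[name] = "broken link"
        else:
            report[name] = "missing"
    return report


if __name__ == "__main__":
    for name, status in check_links(".").items():
        print(f"{name}: {status}")
```

A "broken link" result usually means the path given to `ln -s` was relative to a different directory or the download was moved afterwards.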

Check the configuration file configs.py for the dataset and backbone you're using:

dataset_eval = 'occ_challenge'      # kins, occ_challenge
nn_type = 'resnext'             # vgg, resnext
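
A typo in either setting is easy to miss, so a quick validation against the options listed in the comments above can catch it early. This is a sketch only: `validate_config` and `ALLOWED` are names invented for the example, and the real `configs.py` may contain further settings that are not checked here.

```python
# Options taken from the comments in the configuration snippet above.
ALLOWED = {
    "dataset_eval": {"kins", "occ_challenge"},
    "nn_type": {"vgg", "resnext"},
}


def validate_config(settings):
    """Raise ValueError if a setting is missing or not a listed option."""
    for key, options in ALLOWED.items():
        value = settings.get(key)
        if value not in options:
            raise ValueError(f"{key}={value!r} is not one of {sorted(options)}")
    return settings


if __name__ == "__main__":
    # With the repository on the path, the values could instead be read as
    # configs.dataset_eval and configs.nn_type (assumed attribute names).
    validate_config({"dataset_eval": "occ_challenge", "nn_type": "resnext"})
    print("configuration looks consistent")
```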

Run the evaluation code with:

python3 eval_meanIoU.py
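
For readers unfamiliar with the metric, the sketch below shows the standard definition of mean intersection-over-union (IoU) on binary masks. It assumes NumPy is installed and that predictions and ground truths are already paired one to one. It is not taken from `eval_meanIoU.py`, which may match instances or handle edge cases differently.

```python
import numpy as np


def mask_iou(pred, gt):
    """IoU of two binary masks with the same shape."""
    pred = np.asarray(pred, dtype=bool)
    gt = np.asarray(gt, dtype=bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        return 1.0  # both masks empty: counted as a match (a convention)
    return float(np.logical_and(pred, gt).sum() / union)


def mean_iou(pred_masks, gt_masks):
    """Average IoU over already-paired predicted and ground-truth masks."""
    if len(pred_masks) != len(gt_masks):
        raise ValueError("pred_masks and gt_masks must have the same length")
    if len(pred_masks) == 0:
        raise ValueError("at least one mask pair is required")
    return float(np.mean([mask_iou(p, g) for p, g in zip(pred_masks, gt_masks)]))


if __name__ == "__main__":
    # Toy 2x2 masks: the first pair overlaps on 1 of 2 pixels, the second is exact.
    preds = [[[1, 1], [0, 0]], [[0, 1], [0, 1]]]
    gts = [[[1, 0], [0, 0]], [[0, 1], [0, 1]]]
    print(mean_iou(preds, gts))  # (0.5 + 1.0) / 2 = 0.75
```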

Segmentation Demo

Citation

@inproceedings{yuan2021robust,
      title={Robust Instance Segmentation through Reasoning about Multi-Object Occlusion},
      author={Xiaoding Yuan and Adam Kortylewski and Yihong Sun and Alan Yuille},
      booktitle = {Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR)},
      month = jun,
      year = {2021},
      month_numeric = {6}
}

Contact

If you have any questions, you can contact Xiaoding Yuan at [email protected].

Owner

Irene Yuan
