A Survey on Deep Learning Technique for Video Segmentation

A Survey on Deep Learning Technique for Video Segmentation
Wenguan Wang, Tianfei Zhou, Fatih Porikli, David Crandall, and Luc Van Gool.

Contributing

Please feel free to create issues or pull requests to add papers.

Welcome any discussions on video segmentation at

1. Introduction

Video segmentation, i.e., partitioning video frames into multiple segments or objects, plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to virtual background creation in video conferencing. In this survey, we comprehensively review two basic lines of research — video object segmentation and video semantic segmentation — by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. In particular, we review eight sub-fields as given in the following figure:

2. Deep Learning-based Video Object Segmentation

3. Deep Learning-based Video Semantic Segmentation

4. Datasets

Popular Datasets in VOS and VSS

Citation

If you find our survey and repository useful for your research, please consider citing our paper:

@article{wang2021survey,
  title={A survey on deep learning technique for video segmentation},
  author={Wang, Wenguan and Zhou, Tianfei and Porikli, Fatih and Crandall, David and Van Gool, Luc},
  journal={arXiv preprint arXiv:2107.01153},
  year={2021}
}

A Survey on Deep Learning Technique for Video Segmentation

Related tags

Overview

A Survey on Deep Learning Technique for Video Segmentation

Contributing

1. Introduction

2. Deep Learning-based Video Object Segmentation

3. Deep Learning-based Video Semantic Segmentation

4. Datasets

Citation

Owner

Tianfei Zhou

iNAS: Integral NAS for Device-Aware Salient Object Detection

Pytorch implementation of Straight Sampling Network For Point Cloud Learning (ICIP2021).

A Tensorflow implementation of CapsNet based on Geoffrey Hinton's paper Dynamic Routing Between Capsules

Computationally efficient algorithm that identifies boundary points of a point cloud.

A spatial genome aligner for analyzing multiplexed DNA-FISH imaging data.

Official implementation of "Learning Proposals for Practical Energy-Based Regression", 2021.

Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"

Scheme for training and applying a label propagation framework

Towards Fine-Grained Reasoning for Fake News Detection

A python interface for training Reinforcement Learning bots to battle on pokemon showdown

Progressive Image Deraining Networks: A Better and Simpler Baseline

This package implements THOR: Transformer with Stochastic Experts.

ML course - EPFL Machine Learning Course, Fall 2021

ICON: Implicit Clothed humans Obtained from Normals (CVPR 2022)

bespoke tooling for offensive security's Windows Usermode Exploit Dev course (OSED)

Personal project about genus-0 meshes, spherical harmonics and a cow

EMNLP 2021 paper The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers.

[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

This is the code of NeurIPS'21 paper "Towards Enabling Meta-Learning from Target Models".

Transfer-Learn is an open-source and well-documented library for Transfer Learning.