Per-Pixel Classification is Not All You Need for Semantic Segmentation

Last update: Jan 08, 2023

MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Bowen Cheng, Alexander G. Schwing, Alexander Kirillov

[arXiv] [Project] [BibTeX]

Features

Better results while being more efficient.
Unified view of semantic- and instance-level segmentation tasks.
Supports the major semantic segmentation datasets: ADE20K, Cityscapes, COCO-Stuff, Mapillary Vistas.
Supports all Detectron2 models.
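
To make the "unified view" concrete, here is a minimal sketch of how a set of (class, mask) predictions can be turned into a per-pixel semantic label map, as we understand the mask-classification idea from the paper. It is not the repository's code: the function names, the nested-list inputs, and the convention that the last class score means "no object" are assumptions made for illustration, and a practical version would use tensors rather than Python lists.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    """Logistic function mapping a mask logit to a value in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def semantic_inference(class_logits, mask_logits):
    """Combine N (class, mask) predictions into an H x W label map.

    class_logits: N lists of K+1 scores; the last entry is assumed
                  to be the "no object" score (hypothetical layout).
    mask_logits:  N masks, each an H x W nested list of scores.
    """
    # Class probabilities per segment, with "no object" dropped.
    class_probs = [softmax(row)[:-1] for row in class_logits]
    num_classes = len(class_probs[0])
    height, width = len(mask_logits[0]), len(mask_logits[0][0])

    labels = []
    for y in range(height):
        row = []
        for x in range(width):
            # Score of class c here: sum over segments of p_i(c) * m_i[y][x].
            scores = [
                sum(p[c] * sigmoid(m[y][x])
                    for p, m in zip(class_probs, mask_logits))
                for c in range(num_classes)
            ]
            row.append(max(range(num_classes), key=scores.__getitem__))
        labels.append(row)
    return labels
```

Each pixel receives the class with the highest summed score, so the same set of predicted segments can serve semantic segmentation here and, with different post-processing, instance-level tasks.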

Installation

See installation instructions.

Getting Started

See Preparing Datasets for MaskFormer.

See Getting Started with MaskFormer.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the MaskFormer Model Zoo.

License

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However, portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citing MaskFormer

If you use MaskFormer in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@article{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={arXiv},
  year={2021}
}

Owner

Facebook Research
