Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Last update: Jan 02, 2023

Related tags

Deep Learning Mask2Former

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation

Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar

[arXiv] [Project] [BibTeX]

Features

A single architecture for panoptic, instance and semantic segmentation.
Support major segmentation datasets: ADE20K, Cityscapes, COCO, Mapillary Vistas.

Installation

See installation instructions.

Getting Started

See Preparing Datasets for Mask2Former.

See Getting Started with Mask2Former.

Advanced usage

See Advanced Usage of Mask2Former.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the Mask2Former Model Zoo.

License

Shield:

The majority of Mask2Former is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license, Deformable-DETR is licensed under the Apache-2.0 License.

Citing Mask2Former

If you use Mask2Former in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@article{cheng2021mask2former,
  title={Masked-attention Mask Transformer for Universal Image Segmentation},
  author={Bowen Cheng and Ishan Misra and Alexander G. Schwing and Alexander Kirillov and Rohit Girdhar},
  journal={arXiv},
  year={2021}
}

If you find the code useful, please also consider the following BibTeX entry.

@inproceedings{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={NeurIPS},
  year={2021}
}

Acknowledgement

Code is largely based on MaskFormer (https://github.com/facebookresearch/MaskFormer).

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Related tags

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation

Features

Installation

Getting Started

Advanced usage

Model Zoo and Baselines

License

Citing Mask2Former

Acknowledgement

Owner

Meta Research

Scikit-learn compatible estimation of general graphical models

Deep learning models for classification of 15 common weeds in the southern U.S. cotton production systems.

A modular, open and non-proprietary toolkit for core robotic functionalities by harnessing deep learning

This is the pytorch re-implementation of the IterNorm

Neural Scene Flow Prior (NeurIPS 2021 spotlight)

Band-Adaptive Spectral-Spatial Feature Learning Neural Network for Hyperspectral Image Classification

Source code for From Stars to Subgraphs

SSPNet: Scale Selection Pyramid Network for Tiny Person Detection from UAV Images.

Technical Indicators implemented in Python only using Numpy-Pandas as Magic - Very Very Fast! Very tiny! Stock Market Financial Technical Analysis Python library . Quant Trading automation or cryptocoin exchange

Image-Stitching - Panorama composition using SIFT Features and a custom implementaion of RANSAC algorithm

Finding Biological Plausibility for Adversarially Robust Features via Metameric Tasks

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation (CIKM'17)

TensorFlow CNN for fast style transfer

Supplementary code for TISMIR paper "Sliding-Window Pitch-Class Histograms as a Means of Modeling Musical Form"

An unofficial PyTorch implementation of a federated learning algorithm, FedAvg.

This repository contains the files for running the Patchify GUI.

Efficient and intelligent interactive segmentation annotation software

hySLAM is a hybrid SLAM/SfM system designed for mapping

User-friendly bulk RNAseq deconvolution using simulated annealing