Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Last update: Jan 02, 2023

Overview

Exploring Cross-Image Pixel Contrast for Semantic Segmentation,
Wenguan Wang, Tianfei Zhou, Fisher Yu, Jifeng Dai, Ender Konukoglu and Luc Van Gool
arXiv technical report (arXiv 2101.11939)

Abstract

Current semantic segmentation methods focus only on mining “local” context, i.e., dependencies between pixels within individual images, through context-aggregation modules (e.g., dilated convolution, neural attention) or structure-aware optimization criteria (e.g., IoU-like loss). However, they ignore the “global” context of the training data, i.e., rich semantic relations between pixels across different images. Inspired by recent advances in unsupervised contrastive representation learning, we propose a pixel-wise contrastive framework for semantic segmentation in the fully supervised setting. The core idea is to enforce pixel embeddings belonging to the same semantic class to be more similar than embeddings from different classes. This gives rise to a pixel-wise metric learning paradigm for semantic segmentation that explicitly explores the structures of labeled pixels, which have long been ignored in the field. Our method can be effortlessly incorporated into existing segmentation frameworks without extra overhead during testing.

We experimentally show that, with well-known segmentation models (i.e., DeepLabV3, HRNet, OCR) and backbones (i.e., ResNet, HRNet), our method brings consistent performance improvements across diverse datasets (i.e., Cityscapes, PASCAL-Context, COCO-Stuff).
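To make the core idea of the abstract concrete, here is a minimal sketch of a supervised, InfoNCE-style pixel contrast loss. It is not the code shipped in this repository: the function name `pixel_contrast_loss`, the plain-list inputs, and the temperature value of 0.1 are assumptions chosen for illustration, and the sketch uses only the Python standard library rather than PyTorch. Each anchor pixel is pulled toward embeddings of the same class and pushed away from embeddings of other classes, regardless of which image they were sampled from.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (assumes the vector is non-zero)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def pixel_contrast_loss(embeddings, labels, temperature=0.1):
    """Illustrative supervised contrastive loss over labeled pixel embeddings.

    embeddings: list of feature vectors, one per sampled pixel
                (pixels may come from different images).
    labels:     list of class ids, one per embedding.
    """
    feats = [l2_normalize(e) for e in embeddings]
    total, num_anchors = 0.0, 0
    for i, (anchor, cls) in enumerate(zip(feats, labels)):
        # Positives share the anchor's class; negatives belong to other classes.
        pos = [f for j, f in enumerate(feats) if j != i and labels[j] == cls]
        neg = [f for j, f in enumerate(feats) if labels[j] != cls]
        if not pos or not neg:
            continue  # an anchor needs at least one positive and one negative
        neg_sum = sum(math.exp(dot(anchor, n) / temperature) for n in neg)
        anchor_loss = 0.0
        for p in pos:
            pos_term = math.exp(dot(anchor, p) / temperature)
            anchor_loss += -math.log(pos_term / (pos_term + neg_sum))
        total += anchor_loss / len(pos)  # average over the anchor's positives
        num_anchors += 1
    return total / num_anchors if num_anchors else 0.0


# Embeddings that cluster by class should give a lower loss than
# embeddings whose labels cut across the clusters.
clustered = pixel_contrast_loss([[1, 0], [0.9, 0.1], [0, 1], [0.1, 0.9]], [0, 0, 1, 1])
mixed = pixel_contrast_loss([[1, 0], [0, 1], [0.9, 0.1], [0.1, 0.9]], [0, 0, 1, 1])
print(clustered < mixed)
```

In a real training loop a term like this would be computed on sampled pixel embeddings in a vectorized way and added to the usual pixel-wise cross-entropy loss; since it only affects training, it is consistent with the claim of no extra overhead at test time.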

Installation

This implementation is built on openseg.pytorch. Many thanks to its authors for their efforts.

Please follow the Getting Started guide for installation and dataset preparation.

Running

Cityscapes

Train DeepLabV3

bash scripts/cityscapes/deeplab/run_r_101_d_8_deeplabv3_train_contrast.sh train 'resnet101-deeplabv3-contrast'

Features (in progress)

t-SNE Visualization

Pixel-wise Cross-Entropy Loss

Pixel-wise Contrastive Learning Objective

Citation

@article{wang2021exploring,
  title   = {Exploring Cross-Image Pixel Contrast for Semantic Segmentation},
  author  = {Wang, Wenguan and Zhou, Tianfei and Yu, Fisher and Dai, Jifeng and Konukoglu, Ender and Van Gool, Luc},
  journal = {arXiv preprint arXiv:2101.11939},
  year    = {2021}
}

Owner

Tianfei Zhou
