Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Recent works have made great success in semantic segmentation by exploiting contextual information in a local or global manner within individual image and supervising the model with pixel-wise cross entropy loss. However, from the holistic view of the whole dataset, semantic relations not only exist inside one single image, but also prevail in the whole training data, which makes solely considering intra-image correlations insufficient. Inspired by recent progress in unsupervised contrastive learning, we propose the region-aware contrastive learning (RegionContrast) for semantic segmentation in the supervised manner. In order to enhance the similarity of semantically similar pixels while keeping the discrimination from others, we employ contrastive learning to realize this objective. With the help of memory bank, we explore to store all the representative features into the memory. Without loss of generality, to efficiently incorporate all training data into the memory bank while avoiding taking too much computation resource, we propose to construct region centers to represent features from different categories for every image. Hence, the proposed region-aware contrastive learning is performed in a region level for all the training data, which saves much more memory than methods exploring the pixel-level relations. The proposed RegionContrast brings little computation cost during training and requires no extra overhead for testing. Extensive experiments demonstrate that our method achieves state-of-the-art performance on three benchmark datasets including Cityscapes, ADE20K and COCO Stuff. For more details, please refer to our ICCV paper (paper).

Installation

Check INSTALL.md for installation instructions.

Training and Evaluation

cd experiments/v3_contrast
bash train.sh

Citation

@InProceedings{Hu_2021_ICCV,
    author    = {Hu, Hanzhe and Cui, Jinshi and Wang, Liwei},
    title     = {Region-Aware Contrastive Learning for Semantic Segmentation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {16291-16301}
}

TODO

Dynamic Sampling

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Related tags

Overview

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Installation

Training and Evaluation

Citation

TODO

Owner

Hanzhe Hu

Code for Boundary-Aware Segmentation Network for Mobile and Web Applications

Conditional Generative Adversarial Networks (CGAN) for Mobility Data Fusion

ConvMixer unofficial implementation

Winning Solution in NTIRE19 Challenges on Video Restoration and Enhancement (CVPR19 Workshops) - Video Restoration with Enhanced Deformable Convolutional Networks. EDVR has been merged into BasicSR and this repo is a mirror of BasicSR.

An alarm clock coded in Python 3 with Tkinter

Disentangled Lifespan Face Synthesis

Augmented CLIP - Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.

Code for Multiple Instance Active Learning for Object Detection, CVPR 2021

[ICCV'21] Neural Radiance Flow for 4D View Synthesis and Video Processing

PyTorch implementations of deep reinforcement learning algorithms and environments

The official homepage of the (outdated) COCO-Stuff 10K dataset.

Unofficial PyTorch Implementation of Multi-Singer

Generate vibrant and detailed images using only text.

DvD-TD3: Diversity via Determinants for TD3 version

A hue shift helper for OBS

Robbing the FED: Directly Obtaining Private Data in Federated Learning with Modified Models

Kaggle G2Net Gravitational Wave Detection : 2nd place solution

PyTorch implementation for the paper Visual Representation Learning with Self-Supervised Attention for Low-Label High-Data Regime

External Attention Network

Python tools for 3D face: 3DMM, Mesh processing(transform, camera, light, render), 3D face representations.