ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Last update: Jan 03, 2023

Overview

ST++

This is the official PyTorch implementation of our paper:

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation.
Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi and Yang Gao.

Getting Started

Data Preparation

Pre-trained Model

ResNet-50 | ResNet-101 | DeepLabv2-ResNet-101

Dataset

Pascal | Augmented Masks | Cityscapes | Class Mapped Masks

File Organization

├── ./pretrained
    ├── resnet50.pth
    ├── resnet101.pth
    └── deeplabv2_resnet101_coco_pretrained.pth
    
├── [Your Pascal Path]
    ├── JPEGImages
    └── SegmentationClass    # replace the official folder with above augmented masks 
    
├── [Your Cityscapes Path]
    ├── gtFine               # replace the official folder with above class mapped masks 
    └── leftImg8bit

Training and Testing

export semi_setting='pascal/1_8/split_0'

CUDA_VISIBLE_DEVICES=0,1 python -W ignore main.py \
  --dataset pascal --data-root [Your Pascal Path] \
  --batch-size 16 --backbone resnet50 --model deeplabv3plus \
  --labeled-id-path dataset/splits/$semi_setting/labeled.txt \
  --unlabeled-id-path dataset/splits/$semi_setting/unlabeled.txt \
  --pseudo-mask-path outdir/pseudo_masks/$semi_setting \
  --save-path outdir/models/$semi_setting

This script is for our ST framework. To run ST++, add --plus --reliable-id-path outdir/reliable_ids/$semi_setting.

Acknowledgement

The DeepLabv2 MS COCO pre-trained model is borrowed and converted from AdvSemiSeg. The image partitions are borrowed from Context-Aware-Consistency and PseudoSeg. Part of the training hyper-parameters and network structures are adapted from PyTorch-Encoding. The strong data augmentations are borrowed from MoCo v2 and PseudoSeg.

AdvSemiSeg: https://github.com/hfslyc/AdvSemiSeg.
Context-Aware-Consistency: https://github.com/dvlab-research/Context-Aware-Consistency.
PseudoSeg: https://github.com/googleinterns/wss.
PyTorch-Encoding: https://github.com/zhanghang1989/PyTorch-Encoding.
MoCo: https://github.com/facebookresearch/moco.
OpenSelfSup: https://github.com/open-mmlab/OpenSelfSup.

Thanks a lot for their great works!

Citation

If you find this project useful, please consider citing:

@article{yang2021st++,
  title={ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation},
  author={Yang, Lihe and Zhuo, Wei and Qi, Lei and Shi, Yinghuan and Gao, Yang},
  journal={arXiv preprint arXiv:2106.05095},
  year={2021}
}

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Related tags

Overview

ST++

Getting Started

Data Preparation

Pre-trained Model

Dataset

File Organization

Training and Testing

Acknowledgement

Citation

Owner

Lihe Yang

A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).

Code for "Modeling Indirect Illumination for Inverse Rendering", CVPR 2022

RCD: Relation Map Driven Cognitive Diagnosis for Intelligent Education Systems

Code for the paper A Theoretical Analysis of the Repetition Problem in Text Generation

Code Release for Learning to Adapt to Evolving Domains

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

PyTorch implementation of Tacotron speech synthesis model.

Official code for the ICLR 2021 paper Neural ODE Processes

Code and Resources for the Transformer Encoder Reasoning Network (TERN)

Makes patches from huge resolution .svs slide files using openslide

PyTorch implementation of Lip to Speech Synthesis with Visual Context Attentional GAN (NeurIPS2021)

Python Auto-ML Package for Tabular Datasets

code for Fast Point Cloud Registration with Optimal Transport

Codes to calculate solar-sensor zenith and azimuth angles directly from hyperspectral images collected by UAV. Works only for UAVs that have high resolution GNSS/IMU unit.

PyTorch implementations of Generative Adversarial Networks.

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

Improving Calibration for Long-Tailed Recognition (CVPR2021)

This codebase is the official implementation of Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization (NeurIPS2021, Spotlight)

🗣️ Microsoft Edge TTS for Home Assistant, no need for app_key