Knowledge Distillation Toolbox for Semantic Segmentation

Last update: Dec 12, 2022

Overview

SegDistill: Toolbox for Knowledge Distillation on Semantic Segmentation Networks

This repo contains the supporting code and configuration files for SegDistill. It is based on mmsegmentation.

Installation

conda create -n mmcv python=3.8 -y
conda activate mmcv

pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html

pip install mmcv-full==1.2.2 -f https://download.openmmlab.com/mmcv/dist/cu110/torch1.7.0/index.html

pip install future tensorboard
pip install IPython
pip install attr
pip install timm

git clone https://github.com/wzpscott/SegDistill.git -b main
cd SegDistill
pip install -e .
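
After installation, it can help to confirm that the pinned versions were actually picked up by the environment. The following is a minimal sketch using only the Python standard library; the helper name `check_versions` is ours and is not part of SegDistill or mmsegmentation.

```python
from importlib import metadata

# Versions pinned by the install commands above.
EXPECTED = {
    "torch": "1.7.1+cu110",
    "torchvision": "0.8.2+cu110",
    "torchaudio": "0.7.2",
    "mmcv-full": "1.2.2",
}


def check_versions(expected=EXPECTED):
    """Return {package: (installed_version_or_None, expected_version, matches)}."""
    report = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed in this environment
        report[name] = (installed, wanted, installed == wanted)
    return report


if __name__ == "__main__":
    for name, (installed, wanted, ok) in check_versions().items():
        status = "ok" if ok else "MISMATCH"
        print(f"{name}: installed={installed} expected={wanted} [{status}]")
```

A mismatch does not necessarily mean the toolbox will fail, only that the environment differs from the one described here.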

Prepare Data

We conducted experiments on the ADE20K dataset. The training and validation sets of ADE20K can be downloaded from this link, and the test set can be downloaded from here. After downloading the dataset, arrange it in the following structure:

mmsegmentation
├── mmseg
├── tools
├── configs
├── data
│   ├── ade
│   │   ├── ADEChallengeData2016
│   │   │   ├── annotations
│   │   │   │   ├── training
│   │   │   │   ├── validation
│   │   │   ├── images
│   │   │   │   ├── training
│   │   │   │   ├── validation
│   ├── ...

See here for more instructions on data preparation.
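
The directory layout above can be checked before training starts. This is a small sketch, assuming the script is run from the repository root and that the dataset lives under `data/ade/ADEChallengeData2016` as shown in the tree; the function name `missing_ade20k_dirs` is ours.

```python
from pathlib import Path

# Sub-directories expected under ADEChallengeData2016, per the tree above.
REQUIRED_DIRS = [
    "annotations/training",
    "annotations/validation",
    "images/training",
    "images/validation",
]


def missing_ade20k_dirs(data_root="data/ade/ADEChallengeData2016"):
    """Return the required sub-directories that do not exist under data_root."""
    root = Path(data_root)
    return [d for d in REQUIRED_DIRS if not (root / d).is_dir()]


if __name__ == "__main__":
    missing = missing_ade20k_dirs()
    if missing:
        print("Missing directories:", ", ".join(missing))
    else:
        print("ADE20K layout looks complete.")
```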

Prepare Models

We provide links to pretrained weights of models used in the paper.

Model	Pretrained on ImageNet-1K	Trained on ADE20k
Segformer	link	link
Swin-Transformer	link	link
PSPNet	link	link

Write configs for semantic segmentation KD

We use mmcv-style configs to control the KD process.

Run an example config with the following command:

 bash tools/dist_train.sh distillation_configs/example_config.py {num_gpu}

See here for detailed instructions for custom KD process on various network architectures.

Channel Group Distillation

Our Channel Group Distillation (CGD) considers a more extensive range of correlations in the activation map and works better for transformer structures than previous KD methods.
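
The exact CGD formulation is not given in this README. As a rough illustration of what "a more extensive range of correlations" could mean, the sketch below assumes one plausible reading: activations are normalized with a softmax over a whole group of channels (all spatial positions of all channels in the group) and the student is matched to the teacher with a KL divergence. This is our assumption, written in pure Python for readability, and is not the authors' implementation; `grouped_channel_kl` and its parameters are hypothetical names.

```python
import math
from typing import List, Sequence


def log_softmax(values: Sequence[float], temperature: float = 1.0) -> List[float]:
    """Numerically stable log-softmax of a flat list of activations."""
    scaled = [v / temperature for v in values]
    peak = max(scaled)
    log_total = peak + math.log(sum(math.exp(s - peak) for s in scaled))
    return [s - log_total for s in scaled]


def grouped_channel_kl(
    teacher: Sequence[Sequence[float]],
    student: Sequence[Sequence[float]],
    group_size: int = 1,
    temperature: float = 1.0,
) -> float:
    """KL(teacher || student), with softmax taken over each group of channels.

    teacher/student: list of channels, each a flat list of spatial activations.
    ASSUMPTION: grouping consecutive channels is our illustrative choice.
    """
    if len(teacher) != len(student):
        raise ValueError("teacher and student must have the same channel count")
    if group_size < 1 or len(teacher) % group_size != 0:
        raise ValueError("channel count must be divisible by group_size")

    num_groups = len(teacher) // group_size
    loss = 0.0
    for start in range(0, len(teacher), group_size):
        # Flatten every channel in the group into one distribution.
        t_flat = [v for ch in teacher[start:start + group_size] for v in ch]
        s_flat = [v for ch in student[start:start + group_size] for v in ch]
        log_p = log_softmax(t_flat, temperature)
        log_q = log_softmax(s_flat, temperature)
        loss += sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
    # Temperature-squared scaling is the usual convention in distillation losses.
    return temperature ** 2 * loss / num_groups
```

With `group_size=1` each channel is normalized on its own, which resembles channel-wise distillation; larger groups let the normalization span several channels at once.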

Comparison to Other KD methods

Results on ADE20k

Qualitative segmentation results on ADE20k produced from Segformer B0: (a) raw images, (b) ground truth (GT), (c) output of the original student model, (d) Channel-wise Distillation (CD), and (e) Channel Group Distillation (CGD).
