PyTorch implementation of PSPNet

Last update: Nov 16, 2022

Overview

PSPNet with PyTorch

Unofficial implementation of "Pyramid Scene Parsing Network" (https://arxiv.org/abs/1612.01105). This repository only covers converting the authors' Caffe models to PyTorch and evaluating them.

Requirements

pytorch
click
addict
pydensecrf
protobuf

Preparation

You do not need to build the authors' Caffe implementation: the released caffemodels can be converted to PyTorch models using only the message definitions in caffe.proto.

1. Compile the `caffe.proto` for Python API

This step is optional and documented for reference only.
Download the authors' caffe.proto (not the one from the original Caffe repository) into the libs directory.

# For protoc command
pip install protobuf
# This generates ./caffe_pb2.py
protoc --python_out=. caffe.proto
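
Once caffe_pb2.py exists, a caffemodel can be read with the standard protobuf Python API. The following is a minimal sketch, not the repository's own code: it assumes the generated module is importable as `caffe_pb2`, that the top-level message is `NetParameter`, and that layers are stored in its `layer` field, as in upstream Caffe's caffe.proto. The helper names `load_net_parameter` and `summarize_layers` are hypothetical.

```python
import importlib


def load_net_parameter(caffemodel_path, pb2_module="caffe_pb2"):
    """Parse a binary .caffemodel file into a NetParameter message.

    `pb2_module` is the name of the module generated by protoc
    (assumed here to be importable from the current directory).
    """
    caffe_pb2 = importlib.import_module(pb2_module)
    net = caffe_pb2.NetParameter()
    with open(caffemodel_path, "rb") as f:
        # ParseFromString is the standard protobuf deserialization call.
        net.ParseFromString(f.read())
    return net


def summarize_layers(net):
    """Return (name, type, number of blobs) for every layer in the net."""
    return [(layer.name, layer.type, len(layer.blobs)) for layer in net.layer]
```

For example, `summarize_layers(load_net_parameter("data/models/pspnet50_ADE20K.caffemodel"))` would list the layers whose blobs need to be carried over, provided the file has been downloaded to that path.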

2. Model conversion

Download the caffemodels from the authors' page (e.g. pspnet50_ADE20K.caffemodel) and store them in the data/models/ directory.
Then convert them to .pth files:

python convert.py -c <PATH TO YAML>
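
Conceptually, conversion means reading each layer's weight blobs and copying them into tensors of the matching shape. The sketch below illustrates only that reshaping step in plain Python and is not the logic of convert.py. It assumes each blob exposes a flat `data` sequence and a `shape.dim` list, as in upstream Caffe's `BlobProto`; older caffemodels may use the legacy `num`/`channels`/`height`/`width` fields instead. The function names are hypothetical.

```python
def blob_shape(blob):
    """Return the blob's dimensions as a tuple of ints (from blob.shape.dim)."""
    return tuple(int(d) for d in blob.shape.dim)


def reshape_flat(values, dims):
    """Reshape a flat sequence into nested lists with the given dims (row-major)."""
    values = list(values)
    expected = 1
    for d in dims:
        expected *= d
    if len(values) != expected:
        raise ValueError(
            "blob has %d values but shape %r needs %d" % (len(values), dims, expected)
        )
    # Group from the innermost dimension outwards.
    for d in reversed(dims[1:]):
        values = [values[i:i + d] for i in range(0, len(values), d)]
    return values


def blob_to_nested(blob):
    """Convert one blob into nested lists shaped like its declared dims."""
    return reshape_flat(blob.data, blob_shape(blob))
```

With PyTorch available, the nested list returned by `blob_to_nested` could be passed to `torch.tensor(...)` and stored under the corresponding parameter name in a state dict.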

Demo

python demo.py -c <PATH TO YAML> -i <PATH TO IMAGE>

Pass --no-cuda to run on the CPU.
Pass --crf to apply CRF post-processing.

Evaluation

PASCAL VOC2012 only. Please set the dataset path in config/voc12.yaml.

python eval.py -c config/voc12.yaml

88.1% mIoU (SS) and 88.6% mIoU (MS) on the validation set.
NOTE: this is 3 points lower than the Caffe implementation. Work in progress.
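
For readers unfamiliar with the metric, mIoU is the per-class intersection-over-union averaged over classes, usually accumulated through a confusion matrix. The following is a minimal pure-Python sketch of that definition, not the code in eval.py. It assumes labels are flat sequences of class indices and that 255 marks ignored pixels, which is the PASCAL VOC convention; the function names are hypothetical.

```python
def confusion_matrix(gt, pred, n_classes, ignore_label=255):
    """Count (ground truth, prediction) pairs, skipping ignored pixels."""
    hist = [[0] * n_classes for _ in range(n_classes)]
    for g, p in zip(gt, pred):
        if g == ignore_label:
            continue
        hist[g][p] += 1
    return hist


def mean_iou(hist):
    """Average IoU over the classes that appear in gt or prediction."""
    ious = []
    for c in range(len(hist)):
        true_pos = hist[c][c]
        gt_count = sum(hist[c])                      # pixels labeled c
        pred_count = sum(row[c] for row in hist)     # pixels predicted c
        union = gt_count + pred_count - true_pos
        if union > 0:
            ious.append(true_pos / union)
    return sum(ious) / len(ious)
```

In a dataset-level evaluation the confusion matrix is typically summed over all images before `mean_iou` is applied once at the end.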

SS: averaged prediction with flipping (2x)
MS: averaged prediction with multi-scaling (6x) and flipping (2x)
Both: No CRF post-processing
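
The flip averaging used in both settings can be sketched as follows. This is an illustration under simplifying assumptions, not the repository's evaluation code: images and score maps are plain lists of rows, and `predict` is a hypothetical callable that maps an image to a score map of the same height and width.

```python
def hflip(rows):
    """Mirror a 2D list of rows left to right."""
    return [list(reversed(row)) for row in rows]


def flip_averaged(predict, image):
    """Average the prediction on the image and on its mirrored copy.

    The prediction for the mirrored input is flipped back before
    averaging so that both score maps are spatially aligned.
    """
    direct = predict(image)
    mirrored = hflip(predict(hflip(image)))
    return [
        [(a + b) / 2 for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(direct, mirrored)
    ]
```

The multi-scale setting would apply the same idea at each input scale, resize every score map back to the original resolution, and average across scales as well.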

References

Official implementation: https://github.com/hszhao/PSPNet
Chainer implementation: https://github.com/mitmul/chainer-pspnet

Owner

Kazuto Nakashima
