TensorFlow implementation of the original paper: https://github.com/hszhao/PSPNet

Last update: Dec 29, 2022

Overview

Keras implementation of PSPNet (Caffe)

This repository implements the Pyramid Scene Parsing Network (PSPNet) architecture in Keras.

For the best compatibility, please use Python 3.5.

Setup

Install dependencies:
- TensorFlow (or tensorflow-gpu)
- Keras
- numpy
- scipy
- pycaffe (from the PSPNet Caffe code; optional, only needed for converting the weights)
```
pip install -r requirements.txt --upgrade
```
Converted trained weights are needed to run the network. The weights (in .h5 and .json format) must be downloaded and placed in the directory weights/keras.
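
Before running the network, it can help to verify that the weight files are where they are expected. The following is a minimal sketch, not part of the repository: it assumes each model's weights are stored as `<model>.h5` and `<model>.json` inside weights/keras, and the helper name `missing_weight_files` is hypothetical.

```python
from pathlib import Path

# Assumption: weights live in weights/keras as <model>.h5 and <model>.json.
WEIGHTS_DIR = Path("weights") / "keras"


def missing_weight_files(model_name, weights_dir=WEIGHTS_DIR):
    """Return the expected weight files that are not present on disk."""
    expected = [Path(weights_dir) / (model_name + ext) for ext in (".h5", ".json")]
    return [path for path in expected if not path.is_file()]


if __name__ == "__main__":
    missing = missing_weight_files("pspnet101_voc2012")
    if missing:
        print("Missing weight files:", ", ".join(str(p) for p in missing))
    else:
        print("All weight files found.")
```

If the actual file names differ from this assumed pattern, adjust the extensions or the directory accordingly.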

Already converted weights can be downloaded here:

Convert weights by yourself (optional)

(Note: this is not required if you use the .h5/.json weights.)

Running the converter requires the compiled original PSPNet Caffe code and pycaffe.

python weight_converter.py <path to .prototxt> <path to .caffemodel>

Usage:

python pspnet.py -m <model> -i <input_image> -o <output_path>
python pspnet.py -m pspnet101_cityscapes -i example_images/cityscapes.png -o example_results/cityscapes.jpg
python pspnet.py -m pspnet101_voc2012 -i example_images/pascal_voc.jpg -o example_results/pascal_voc.jpg
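
For readers who want to script around the tool, a command line like the ones above could be parsed along the following lines. This is a hypothetical sketch, not the parser used in pspnet.py: the long option names for `-i` and `-o`, the choice to make them required, and the help strings are assumptions; the remaining flags follow the options documented in this README.

```python
import argparse

# Model names as listed in this README.
MODELS = ["pspnet50_ade20k", "pspnet101_cityscapes", "pspnet101_voc2012"]


def build_parser():
    """Build an illustrative parser mirroring the documented options."""
    parser = argparse.ArgumentParser(description="PSPNet inference (sketch)")
    parser.add_argument("-m", "--model", required=True, choices=MODELS,
                        help="which model to use")
    # Assumption: the long names below are invented for this example.
    parser.add_argument("-i", "--input_path", required=True,
                        help="path to the input image")
    parser.add_argument("-o", "--output_path", required=True,
                        help="path for the output image")
    parser.add_argument("--id", type=int, default=0, help="GPU device id")
    parser.add_argument("-s", "--sliding", action="store_true",
                        help="use sliding window")
    parser.add_argument("-f", "--flip", action="store_true",
                        help="additional prediction of flipped image")
    parser.add_argument("-ms", "--multi_scale", action="store_true",
                        help="predict on multiscale images")
    return parser


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args([
    "-m", "pspnet101_cityscapes",
    "-i", "example_images/cityscapes.png",
    "-o", "example_results/cityscapes.jpg",
    "-f", "-ms",
])
print(args.model, args.id, args.sliding, args.flip, args.multi_scale)
```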

List of arguments:

 -m --model        - which model to use: 'pspnet50_ade20k', 'pspnet101_cityscapes', 'pspnet101_voc2012'
    --id           - (int) GPU Device id. Default 0
 -s --sliding      - Use sliding window
 -f --flip         - Additional prediction of flipped image
 -ms --multi_scale - Predict on multiscale images
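
To illustrate what the `-f` option refers to, here is a minimal NumPy sketch of flip-averaged prediction. It is an assumption about the general technique, not the repository's code: `predict_fn` is a hypothetical callable that maps an (H, W, 3) image to an (H, W, num_classes) array of class scores.

```python
import numpy as np


def predict_with_flip(predict_fn, image):
    """Average predictions over the image and its horizontal mirror.

    predict_fn is a hypothetical stand-in for the network's forward pass.
    """
    scores = predict_fn(image)
    # Predict on the horizontally mirrored image (flip the width axis).
    flipped_scores = predict_fn(image[:, ::-1, :])
    # Mirror the second prediction back so pixels line up, then average.
    return 0.5 * (scores + flipped_scores[:, ::-1, :])


if __name__ == "__main__":
    # Dummy "network" for demonstration: returns the image as float scores.
    def dummy_predict(img):
        return img.astype(float)

    image = np.random.rand(4, 6, 3)
    averaged = predict_with_flip(dummy_predict, image)
    print(averaged.shape)
```

Flip averaging costs one extra forward pass per image, so it roughly doubles inference time in exchange for smoother predictions.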

Keras results:

Implementation details

The interpolation layer is implemented as a custom layer, "Interp".
A forward pass takes about 1 second for a single image.
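
The code of the "Interp" layer is not reproduced in this README. Assuming it performs bilinear resizing of feature maps with aligned corners (an assumption based on how such interpolation layers are usually described), the operation can be sketched in plain NumPy as follows. The function name `interp_bilinear` is hypothetical.

```python
import numpy as np


def interp_bilinear(features, out_h, out_w):
    """Resize an (H, W, C) array to (out_h, out_w, C) bilinearly.

    The corner pixels of the input and output grids are aligned.
    """
    in_h, in_w = features.shape[:2]
    # Fractional source coordinates for every output row and column.
    ys = np.linspace(0.0, in_h - 1, out_h)
    xs = np.linspace(0.0, in_w - 1, out_w)
    y0 = np.floor(ys).astype(int)
    x0 = np.floor(xs).astype(int)
    y1 = np.minimum(y0 + 1, in_h - 1)
    x1 = np.minimum(x0 + 1, in_w - 1)
    # Interpolation weights, shaped to broadcast over (out_h, out_w, C).
    wy = (ys - y0)[:, None, None]
    wx = (xs - x0)[None, :, None]
    # Blend horizontally along the two neighbouring rows, then vertically.
    top = features[y0][:, x0] * (1 - wx) + features[y0][:, x1] * wx
    bottom = features[y1][:, x0] * (1 - wx) + features[y1][:, x1] * wx
    return top * (1 - wy) + bottom * wy


if __name__ == "__main__":
    feature_map = np.random.rand(8, 8, 16)
    resized = interp_bilinear(feature_map, 32, 32)
    print(resized.shape)
```

In a Keras model the same operation would run on tensors inside a custom layer; the NumPy version only shows the arithmetic.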

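GPU memory usage can be limited with (TensorFlow 1.x API):

import tensorflow as tf

config = tf.ConfigProto()
# Let this process allocate at most 30% of each visible GPU's memory.
config.gpu_options.per_process_gpu_memory_fraction = 0.3
sess = tf.Session(config=config)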

ndimage.zoom (from scipy) can take a long time.


Owner

VladKry
