Pixel-wise segmentation on the VOC2012 dataset using PyTorch.

Last update: Dec 30, 2022

Overview

PiWiSe

Pixel-wise segmentation on the VOC2012 dataset using PyTorch.

For a more complete implementation of segmentation networks, check out semseg.

Note:

FCN differs from the original implementation (see this issue).
SegNet does not match the performance reported in the original paper (see here).
PSPNet lacks atrous convolution (the conv layers of ResNet101 should be amended to preserve image size).

With these limitations in mind, feel free to open a PR. Thank you!

Setup

See dataset examples here.

Download

Download the image archive, extract it, and run:

mkdir data
mv VOCdevkit/VOC2012/JPEGImages data/images
mv VOCdevkit/VOC2012/SegmentationClass data/classes
rm -rf VOCdevkit
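
After the move, it can be worth checking that every class mask has a matching image. The sketch below is not part of the repository; it assumes only the data/images and data/classes layout created above and the VOC2012 convention that a mask and its image share a file name (.png and .jpg respectively). The function name unpaired_masks is hypothetical.

```python
from pathlib import Path


def unpaired_masks(images_dir, classes_dir):
    """Return the sorted stems of class masks that have no matching JPEG."""
    # Stems are file names without the extension, e.g. "2007_000032".
    image_stems = {p.stem for p in Path(images_dir).glob("*.jpg")}
    mask_stems = {p.stem for p in Path(classes_dir).glob("*.png")}
    return sorted(mask_stems - image_stems)
```

Calling unpaired_masks("data/images", "data/classes") should return an empty list if the archive was extracted completely. The reverse is not expected to hold: JPEGImages contains many images that have no segmentation mask.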

Install

We recommend using pyenv:

pyenv virtualenv 3.6.0 piwise
pyenv activate piwise

Then install the requirements with pip install -r requirements.txt.

Usage

For the latest documentation, run:

python main.py --help

Supported values for the model parameter are fcn8, fcn16, fcn32, unet, segnet1, segnet2, and pspnet.

Training

If you want visualization, open an extra terminal tab and run:

python -m visdom.server -port 5000

Train the SegNet model for 30 epochs with CUDA support, visualization, and a checkpoint every 100 steps:

python main.py --cuda --model segnet2 train --datadir data \
    --num-epochs 30 --num-workers 4 --batch-size 4 \
    --steps-plot 50 --steps-save 100
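
To estimate how many checkpoints such a run writes, the arithmetic can be sketched as follows. This is an illustration under assumptions the README does not confirm: that the step counter restarts at 0 in every epoch, and that a checkpoint is written whenever the step is a multiple of --steps-save. The function name checkpoint_steps and the image count in the usage note are hypothetical.

```python
import math


def checkpoint_steps(num_images, batch_size, steps_save):
    """List the steps within one epoch at which a checkpoint would be saved.

    Assumes steps are counted from 0 in each epoch and a checkpoint is
    written when step % steps_save == 0 (an assumption, see the text above).
    """
    steps_per_epoch = math.ceil(num_images / batch_size)
    return [step for step in range(steps_per_epoch) if step % steps_save == 0]
```

For a hypothetical 1000 training images with --batch-size 4 and --steps-save 100, checkpoint_steps(1000, 4, 100) gives [0, 100, 200], that is, three checkpoints per epoch under these assumptions.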

Evaluation

To run semantic segmentation on foo.jpg with a trained checkpoint:

python main.py --model segnet2 --state segnet2-30-0 eval foo.jpg foo.png

The segmented class image can now be found at foo.png.

Results

These are some results from segnet after 40 epochs. Set

loss_weights[0] = 1 / 1

to deal gracefully with the class imbalance problem.
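
To see why a per-class weight helps with imbalance, here is a minimal pure-Python sketch of a weighted negative log-likelihood. It assumes that loss_weights is a per-class weight vector passed to the loss and that index 0 is the VOC background class; neither is spelled out in this README. The function weighted_nll is hypothetical and not the repository's code. It uses the same normalization as PyTorch's NLLLoss with a weight argument and mean reduction: the weighted sum divided by the sum of the weights.

```python
import math


def weighted_nll(log_probs, targets, class_weights):
    """Weighted mean negative log-likelihood over pixels.

    log_probs: per-pixel lists of class log-probabilities
    targets: per-pixel ground-truth class indices
    class_weights: one weight per class (the role loss_weights is assumed to play)
    """
    total = 0.0
    norm = 0.0
    for pixel_log_probs, target in zip(log_probs, targets):
        weight = class_weights[target]
        total += -weight * pixel_log_probs[target]
        norm += weight
    return total / norm if norm > 0 else 0.0


# Three well-predicted background pixels and one poorly predicted object pixel.
background = [math.log(0.9), math.log(0.1)]
missed_object = [math.log(0.8), math.log(0.2)]
log_probs = [background, background, background, missed_object]
targets = [0, 0, 0, 1]

uniform = weighted_nll(log_probs, targets, [1.0, 1.0])
background_down_weighted = weighted_nll(log_probs, targets, [0.1, 1.0])
```

With uniform weights the three easy background pixels dilute the loss. Lowering the weight of class 0 makes the missed object pixel dominate, so background_down_weighted is larger than uniform.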

Input	Output	Ground Truth

Owner

Bodo Kaiser
