Using fully convolutional networks for semantic segmentation with caffe for the cityscapes dataset

Last update: Jun 06, 2022

Overview

Using fully convolutional networks for semantic segmentation (Shelhamer et al.) with caffe for the cityscapes dataset

How to get started

Download the cityscapes dataset and the vgg-16-layer net
Modify the images in the dataset with cut_images.py or downscale_images.py for less resource demanding training and evaluation
Create the 32 pixel stride net with net_32.py
Modify the paths in train.txt and val.txt (first line: path to training/validation images, second line: path to annotations)
Start training with solve_start.py
Run evaluate_models.py to evaluate your model or create_eval_images.py to create images with pixel label ids

Sources

Fully Convolutional Models for Semantic Segmentation:

Shelhamer, Evan, Jonathon Long, and Trevor Darrell. "Fully Convolutional Networks for Semantic Segmentation." PAMI, 2016, URL http://fcn.berkeleyvision.org

Cityscapes Dataset (Semantic Understanding of Urban Street Scenes):

Cordts, Marius, et al. "The cityscapes dataset." CVPR Workshop on The Future of Datasets in Vision. 2015, URL https://www.cityscapes-dataset.com

Caffe Deep Learning Framework:

Jia, Yangqing, et al. "Caffe: Convolutional architecture for fast feature embedding." Proceedings of the 22nd ACM international conference on Multimedia. ACM, 2014, URL http://caffe.berkeleyvision.org

Using fully convolutional networks for semantic segmentation with caffe for the cityscapes dataset

Related tags

Overview

How to get started

Sources

Fully Convolutional Models for Semantic Segmentation:

Cityscapes Dataset (Semantic Understanding of Urban Street Scenes):

Caffe Deep Learning Framework:

Owner

Simon Guist

This is a tensorflow-based rotation detection benchmark, also called AlphaRotate.

This repository is the code of the paper Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies

Learning to Disambiguate Strongly Interacting Hands via Probabilistic Per-Pixel Part Segmentation [3DV 2021 Oral]

Repository providing a wide range of self-supervised pretrained models for computer vision tasks.

This is an official source code for implementation on Extensive Deep Temporal Point Process

Step by Step on how to create an vision recognition model using LOBE.ai, export the model and run the model in an Azure Function

GPU Programming with Julia - course at the Swiss National Supercomputing Centre (CSCS), ETH Zurich

Code accompanying "Dynamic Neural Relational Inference" from CVPR 2020

Text-to-Image generation

Implementation of Nyström Self-attention, from the paper Nyströmformer

Code for the Paper "Diffusion Models for Handwriting Generation"

Trading Gym is an open source project for the development of reinforcement learning algorithms in the context of trading.

Deep Inside Convolutional Networks - This is a caffe implementation to visualize the learnt model

pq is a jq-like Pickle file viewer

Yggdrasil - A simplistic bot designed to streamline your server experience

Monitora la qualità della ricezione dei segnali radio nelle province siciliane.

Tensorflow implementation of "Learning Deep Features for Discriminative Localization"

Framework for evaluating ANNS algorithms on billion scale datasets.

Causal Influence Detection for Improving Efficiency in Reinforcement Learning

[CVPR 2021] Official PyTorch Implementation for "Iterative Filter Adaptive Network for Single Image Defocus Deblurring"