FCN-semantic-segmentation

Simple end-to-end semantic segmentation using fully convolutional networks [1]. Takes a pretrained 34-layer ResNet [2], removes the fully connected layers, and adds transposed convolution layers with skip connections from lower layers. Initialises upsampling convolutions with bilinear interpolation filters and zeros the final (classification) layer.

Uses an independent cross-entropy loss per class. Trained with SGD with momentum, plus weight decay only on convolutional weights. Calculates and plots class-wise and mean intersection-over-union. Checkpoints the network every epoch.

Note: This code does not achieve great results (achieves ~40 IoU fairly quickly, but converges there). Contributions to fix this are welcome! The goal of this repo is to provide strong, simple and efficient baselines for semantic segmentation using the FCN method, so this shouldn't be restricted to using ResNet 34 etc.

Requirements

Instructions

Install all of the required software. To feasibly run the training, CUDA is needed. The crop size and batch size can be tailored to your GPU memory (the default crop and batch sizes use ~10GB of GPU RAM).
Register on the Cityscapes website to access the dataset.
Download and extract the training/validation RGB data (leftImg8bit_trainvaltest) and ground truth data (gtFine_trainvaltest).
Run python main.py <options>.

First a Dataset object is set up, returning the RGB inputs, one-hot targets (for independent classification) and label targets. During training, the images are randomly cropped and horizontally flipped. Testing calculates IoU scores and produces a subset of coloured predictions that match the coloured ground truth.

References

[1] Fully convolutional networks for semantic segmentation
[2] Deep Residual Learning for Image Recognition

Fully convolutional networks for semantic segmentation

Related tags

Overview

FCN-semantic-segmentation

Requirements

Instructions

References

Owner

Kai Arulkumaran

CompilerGym is a library of easy to use and performant reinforcement learning environments for compiler tasks

Python library for analysis of time series data including dimensionality reduction, clustering, and Markov model estimation

Inteligência artificial criada para realizar interação social com idosos.

Attention-driven Robot Manipulation (ARM) which includes Q-attention

METER: Multimodal End-to-end TransformER

Semi-SDP Semi-supervised parser for semantic dependency parsing.

A gesture recognition system powered by OpenPose, k-nearest neighbours, and local outlier factor.

Some pre-commit hooks for OpenMMLab projects

Object detection (YOLO) with pytorch, OpenCV and python

Bridging the Gap between Label- and Reference based Synthesis(ICCV 2021)

Official Implementation of CVPR 2022 paper: "Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning"

Vector.ai assignment

code for Grapadora research paper experimentation

The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation.

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

Official Pytorch Implementation of Adversarial Instance Augmentation for Building Change Detection in Remote Sensing Images.

Ascend your Jupyter Notebook usage

MTCNN face detection implementation for TensorFlow, as a PIP package.

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Taichi Course Homework Template