Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Last update: May 28, 2022

Related tags

Deep Learning NRD_decoder

Overview

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Requirements

This repository needs mmsegmentation

Training

To train the model(s) in the paper, run this command:

python tools/train.py ./configs/NRD/ade20k/NRD_r101_512x512_164k_ade20k.py

The batch size is 16 in this work. Please change the 'samples_per_gpu' in configs/base/datasets/.. accordingly

Evaluation

To evaluate my model at single-scale inference, run:

python tools/eval.py ./configs/NRD/ade20k/NRD_r101_512x512_164k_ade20k.py  {path-to-checkpoint-file}   --eval mIoU

Pre-trained Models

Results

Our model achieves the following performance on :

[Semantic segmentation results]

Model name	datasets	mIoU	mIoU (ms)
NRD-r101	ade20k (val)	44.01	45.62
NRD-x101	ade20k (val)	44.34	46.35
NRD-r101	pascal-context(val)	52.31 (59 classes)	54.1 (59 classes)
NRD-r101	pascal-context(val)	47.5 (60 classes)	40.9 (60 classes)
NRD-r50	Cityscapes (val)	79.8	80.8
NRD-r101	Cityscapes (val)	80.7	82.0

Contributing

The code is mostly taken from mmsegmentation mmsegmentation is released under the Apache 2.0 license.

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Related tags

Overview

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Requirements

Training

Evaluation

Pre-trained Models

Results

[Semantic segmentation results]

Contributing

Owner

Asymmetric metric learning for knowledge transfer

Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.

Restricted Boltzmann Machines in Python.

Understanding and Overcoming the Challenges of Efficient Transformer Quantization

A python module for scientific analysis of 3D objects based on VTK and Numpy

A new benchmark for Icon Question Answering (IconQA) and a large-scale icon dataset Icon645.

Breast Cancer Classification Model is applied on a different dataset

Editing a classifier by rewriting its prediction rules

The toolkit to generate auto labeled datasets

Mining-the-Social-Web-3rd-Edition - The official online compendium for Mining the Social Web, 3rd Edition (O'Reilly, 2018)

TagLab: an image segmentation tool oriented to marine data analysis

Open-source implementation of Google Vizier for hyper parameters tuning

Repo for EchoVPR: Echo State Networks for Visual Place Recognition

Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021)

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

Pipeline code for Sequential-GAM(Genome Architecture Mapping).

Deep learning model, heat map, data prepo

A collection of papers about Transformer in the field of medical image analysis.

AtlasNet: A Papier-Mâché Approach to Learning 3D Surface Generation

Code for the ICML 2021 paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"