Understanding Convolution for Semantic Segmentation

Last update: Dec 31, 2022

Overview

TuSimple-DUC

by Panqu Wang, Pengfei Chen, Ye Yuan, Ding Liu, Zehua Huang, Xiaodi Hou, and Garrison Cottrell.

Introduction

This repository is for Understanding Convolution for Semantic Segmentation (WACV 2018), which achieved state-of-the-art result on the CityScapes, PASCAL VOC 2012, and Kitti Road benchmark.

Requirement

We tested our code on:

Ubuntu 16.04, Python 2.7 with

MXNet (0.11.0), numpy(1.13.1), cv2(3.2.0), PIL(4.2.1), and cython(0.25.2)

Usage

Clone the repository:

git clone [email protected]:TuSimple/TuSimple-DUC.git
python setup.py develop --user

Download the pretrained model from Google Drive.

Build MXNet (only tested on the TuSimple version):

git clone --recursive [email protected]:TuSimple/mxnet.git
vim make/config.mk (we should have USE_CUDA = 1, modify USE_CUDA_PATH, and have USE_CUDNN = 1 to enable GPU usage.)
make -j
cd python
python setup.py develop --user

For more MXNet tutorials, please refer to the official documentation.

Training:
```
cd train
python train_model.py ../configs/train/train_cityscapes.cfg
```
The paths/dirs in the .cfg file need to be specified by the user.

Testing

cd test
python predict_full_image.py ../configs/test/test_full_image.cfg

The paths/dirs in the .cfg file need to be specified by the user.

Results:

Modify the result_dir path in the config file to save the label map and visualizations. The expected scores are:

(single scale testing denotes as 'ss' and multiple scale testing denotes as 'ms')
- ResNet101-DUC-HDC on CityScapes testset (mIoU): 79.1(ss) / 80.1(ms)
- ResNet152-DUC on VOC2012 (mIoU): 83.1(ss)

Citation

If you find the repository is useful for your research, please consider citing:

@article{wang2017understanding,
  title={Understanding convolution for semantic segmentation},
  author={Wang, Panqu and Chen, Pengfei and Yuan, Ye and Liu, Ding and Huang, Zehua and Hou, Xiaodi and Cottrell, Garrison},
  journal={arXiv preprint arXiv:1702.08502},
  year={2017}
}

Questions

Please contact [email protected] or [email protected] .

Understanding Convolution for Semantic Segmentation

Related tags

Overview

TuSimple-DUC

Introduction

Requirement

Usage

Citation

Questions

Owner

TuSimple

A PyTorch implementation of EfficientNet and EfficientNetV2 (coming soon!)

Cryptocurrency Prediction with Artificial Intelligence (Deep Learning via LSTM Neural Networks)

Process text, including tokenizing and representing sentences as vectors and Applying some concepts like RNN, LSTM and GRU to create a classifier can detect the language in which a sentence is written from among 17 languages.

This project is a loose implementation of paper "Algorithmic Financial Trading with Deep Convolutional Neural Networks: Time Series to Image Conversion Approach"

Scalable Optical Flow-based Image Montaging and Alignment

Multi-objective gym environments for reinforcement learning.

Pyramid addon for OpenAPI3 validation of requests and responses.

This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

Fuzzing tool (TFuzz): a fuzzing tool based on program transformation

Stream images from a connected camera over MQTT, view using Streamlit, record to file and sqlite

Benchmark tools for Compressive LiDAR-to-map registration

This is RFA-Toolbox, a simple and easy-to-use library that allows you to optimize your neural network architectures using receptive field analysis (RFA) and create graph visualizations of your architecture.

Some tentative models that incorporate label propagation to graph neural networks for graph representation learning in nodes, links or graphs.

A containerized REST API around OpenAI's CLIP model.

NLMpy - A Python package to create neutral landscape models

Code for the paper One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation, CVPR 2021.

Generate fine-tuning samples & Fine-tuning the model & Generate samples by transferring Note On

Python implementation of "Single Image Haze Removal Using Dark Channel Prior"

[Arxiv preprint] Causality-inspired Single-source Domain Generalization for Medical Image Segmentation (code&data-processing pipeline)