A Strong Baseline for Image Semantic Segmentation

Introduction

This project is an open source semantic segmentation toolbox based on PyTorch. It is based on the codes of our Tianchi competition in 2021 (https://tianchi.aliyun.com/competition/entrance/531860/introduction).
In the competition, our team won the third place (please see Tianchi_README.md).

Overview

The master branch works with PyTorch 1.6+.The project now supports popular and contemporary semantic segmentation frameworks, e.g. UNet, DeepLabV3+, HR-Net etc.

Requirements

Support

Backbone

ResNet (CVPR'2016)
SeNet (CVPR'2018)
IBN-Net (CVPR'2018)
EfficientNet (CVPR'2020)

Methods

Tricks

Tools

large image inference (cut and merge)
post process (crf/superpixels)

Quick Start

Train a model

python train.py --config_file ${CONFIG_FILE}

CONFIG_FILE: File of training config about model

Examples:
We trained our model in Tianchi competition according to the following script:
Stage 1 (160e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_160e.yml

Stage 2 (swa 24e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_swa.yml

Inference with pretrained models

python inference.py --config_file ${CONFIG_FILE}

CONFIG_FILE: File of inference config about model

Predict large image with pretrained models

python predict_demo.py --config_file ${CONFIG_FILE} --rs_img_file ${IMAGE_FILE_PATH} --temp_img_save_path ${TEMP_CUT_PATH} -temp_seg_map_save_path ${TEMP_SAVE_PATH} --save_seg_map_file ${SAVE_SEG_FILE}

CONFIG_FILE: File of inference config about model
IMAGE_FILE_PATH: File of large input image to predict
TEMP_CUT_PATH: Temp folder of small cutting samples
TEMP_SAVE_PATH: Temp folder of predict results of cutting samples
SAVE_SEG_FILE: Predict result of the large image

A Strong Baseline for Image Semantic Segmentation

Related tags

Overview

A Strong Baseline for Image Semantic Segmentation

Introduction

Overview

Requirements

Support

Backbone

Methods

Tricks

Tools

Quick Start

Train a model

Inference with pretrained models

Predict large image with pretrained models

Owner

Clark He

traiNNer is an open source image and video restoration (super-resolution, denoising, deblurring and others) and image to image translation toolbox based on PyTorch.

The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text"

Yolact-keras实例分割模型在keras当中的实现

PrimitiveNet: Primitive Instance Segmentation with Local Primitive Embedding under Adversarial Metric (ICCV 2021)

PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

Dynamica causal Bayesian optimisation

Main Results on ImageNet with Pretrained Models

Image Classification - A research on image classification and auto insurance claim prediction, a systematic experiments on modeling techniques and approaches

Job Assignment System by Real-time Emotion Detection

Scalable training for dense retrieval models.

Implementation of the pix2pix model on satellite images

offical implement of our Lifelong Person Re-Identification via Adaptive Knowledge Accumulation in CVPR2021

Teaching end to end workflow of deep learning

This is an example implementation of the paper "Cross Domain Robot Imitation with Invariant Representation".

DenseNet Implementation in Keras with ImageNet Pretrained Models

This repo is about to create the Streamlit application for given ML model.

A simple rest api that classifies pneumonia infection weather it is Normal, Pneumonia Virus or Pneumonia Bacteria from a chest-x-ray image.

EGNN - Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.