TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation

Introduction

TernausNet is a modification of the celebrated UNet architecture that is widely used for binary Image Segmentation. For more details, please refer to our arXiv paper.

Pre-trained encoder speeds up convergence even on the datasets with a different semantic features. Above curve shows validation Jaccard Index (IOU) as a function of epochs for Aerial Imagery

This architecture was a part of the winning solutiuon (1st out of 735 teams) in the Carvana Image Masking Challenge.

Installation

pip install ternausnet

Citing TernausNet

Please cite TernausNet in your publications if it helps your research:

@ARTICLE{arXiv:1801.05746,
         author = {V. Iglovikov and A. Shvets},
          title = {TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation},
        journal = {ArXiv e-prints},
         eprint = {1801.05746},
           year = 2018
        }

Example of the train and test pipeline

https://github.com/ternaus/robot-surgery-segmentation

UNet model with VGG11 encoder pre-trained on Kaggle Carvana dataset

Related tags

Overview

TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation

Introduction

Installation

Citing TernausNet

Example of the train and test pipeline

Owner

Vladimir Iglovikov

Implementation of Pooling by Sliced-Wasserstein Embedding (NeurIPS 2021)

A Differentiable Recipe for Learning Visual Non-Prehensile Planar Manipulation

Inteligência artificial criada para realizar interação social com idosos.

Robotics environments

3rd place solution for the Weather4cast 2021 Stage 1 Challenge

Evaluating Cross-lingual Sentence Representations

The final project for "Applying AI to Wearable Device Data" course from "AI for Healthcare" - Udacity.

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Code for the paper "How Attentive are Graph Attention Networks?"

Practical Single-Image Super-Resolution Using Look-Up Table

Implementing DropPath/StochasticDepth in PyTorch

This repository contains the implementation of Deep Detail Enhancment for Any Garment proposed in Eurographics 2021

Source code for paper "Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling", AAAI 2021

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN.

Regularized Frank-Wolfe for Dense CRFs: Generalizing Mean Field and Beyond

Python module providing a framework to trace individual edges in an image using Gaussian process regression.

This repo is to be freely used by ML devs to check the GAN performances without coding from scratch.

Official implementation of the ICCV 2021 paper: "The Power of Points for Modeling Humans in Clothing".

Official code for the paper: Deep Graph Matching under Quadratic Constraint (CVPR 2021)

Punctuation Restoration using Transformer Models for High-and Low-Resource Languages