PyTorch implementation of ENet

Last update: Dec 29, 2022

Overview

PyTorch-ENet

PyTorch (v1.1.0) implementation of ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation, ported from the lua-torch implementation ENet-training created by the authors.

This implementation has been tested on the CamVid and Cityscapes datasets. Currently, pre-trained versions of the model, trained on CamVid and Cityscapes, are available in the save folder.

Dataset	Classes ¹	Input resolution	Batch size	Epochs	Mean IoU (%)	GPU memory (GiB)	Training time (hours)²
CamVid	11	480x360	10	300	52.1³	4.2	1
Cityscapes	19	1024x512	4	300	59.5⁴	5.4	20

¹ When referring to the number of classes, the void/unlabeled class is always excluded.
² These values are for reference only. Changes in implementation, datasets, or hardware can lead to very different results. Reference hardware: Nvidia GTX 1070 and an AMD Ryzen 5 3600 (3.6 GHz). Training for only 100 epochs or so also gives a similar mean IoU (within ± 2%).
³ Test set.
⁴ Validation set.
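
As a rough illustration of how a mean IoU figure like those in the table can be computed with the void/unlabeled class excluded, here is a minimal dependency-free sketch. It is not the code from the metric folder: the function name mean_iou, the ignore_index parameter, and the example confusion matrix are assumptions made for illustration.

```python
def mean_iou(confusion, ignore_index=None):
    """Mean intersection-over-union from a square confusion matrix.

    confusion[t][p] counts pixels with true class t and predicted class p.
    ignore_index (hypothetical parameter) names a class, such as
    void/unlabeled, whose row and column are left out of every count.
    """
    n = len(confusion)
    kept = [k for k in range(n) if k != ignore_index]
    ious = []
    for k in kept:
        tp = confusion[k][k]
        fn = sum(confusion[k][p] for p in kept) - tp  # missed pixels of class k
        fp = sum(confusion[t][k] for t in kept) - tp  # pixels wrongly assigned to k
        denom = tp + fp + fn
        if denom > 0:  # skip classes that never occur
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else float("nan")


# Made-up 3-class example in which class 2 plays the role of "unlabeled".
confusion = [
    [8, 2, 0],
    [1, 9, 0],
    [3, 4, 5],
]
print(round(mean_iou(confusion, ignore_index=2), 4))  # 0.7386
print(round(mean_iou(confusion), 4))                  # 0.5169
```

Including or excluding the unlabeled class changes the score noticeably in this toy case, which is why footnote ¹ matters when comparing numbers across implementations.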

Installation

Local pip

Python 3 and pip
Set up a virtual environment (optional, but recommended)
Install dependencies using pip: pip install -r requirements.txt

Docker image

Build the image: docker build -t enet .
Run: docker run -it --gpus all --ipc host enet

Usage

Run main.py, the main script file used for training and/or testing the model. The following options are supported:

python main.py [-h] [--mode {train,test,full}] [--resume]
               [--batch-size BATCH_SIZE] [--epochs EPOCHS]
               [--learning-rate LEARNING_RATE] [--lr-decay LR_DECAY]
               [--lr-decay-epochs LR_DECAY_EPOCHS]
               [--weight-decay WEIGHT_DECAY] [--dataset {camvid,cityscapes}]
               [--dataset-dir DATASET_DIR] [--height HEIGHT] [--width WIDTH]
               [--weighing {enet,mfb,none}] [--with-unlabeled]
               [--workers WORKERS] [--print-step] [--imshow-batch]
               [--device DEVICE] [--name NAME] [--save-dir SAVE_DIR]

For help on the optional arguments run: python main.py -h
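
To make the synopsis above easier to read, here is a minimal argparse sketch covering a subset of the listed options. It is not the project's args.py: the build_parser name and every default value are hypothetical placeholders. Options shown without a value in the synopsis (such as --resume and --with-unlabeled) are modeled as on/off switches, and the -m short form is taken from the examples in this README.

```python
import argparse


def build_parser():
    # All defaults here are illustrative assumptions, not the project's values.
    parser = argparse.ArgumentParser(prog="main.py")
    parser.add_argument("--mode", "-m", choices=["train", "test", "full"],
                        default="train")
    # No value in the synopsis, so these are treated as boolean switches.
    parser.add_argument("--resume", action="store_true")
    parser.add_argument("--with-unlabeled", action="store_true")
    parser.add_argument("--batch-size", type=int, default=10)
    parser.add_argument("--epochs", type=int, default=300)
    parser.add_argument("--dataset", choices=["camvid", "cityscapes"],
                        default="camvid")
    parser.add_argument("--weighing", choices=["enet", "mfb", "none"],
                        default="enet")
    parser.add_argument("--save-dir", default="save")
    return parser


args = build_parser().parse_args(
    ["--mode", "test", "--dataset", "cityscapes", "--resume"]
)
print(args.mode, args.dataset, args.resume, args.batch_size)
# test cityscapes True 10
```

Dashes in option names become underscores on the parsed namespace, so --batch-size is read back as args.batch_size.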

Examples: Training

python main.py -m train --save-dir save/folder/ --name model_name --dataset name --dataset-dir path/root_directory/
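
The --weighing option matters mostly during training, where class imbalance affects the loss. The sketch below assumes that enet refers to the class weighting described in the ENet paper, w = 1 / ln(c + p) with c = 1.02, and that mfb refers to median frequency balancing. The function names and the pixel counts are made up for illustration, and the project's own implementation may differ.

```python
import math
from statistics import median


def enet_weights(pixel_counts, c=1.02):
    """Weight each class by 1 / ln(c + p), where p is its share of all pixels."""
    total = sum(pixel_counts)
    return [1.0 / math.log(c + count / total) for count in pixel_counts]


def median_frequency_weights(pixel_counts, image_pixel_totals):
    """Median frequency balancing.

    freq(k) = pixels of class k / total pixels of the images containing k,
    and the weight of class k is median(freq) / freq(k).
    Assumes every class appears at least once, so no frequency is zero.
    """
    freqs = [count / total
             for count, total in zip(pixel_counts, image_pixel_totals)]
    middle = median(freqs)
    return [middle / freq for freq in freqs]


# Made-up counts for three classes: common, uncommon, rare.
counts = [900, 90, 10]
print(enet_weights(counts))
print(median_frequency_weights(counts, [1000, 1000, 500]))
```

In both schemes rarer classes receive larger weights. With c = 1.02 the ENet-style weights stay bounded, since 1 / ln(1.02) is roughly 50 even for a class that almost never appears.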

Examples: Resuming training

python main.py -m train --resume --save-dir save/folder/ --name model_name --dataset name --dataset-dir path/root_directory/

Examples: Testing

python main.py -m test --save-dir save/folder/ --name model_name --dataset name --dataset-dir path/root_directory/

Project structure

Folders

data: Contains instructions on how to download the datasets and the code that handles data loading.
metric: Evaluation-related metrics.
models: ENet model definition.
save: By default, main.py will save models in this folder. The pre-trained models can also be found here.

Files

args.py: Contains all command-line options.
main.py: Main script file used for training and/or testing the model.
test.py: Defines the Test class which is responsible for testing the model.
train.py: Defines the Train class which is responsible for training the model.
transforms.py: Defines image transformations to convert an RGB image encoding classes to a torch.LongTensor and vice versa.
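
The conversion that transforms.py is described as performing can be pictured with the following dependency-free sketch. It is an illustration only, not the project's code: the palette, the function names, and the use of nested lists in place of a torch.LongTensor are all assumptions.

```python
# Hypothetical three-color palette; each dataset defines its own colors.
COLOR_TO_INDEX = {
    (128, 128, 128): 0,
    (128, 0, 0): 1,
    (0, 0, 0): 2,
}
INDEX_TO_COLOR = {index: color for color, index in COLOR_TO_INDEX.items()}


def rgb_to_class_ids(image):
    """Map an H x W grid of (R, G, B) pixels to an H x W grid of class indices."""
    return [[COLOR_TO_INDEX[tuple(pixel)] for pixel in row] for row in image]


def class_ids_to_rgb(labels):
    """Inverse mapping: class indices back to their (R, G, B) colors."""
    return [[INDEX_TO_COLOR[index] for index in row] for row in labels]


image = [
    [(128, 128, 128), (128, 0, 0)],
    [(0, 0, 0), (128, 0, 0)],
]
labels = rgb_to_class_ids(image)
print(labels)                             # [[0, 1], [2, 1]]
print(class_ids_to_rgb(labels) == image)  # True
```

In a PyTorch pipeline the grid of indices would be held in an integer tensor so that it can be passed to a loss function as the target.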

Owner

David Silva
