Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

Last update: Dec 01, 2022

Overview

PortraitNet

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device". @ CAD&Graphics 2019

Introduction

We propose a real-time portrait segmentation model, called PortraitNet, that can run effectively and efficiently on mobile device. PortraitNet is based on a lightweight U-shape architecture with two auxiliary losses at the training stage, while no additional cost is required at the testing stage for portrait inference.

Portrait segmentation applications on mobile device.

Experimental setup

Requirements

python 2.7
PyTorch 0.3.0.post4
Jupyter Notebook
pip install easydict matplotlib tqdm opencv-python scipy pyyaml numpy

Download datasets

EG1800 Since several image URL links are invalid in the original EG1800 dataset, we finally use 1447 images for training and 289 images for validation.
Supervise-Portrait Supervise-Portrait is a portrait segmentation dataset collected from the public human segmentation dataset Supervise.ly using the same data process as EG1800.

Training

Network Architecture

Overview of PortraitNet.

Training Steps

Download the datasets (EG1800 or Supervise-Portriat). If you want to training at your own dataset, you need to modify data/datasets.py and data/datasets_portraitseg.py.
Prepare training/testing files, like data/select_data/eg1800_train.txt and data/select_data/eg1800_test.txt.
Select and modify the parameters in the folder of config.
Start the training with single gpu:

cd myTrain
python2.7 train.py

Testing

In the folder of myTest:

you can use EvalModel.ipynb to test on testing datasets.
you can use VideoTest.ipynb to test on a single image or video.

Visualization

Using tensorboard to visualize the training process:

cd path_to_save_model
tensorboard --logdir='./log'

Download models

from Dropbox:

mobilenetv2_eg1800_with_two_auxiliary_losses(Training on EG1800 with two auxiliary losses)
mobilenetv2_supervise_portrait_with_two_auxiliary_losses(Training on Supervise-Portrait with two auxiliary losses)
mobilenetv2_total_with_prior_channel(Training on Human with prior channel)

from Baidu Cloud:

mobilenetv2_eg1800_with_two_auxiliary_losses(Training on EG1800 with two auxiliary losses)
mobilenetv2_supervise_portrait_with_two_auxiliary_losses(Training on Supervise-Portrait with two auxiliary losses)
mobilenetv2_total_with_prior_channel(Training on Human with prior channel)

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

Related tags

Overview

PortraitNet

Introduction

Experimental setup

Requirements

Download datasets

Training

Network Architecture

Training Steps

Testing

Visualization

Download models

Owner

This is the repo for the paper `SumGNN: Multi-typed Drug Interaction Prediction via Efficient Knowledge Graph Summarization'. (published in Bioinformatics'21)

The Official PyTorch Implementation of "LSGM: Score-based Generative Modeling in Latent Space" (NeurIPS 2021)

Repository for code and dataset for our EMNLP 2021 paper - “So You Think You’re Funny?”: Rating the Humour Quotient in Standup Comedy.

Deep Dual Consecutive Network for Human Pose Estimation (CVPR2021)

Python implementation of the multistate Bennett acceptance ratio (MBAR)

Self-Supervised Multi-Frame Monocular Scene Flow (CVPR 2021)

WSDM2022 Challenge - Large scale temporal graph link prediction

Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper

Algorithmic Trading using RNN

CHERRY is a python library for predicting the interactions between viral and prokaryotic genomes

rastrainer is a QGIS plugin to training remote sensing semantic segmentation model based on PaddlePaddle.

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

Udacity's CS101: Intro to Computer Science - Building a Search Engine

An experimental technique for efficiently exploring neural architectures.

HSC4D: Human-centered 4D Scene Capture in Large-scale Indoor-outdoor Space Using Wearable IMUs and LiDAR. CVPR 2022

Code for paper Novel View Synthesis via Depth-guided Skip Connections

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

GraphLily: A Graph Linear Algebra Overlay on HBM-Equipped FPGAs

Notebooks em Python para Métodos Eletromagnéticos

ShapeGlot: Learning Language for Shape Differentiation