Deep Learning for Human Part Discovery in Images - Chainer implementation

Last update: Sep 25, 2022

Overview

NOTE: This is not the official implementation. The original paper is "Deep Learning for Human Part Discovery in Images".

We are currently reproducing the experiments in the original paper. Contributions are welcome!

Requirements

- Python 2.7.11+
- Chainer 1.10+
- numpy 1.9+
- scipy 0.16+
- six
- matplotlib
- tqdm
- cv2 (opencv)
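
Before training, it can help to confirm that the installed packages meet these minimums. The following is a minimal sketch and not part of this repository: the helper names `parse_version` and `check_requirements` are hypothetical, and it assumes that each versioned package exposes a `__version__` attribute.

```python
import importlib

# Minimum versions taken from the list above; None means no minimum is stated.
# Keys are import names (cv2 is the import name of the OpenCV bindings).
REQUIREMENTS = {
    "chainer": "1.10",
    "numpy": "1.9",
    "scipy": "0.16",
    "six": None,
    "matplotlib": None,
    "tqdm": None,
    "cv2": None,
}


def parse_version(text):
    """Turn '1.10.0rc1' into (1, 10, 0) using the leading digits of each dotted field."""
    parts = []
    for field in text.split("."):
        digits = ""
        for ch in field:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def check_requirements(requirements=REQUIREMENTS):
    """Return {import_name: status}; status is 'ok', 'missing', or 'too old (found X)'."""
    report = {}
    for name, minimum in sorted(requirements.items()):
        try:
            module = importlib.import_module(name)
        except ImportError:
            report[name] = "missing"
            continue
        found = getattr(module, "__version__", None)
        # Compare as integer tuples so that 1.10 correctly ranks above 1.9.
        if minimum and found and parse_version(found) < parse_version(minimum):
            report[name] = "too old (found %s)" % found
        else:
            report[name] = "ok"
    return report
```

Calling `check_requirements()` with no arguments inspects the packages listed above and returns one status string per import name. Versions are compared as integer tuples because a plain string comparison would rank "1.9" above "1.10".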

Preparation

Data

bash prepare.sh

This script downloads the VOC 2010 dataset (http://host.robots.ox.ac.uk/pascal/VOC/voc2010/VOCtrainval_03-May-2010.tar) and the original dataset from the paper's authors (http://www2.informatik.uni-freiburg.de/~oliveira/datasets/Sitting.tar.gz).
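
Where bash is not available, the same two archives could be fetched from Python. This is only a sketch under assumptions: the contents of `prepare.sh` are not reproduced here, the `data` destination directory and the function names are hypothetical, and the script may perform further steps that this sketch omits.

```python
import os
import tarfile

try:  # Python 3
    from urllib.request import urlretrieve
except ImportError:  # Python 2
    from urllib import urlretrieve

# URLs quoted from the paragraph above.
DATASET_URLS = [
    "http://host.robots.ox.ac.uk/pascal/VOC/voc2010/VOCtrainval_03-May-2010.tar",
    "http://www2.informatik.uni-freiburg.de/~oliveira/datasets/Sitting.tar.gz",
]


def archive_name(url):
    """Return the file name part of a URL, e.g. 'Sitting.tar.gz'."""
    return url.rstrip("/").rsplit("/", 1)[-1]


def download_and_extract(url, dest_dir="data"):
    """Download one archive into dest_dir (a hypothetical location) and unpack it there."""
    if not os.path.isdir(dest_dir):
        os.makedirs(dest_dir)
    archive_path = os.path.join(dest_dir, archive_name(url))
    if not os.path.exists(archive_path):  # skip archives that are already present
        urlretrieve(url, archive_path)
    # Mode "r:*" lets tarfile detect plain .tar and gzip-compressed .tar.gz alike.
    with tarfile.open(archive_path, "r:*") as archive:
        archive.extractall(dest_dir)
    return archive_path
```

Looping over `DATASET_URLS` and calling `download_and_extract(url)` for each entry fetches both archives. The existence check avoids downloading a large archive a second time.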

Model

You can download a pre-trained FCN model from here.

We use the weights of this model and train a new model on the VOC dataset.

Start training

python train.py -g 0 -b 3 -e 3000 -l on -s on

Possible options

python train.py --help

GPU memory requirement

Quote from the original paper:

"Each minibatch consists of just one image. The learning rate and momentum are fixed to 1e-10 and 0.99, respectively. We train the refinement layer by layer, which takes two days per refinement layer. Thus, the overall training starting from the pre-trained VGG network took 10 days on a single GPU."

The current maximum batch size is 3 on a GPU with 12 GB of memory.

It was also confirmed that a MacBook Pro (Late 2016, 16 GiB memory) can run training with batch size 1.
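
To show how the learning rate and momentum quoted from the paper enter a training step, here is a minimal sketch of a classical momentum SGD update in plain Python. It is illustrative only: the function name `momentum_sgd_step` is hypothetical, the gradient value is made up so that the arithmetic is easy to follow, and nothing here is taken from `train.py`.

```python
LEARNING_RATE = 1e-10  # value from the quoted passage
MOMENTUM = 0.99        # value from the quoted passage


def momentum_sgd_step(params, grads, velocities,
                      lr=LEARNING_RATE, momentum=MOMENTUM):
    """Apply one classical momentum update in place and return the parameters.

    v <- momentum * v - lr * grad
    p <- p + v
    """
    for i in range(len(params)):
        velocities[i] = momentum * velocities[i] - lr * grads[i]
        params[i] += velocities[i]
    return params


# Toy example with a made-up constant gradient, applied for three steps.
params = [0.0]
velocities = [0.0]
for _ in range(3):
    momentum_sgd_step(params, [1e10], velocities)
```

In Chainer, an optimizer of this kind is available as `chainer.optimizers.MomentumSGD`, which accepts `lr` and `momentum` arguments. Whether `train.py` uses exactly the values from the paper is not stated in this README.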

Result

In preparation.

Visualize Prediction

python visualize.py -f PATH_TO_IMAGE_FILE

LICENSE

MIT License.

Author

shiba24, August 2016.

Contributors

bobye

Owner

Shintaro Shiba
