ADGAN - The Implementation of paper Controllable Person Image Synthesis with Attribute-Decomposed GAN

Last update: Dec 29, 2022

Overview

ADGAN

PyTorch | project page | paper

PyTorch implementation for controllable person image synthesis.

Controllable Person Image Synthesis with Attribute-Decomposed GAN
Yifang Men, Yiming Mao, Yuning Jiang, Wei-Ying Ma, Zhouhui Lian, Peking University & ByteDance AI Lab, CVPR 2020(Oral).

Component Attribute Transfer

Pose Transfer

Requirement

python 3
pytorch(>=1.0)
torchvision
numpy
scipy
scikit-image
pillow
pandas
tqdm
dominate

Getting Started

You can directly download our generated images (in Deepfashion) from Google Drive.

Installation

Clone this repo:

git clone https://github.com/menyifang/ADGAN.git
cd ADGAN

Data Preperation

We use DeepFashion dataset and provide our dataset split files, extracted keypoints files and extracted segmentation files for convience.

The dataset structure is recommended as:

+—deepfashion
|   +—fashion_resize
|       +--train (files in 'train.lst')
|          +-- e.g. fashionMENDenimid0000008001_1front.jpg
|       +--test (files in 'test.lst')
|          +-- e.g. fashionMENDenimid0000056501_1front.jpg
|       +--trainK(keypoints of person images)
|          +-- e.g. fashionMENDenimid0000008001_1front.jpg.npy
|       +--testK
|          +-- e.g. fashionMENDenimid0000056501_1front.jpg.npy
|   +—semantic_merge
|   +—fashion-resize-pairs-train.csv
|   +—fashion-resize-pairs-test.csv
|   +—fashion-resize-annotation-pairs-train.csv
|   +—fashion-resize-annotation-pairs-test.csv
|   +—train.lst
|   +—test.lst
|   +—vgg19-dcbb9e9d.pth
|   +—vgg_conv.pth
...

Person images

Download person images from deep fasion dataset in-shop clothes retrival benchmark and download dataset split from Google Drive.
Crop the images. Split the raw images into the train split (fashion_resize/train) and the test split (fashion_resize/test). Launch

python tool/generate_fashion_datasets.py

Note: In our settings, we crop the images of DeepFashion into the resolution of 176x256 in a center-crop manner.

Keypoints files

Download train/test pairs and train/test key points annotations from Google Drive, including fashion-resize-pairs-train.csv, fashion-resize-pairs-test.csv, fashion-resize-annotation-train.csv, fashion-resize-annotation-train.csv. Put these four files under the deepfashion directory.
Generate the pose heatmaps. Launch

python tool/generate_pose_map_fashion.py

Segmentation files

Extract human segmentation results from existing human parser (e.g. Look into Person) and merge into 8 categories. Our segmentation results are provided in Google Drive, including ‘semantic_merge2’ and ‘semantic_merge3’ in different merge manner. Put one of them under the deepfashion directory.

Optionally, you can also generate these files by yourself.

Keypoints files

We use OpenPose to generate keypoints.

Download pose estimator from Google Drive. Put it under the root folder ADGAN.
Change the paths input_folder and output_path in tool/compute_coordinates.py. And then launch

python2 compute_coordinates.py

Dataset split files

python2 tool/create_pairs_dataset.py

Train a model

bash ./scripts/train.sh

Test a model

Download our pretrained model from Google Drive. Modify your data path and launch

bash ./scripts/test.sh

Evaluation

We adopt SSIM, IS, DS, CX for evaluation. This part is finished by Yiming Mao.

1) SSIM

For evaluation, Tensorflow 1.4.1(python3) is required.

python tool/getMetrics_market.py

2) DS Score

Download pretrained on VOC 300x300 model and install propper caffe version SSD. Put it in the ssd_score forlder.

python compute_ssd_score_fashion.py --input_dir path/to/generated/images

3) CX (Contextual Score)

Refer to folder ‘cx’ to compute contextual score.

Citation

If you use this code for your research, please cite our paper:

@inproceedings{men2020controllable,
  title={Controllable Person Image Synthesis with Attribute-Decomposed GAN},
  author={Men, Yifang and Mao, Yiming and Jiang, Yuning and Ma, Wei-Ying and Lian, Zhouhui},
  booktitle={Computer Vision and Pattern Recognition (CVPR), 2020 IEEE Conference on},
  year={2020}
}

Acknowledgments

Our code is based on PATN and thanks for their great work.

ADGAN - The Implementation of paper Controllable Person Image Synthesis with Attribute-Decomposed GAN

Related tags

Overview

ADGAN

PyTorch | project page | paper

Requirement

Getting Started

Installation

Data Preperation

Train a model

Test a model

Evaluation

1) SSIM

2) DS Score

3) CX (Contextual Score)

Citation

Acknowledgments

Owner

Men Yifang

Official code implementation for "Personalized Federated Learning using Hypernetworks"

Taking A Closer Look at Domain Shift: Category-level Adversaries for Semantics Consistent Domain Adaptation

Non-Homogeneous Poisson Process Intensity Modeling and Estimation using Measure Transport

A Real-ESRGAN equipped Colab notebook for CLIP Guided Diffusion

YOLOX-RMPOLY

Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implicit Bayesian Inference"

The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness.

A crash course in six episodes for software developers who want to become machine learning practitioners.

Specificity-preserving RGB-D Saliency Detection

E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

This code provides various models combining dilated convolutions with residual networks

This is the official Pytorch-version code of FlatGCN (Flattened Graph Convolutional Networks for Recommendation).

Implementation of Geometric Vector Perceptron, a simple circuit for 3d rotation equivariance for learning over large biomolecules, in Pytorch. Idea proposed and accepted at ICLR 2021

Code and Data for the paper: Molecular Contrastive Learning with Chemical Element Knowledge Graph [AAAI 2022]

Code for Emergent Translation in Multi-Agent Communication

Detector for Log4Shell exploitation attempts

Mixed Neural Likelihood Estimation for models of decision-making

Rethinking Transformer-based Set Prediction for Object Detection

OpenAi's gym environment wrapper to vectorize them with Ray

The code of paper 'Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection'