Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis

Last update: Jan 05, 2023

Related tags

Deep Learning iPERCore

Overview

Impersonator++

Update News

15/04/2021, iPERCore-0.2.0, including
- Add Training
- Add Novel View Synthesis
- Add Motion Imitation with bullet-time effects
- Add Motion Imitation with multi-view outputs
- Add Appearance Transfer
- Add A Unified synthesizer: Motion Imitation + Appearance Transfer + Novel View Synthesis
- Update torch 1.8+ with RTX30+ GPUs.
12/20/2020, A precompiled version on Windows has been released! [Usage]
12/10/2020, iPERCore-0.1.1, supports Windows.
12/06/2020, iPERCore-0.1, all the base codes. The motion imitation scripts.

See the details of developing logs.

Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis, including human motion imitation, appearance transfer, and novel view synthesis. Currently the paper is under review of IEEE TPAMI. It is an extension of our previous ICCV project impersonator, and it has a more powerful ability in generalization and produces higher-resolution results (512 x 512, 1024 x 1024) than the previous ICCV version.

🧾 Colab Notebook	Released (Windows)	📑 Paper	📱 Website	📂 Dataset	💡 Bilibili	✒ Forum
	[Usage]	paper	website	Dataset	bilibili	Forum

Installation

See more details, including system dependencies, python requirements and setups in install.md. Please follows the instructions in install.md to install this firstly.

Notice that imags_size=512 need at least 9.8GB GPU memory. if you are using a middle-level GPU(e.g. RTX 2060), you should change the image_size to 384 or 256. The following table can be used as a reference:

image_size	preprocess	personalize	run_imitator	recommended gpu
256x256	3.1 GB	4.3 GB	1.1 GB	RTX 2060 / RTX 2070
384x384	3.1 GB	7.9 GB	1.5 GB	GTX 1080Ti / RTX 2080Ti / Titan Xp
512x512	3.1 GB	9.8 GB	2 GB	GTX 1080Ti / RTX 2080Ti / Titan Xp
1024x1024	3.1 GB	20 GB	-	RTX Titan / P40 / V100 32G

Run demos

1. Run on Google Colab

2. Run with Console (scripts) mode

See scripts_runner for more details.

3. Run with GUI mode

Coming soon!

Citation

@misc{liu2020liquid,
      title={Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis}, 
      author={Wen Liu and Zhixin Piao, Zhi Tu, Wenhan Luo, Lin Ma and Shenghua Gao},
      year={2020},
      eprint={2011.09055},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

@InProceedings{lwb2019,
    title={Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis},
    author={Wen Liu and Zhixin Piao, Min Jie, Wenhan Luo, Lin Ma and Shenghua Gao},
    booktitle={The IEEE International Conference on Computer Vision (ICCV)},
    year={2019}
}

Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis

Related tags

Overview

Impersonator++

Update News

Installation

Run demos

1. Run on Google Colab

2. Run with Console (scripts) mode

3. Run with GUI mode

Citation

Owner

🙄 Difficult algorithm, Simple code.

This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.

This provides the R code and data to replicate results in "The USS Trustee’s risky strategy"

A PyTorch toolkit for 2D Human Pose Estimation.

Code for the paper "Adversarial Generator-Encoder Networks"

Code for Boundary-Aware Segmentation Network for Mobile and Web Applications

Reliable probability face embeddings

[CVPR 2021] MiVOS - Scribble to Mask module

Implementation for our ICCV2021 paper: Internal Video Inpainting by Implicit Long-range Propagation

Official Pytorch implementation of Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations

OpenGAN: Open-Set Recognition via Open Data Generation

Office source code of paper UniFuse: Unidirectional Fusion for 360$^\circ$ Panorama Depth Estimation

Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly

(AAAI 2021) Progressive One-shot Human Parsing

Problem-943.-ACMP - Problem 943. ACMP

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

Latex code for making neural networks diagrams

The implementation of "Bootstrapping Semantic Segmentation with Regional Contrast".

Generalized Decision Transformer for Offline Hindsight Information Matching

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning