3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos

Last update: Oct 09, 2022

Related tags

Overview

3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos

This repository contains the source code and dataset for the paper 3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos by Zipeng Ye, Mengfei Xia, Yanan Sun, Ran Yi, Minjing Yu, Juyong Zhang, Yu-Kun Lai and Yong-Jin Liu, which is accepted by IEEE Transactions on Visualization and Computer Graphics (TVCG).

This repository contains two parts: dataset and source code.

2D and 3D Caricature Dataset

2D Caricature Dataset

We collect 5,343 hand-drawn portrait caricature images from Pinterest.com and WebCaricature dataset with facial landmarks extracted by a landmark detector, followed by human interaction for correction if needed.

The 2D dataset is in cari_2D_dataset.zip file.

3D Caricature Dataset

We use the method to generate 5,343 3D caricature meshes of the same topology. We align the pose of the generated 3D caricature meshes with the pose of a template 3D head using an ICP method, where we use 5 key landmarks in eyes, nose and mouth as the landmarks for ICP. We normalize the coordinates of the 3D caricature mesh vertices by translating the center of meshes to the origin and scaling them to the same size.

The 3D dataset is in cari_3D_dataset.zip file.

3DCariPCA

We use the 3D caricature dataset to build a PCA model. We use sklearn.decomposition.PCA to build 3DCariPCA. The PCA model is pca200_icp.model file. You could use joblib to load the model and use it.

Download

You can download the two datasets and PCA in google drive and BaiduYun (code: 3kz8).

Source Code

Running Environment

Ubuntu 16.04 + Python3.7

You can install the environment directly by using conda env create -f env.yml in conda.

Training

We use our 3D caricature dataset and CelebA-Mask-HQ dataset to train 3D-CariGAN. You could download CelebA-Mask-HQ dataset and then reconstruct their 3D normal heads of all images. The 3D normal heads are for calculating loss.

Inferring

The inferring code is cari_pipeline.py file in pipeline folder. You could train your model or use our pre-trained model.

The pipeline includes two optional sub-program eye_complete and color_complete, which are implemented by C++. You should compile them and then use them. The eye_complete is for completing the eye part of mesh and the color_complete is for texture completion.

Pre-trained Model

You can download pre-trained model latest.pth in google drive and BaiduYun (code: 3kz8). You should put it into ./checkpoints.

Additional notes

Please cite the following paper if the dataset and code help your research:

Citation:

@article{ye2021caricature,
 author = {Ye, Zipeng and Xia, Mengfei and Sun, Yanan and Yi, Ran and Yu, Minjing and Zhang, Juyong and Lai, Yu-Kun and Liu, Yong-Jin},
 title = {3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos},
 journal = {IEEE Transactions on Visualization and Computer Graphics},
 year = {2021},
 doi={10.1109/TVCG.2021.3126659},
}

The paper will be published.

3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos

Related tags

Overview

3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos

2D and 3D Caricature Dataset

2D Caricature Dataset

3D Caricature Dataset

3DCariPCA

Download

Source Code

Running Environment

Training

Inferring

Pre-trained Model

Additional notes

Owner

sssegmentation is a general framework for our research on strongly supervised semantic segmentation.

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022)

A PyTorch Reimplementation of TecoGAN: Temporally Coherent GAN for Video Super-Resolution

YOLOv5 detection interface - PyQt5 implementation

PyTorch implementation for our paper "Deep Facial Synthesis: A New Challenge"

A curated list of awesome Machine Learning frameworks, libraries and software.

This repo contains implementation of different architectures for emotion recognition in conversations.

Solver for Large-Scale Rank-One Semidefinite Relaxations

Explaining Deep Neural Networks - A comparison of different CAM methods based on an insect data set

paper: Hyperspectral Remote Sensing Image Classification Using Deep Convolutional Capsule Network

Crossover Learning for Fast Online Video Instance Segmentation (ICCV 2021)

Unifying Global-Local Representations in Salient Object Detection with Transformer

공공장소에서 눈만 돌리면 CCTV가 보인다는 말이 과언이 아닐 정도로 CCTV가 우리 생활에 깊숙이 자리 잡았습니다.

Official code repository for ICCV 2021 paper: Gravity-Aware Monocular 3D Human Object Reconstruction

Json2Xml tool will help you convert from json COCO format to VOC xml format in Object Detection Problem.

Studying Python release adoptions by looking at PyPI downloads

Official repository for Fourier model that can generate periodic signals

[CVPR 2021] NormalFusion: Real-Time Acquisition of Surface Normals for High-Resolution RGB-D Scanning

Relative Human dataset, CVPR 2022

EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks