A DCGAN to generate anime faces using a custom-mined dataset

Last update: Jan 03, 2023

Overview

Anime-Face-GAN-Keras

A DCGAN that generates anime faces, trained on a custom dataset and implemented in Keras.

Dataset

The dataset was created by crawling anime database websites using curl. The script anime_dataset_gen.py crawls the sites and processes the images into 64x64 PNG images cropped to the faces only.
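The crop-and-resize step can be sketched as follows. This is only an illustration, not the code in anime_dataset_gen.py: the function name crop_face_to_64 and the (x, y, w, h) box format are hypothetical, the face box is assumed to come from a separate face detector, and nearest-neighbour resizing is used only to keep the sketch dependency-free beyond NumPy.

```python
import numpy as np


def crop_face_to_64(image, box, out_size=64):
    """Crop box=(x, y, w, h) from an HxWxC array and resize it to out_size x out_size.

    Assumption: the box comes from some face detector and lies inside the image.
    Nearest-neighbour sampling is used here for simplicity only.
    """
    x, y, w, h = box
    face = image[y:y + h, x:x + w]
    # Pick, for every output row/column, the nearest source row/column.
    rows = np.arange(out_size) * face.shape[0] // out_size
    cols = np.arange(out_size) * face.shape[1] // out_size
    return face[rows][:, cols]


if __name__ == "__main__":
    picture = np.zeros((200, 300, 3), dtype=np.uint8)  # stand-in for a crawled image
    face = crop_face_to_64(picture, box=(50, 20, 120, 120))
    print(face.shape)
```

A real pipeline would more likely resize with an image library's interpolation, which gives smoother results than nearest-neighbour sampling.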

Examples of the dataset:

Network

This GAN implementation uses transposed-convolution (deconv) layers in Keras; the networks are initialized in the GAN_Nets.py file. I also tried other combinations of layers:
-Conv + Upsampling
-Conv + bilinear
-Conv + Subpixel Upscaling
None of these combinations yielded decent results: either the GAN failed to generate images that resemble faces, or it generated the same or very similar faces for every batch (generator collapse, often called mode collapse). These are only my results; techniques such as mini-batch discrimination or z-layers might give better results.
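To see how a stack of deconv layers reaches the 64x64 output size, here is a small sketch of the size arithmetic. It follows the output-length rule Keras uses for Conv2DTranspose. The starting size of 4, the stride of 2 and the kernel size of 4 are assumptions for illustration; the actual layer settings in GAN_Nets.py are not stated here.

```python
def deconv_output_size(input_size, stride, kernel_size, padding="same"):
    """Spatial output size of one transposed convolution (Keras rule)."""
    if padding == "same":
        return input_size * stride
    if padding == "valid":
        return input_size * stride + max(kernel_size - stride, 0)
    raise ValueError(f"unsupported padding: {padding!r}")


def generator_resolutions(start_size=4, target_size=64, stride=2, kernel_size=4):
    """List the feature-map sizes from the first layer up to the target size.

    All default values are assumed, not taken from the project's code.
    """
    sizes = [start_size]
    while sizes[-1] < target_size:
        sizes.append(deconv_output_size(sizes[-1], stride, kernel_size))
    return sizes


if __name__ == "__main__":
    print(generator_resolutions())  # [4, 8, 16, 32, 64]
```

Under these assumptions, four stride-2 deconv layers are enough to go from a 4x4 feature map to a 64x64 image.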

Training

Only simple GAN training methods are used. Training is done on about 22,000 images. The images are not all loaded into memory; instead, each time a batch is sampled, only the sampled images are loaded. An overview of what happens in each step:
-Sample images from the dataset (real data)
-Generate images using the generator, with Gaussian noise as input (fake data)
-Add noise to the labels of the real and fake data
-Train the discriminator on real data
-Train the discriminator on fake data
-Train the GAN on fake images with real data labels
Training is done for 10,000 steps. On my setup (GTX 660; i5 4670) each step takes 10-11 seconds, which is roughly 28 to 31 hours for the full run.
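One training step, as outlined above, might look like the sketch below. It is not the project's actual training code. The models only need the Keras-style predict and train_on_batch methods; load_image is a hypothetical callable that reads one file into an array; the batch size, noise dimension and label-noise range of 0.2 are assumed values. It also assumes that gan is the generator stacked on a discriminator that was frozen before compiling, and that the generator is trained with plain labels of 1.

```python
import numpy as np


def sample_real_batch(image_paths, batch_size, load_image, rng):
    """Load only the images picked for this batch, not the whole dataset."""
    chosen = rng.choice(len(image_paths), size=batch_size, replace=False)
    return np.stack([load_image(image_paths[i]) for i in chosen])


def noisy_labels(batch_size, real, rng, max_noise=0.2):
    """Labels near 1 for real data, near 0 for fake data (noise range is assumed)."""
    noise = rng.uniform(0.0, max_noise, size=(batch_size, 1))
    return 1.0 - noise if real else noise


def train_step(generator, discriminator, gan, image_paths, load_image,
               batch_size=64, noise_dim=100, rng=None):
    """Run one step: discriminator on real, discriminator on fake, then the GAN."""
    if rng is None:
        rng = np.random.default_rng()

    # Real data: read just this batch from disk.
    real_images = sample_real_batch(image_paths, batch_size, load_image, rng)

    # Fake data: generator output for Gaussian noise.
    z = rng.normal(0.0, 1.0, size=(batch_size, noise_dim))
    fake_images = generator.predict(z)

    # Train the discriminator on each half with noisy labels.
    d_loss_real = discriminator.train_on_batch(
        real_images, noisy_labels(batch_size, True, rng))
    d_loss_fake = discriminator.train_on_batch(
        fake_images, noisy_labels(batch_size, False, rng))

    # Train the generator through the combined model, labelling fakes as real.
    z = rng.normal(0.0, 1.0, size=(batch_size, noise_dim))
    g_loss = gan.train_on_batch(z, np.ones((batch_size, 1)))
    return d_loss_real, d_loss_fake, g_loss
```

Calling train_step in a loop for 10,000 iterations, and saving a grid of generated images every 100 iterations, would match the schedule described in this section.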

Loss plot:

Full training as a GIF (images sampled every 100 steps):

Faces generated at the end of 10,000 steps:

The faces look pretty good IMO; they might look more like actual faces with more training, more data, and probably a better network.

Resources

https://github.com/tdrussell/IllustrationGAN
https://github.com/jayleicn/animeGAN
https://github.com/forcecore/Keras-GAN-Animeface-Character

https://distill.pub/2016/deconv-checkerboard/
https://kivantium.net/keras-bilinear

Owner

Pavitrakumar P
