A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.

Last update: Jul 26, 2022

Overview

PokeGAN

A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.

Dataset

The model has been trained on dataset that includes 819 pokémon.
You can download dataset from this kaggle link.

Dependencies

I have used the following versions for code work:

python==3.8.8
tensorflow==2.4.1
tensorflow-gpu==2.4.1
numpy==1.19.1
h5py==2.10.0

Note

There are several difficulties in pokemon generation using GAN :

The difficulty of GAN training is well known; changing a hyperparameter can greatly change the results.
The dataset size is too small! 819 different pokemon images are not enough. For this reason, I applied data augmentation on the data; these are the transformations applied :

img_transf = tf.keras.Sequential([
            	tf.keras.layers.experimental.preprocessing.RandomContrast(factor=(0.05, 0.15)),
                image_aug.RandomBrightness(brightness_delta=(-0.15, 0.15)),
                image_aug.PowerLawTransform(gamma=(0.8,1.2)),
                image_aug.RandomSaturation(sat=(0, 2)),
                image_aug.RandomHue(hue=(0, 0.15)),
                tf.keras.layers.experimental.preprocessing.RandomFlip("horizontal"),
	    	tf.keras.layers.experimental.preprocessing.RandomTranslation(height_factor=(-0.10, 0.10), width_factor=(-0.10, 0.10)),
		tf.keras.layers.experimental.preprocessing.RandomZoom(height_factor=(-0.10, 0.10), width_factor=(-0.10, 0.10)),
		tf.keras.layers.experimental.preprocessing.RandomRotation(factor=(-0.10, 0.10))])

StyleGAN training is very expensive! I trained the model starting from a 4x4 resolution up to the final resolution of 256x256. The model was trained for 8 days using a Tesla V100 32GB SXM2.
To get better results you need to use higher resolutions and train for longer time.

Results

These are some examples of new pokémon generated by the model :

New Generated Pokémon

More results

You can see hundreds of new pokemon here.
I repeat again it : to get better results (better details in pokemon) is necessary to train for more time.

References

This code implementation is inspired by the unofficial keras implementation of styleGAN.

A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.

Related tags

Overview

PokeGAN

Dataset

Dependencies

Note

Results

More results

References

Owner

An implementation for the ICCV 2021 paper Deep Permutation Equivariant Structure from Motion.

The repo contains the code of the ACL2020 paper `Dice Loss for Data-imbalanced NLP Tasks`

Self-Supervised Monocular DepthEstimation with Internal Feature Fusion(arXiv), BMVC2021

Machine Learning Time-Series Platform

Official repository for the NeurIPS 2021 paper Get Fooled for the Right Reason: Improving Adversarial Robustness through a Teacher-guided curriculum Learning Approach

A collection of educational notebooks on multi-view geometry and computer vision.

Recognize numbers from an (28 x 28) image using neural networks

This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described in the paper.

This project deploys a yolo fastest model in the form of tflite on raspberry 3b+. The model is from another repository of mine called -Trash-Classification-Car

This repository contains a pytorch implementation of "StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision".

A tool to prepare websites grabbed with wget for local viewing.

AI Face Mesh: This is a simple face mesh detection program based on Artificial intelligence.

ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis

PyTorch Implementation of our paper Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

Mengzi Pretrained Models

Repository for open research on optimizers.

PyTorch implementation for the Neuro-Symbolic Sudoku Solver leveraging the power of Neural Logic Machines (NLM)

Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods

Fbone (Flask bone) is a Flask (Python microframework) starter/template/bootstrap/boilerplate application.

Implementation of trRosetta and trDesign for Pytorch, made into a convenient package