Official implementation of the paper Image Generators with Conditionally-Independent Pixel Synthesis https://arxiv.org/abs/2011.13775

Last update: Dec 21, 2022

Overview

CIPS -- Official Pytorch Implementation

of the paper Image Generators with Conditionally-Independent Pixel Synthesis

Requirements

pip install -r requirements.txt

Usage

First create lmdb datasets:

python prepare_data.py images --out LMDB_PATH --n_worker N_WORKER --size SIZE1,SIZE2,SIZE3,... DATASET_PATH

This will convert images to jpeg and pre-resizes it.

To train on FFHQ-256 or churches please run:

python3 -m torch.distributed.launch --nproc_per_node=8 --master_port=1234 train.py --n_sample=8 --batch=4 --fid_batch=8 --Generator=CIPSskip --output_dir=skip-[ffhq/churches] --img2dis --num_workers=16 DATASET_PATH

To train on patches add --crop=PATCH_SIZE. PATCH_SIZE has to be a power of 2.

Pretrained Checkpoints

Generate samples

To play with the models please download checkpoints and check out a notebook.ipynb

Progressive training

We also tried to train progressively on FFHQ starting from 256×256 initialization and got FID 10.07. We will update the paper with the training details soon. Checkpoint name is ffhq1024.pt. Samples are below.

Citation

If you found our work useful, please don't forget to cite

@article{anokhin2020image,
  title={Image Generators with Conditionally-Independent Pixel Synthesis},
  author={Anokhin, Ivan and Demochkin, Kirill and Khakhulin, Taras and Sterkin, Gleb and Lempitsky, Victor and Korzhenkov, Denis},
  journal={arXiv preprint arXiv:2011.13775},
  year={2020}
}

The code is heavely based on the styleganv2 pytorch implementation

Nvidia-licensed CUDA kernels (fused_bias_act_kernel.cu, upfirdn2d_kernel.cu) is for non-commercial use only.

Official implementation of the paper Image Generators with Conditionally-Independent Pixel Synthesis https://arxiv.org/abs/2011.13775

Related tags

Overview

CIPS -- Official Pytorch Implementation

Requirements

Usage

Pretrained Checkpoints

Generate samples

Progressive training

Citation

Owner

Multimodal Lab @ Samsung AI Center Moscow

YOLOX Win10 Project

Neural Scene Flow Fields using pytorch-lightning, with potential improvements

OptaPlanner wrappers for Python. Currently significantly slower than OptaPlanner in Java or Kotlin.

ML powered analytics engine for outlier detection and root cause analysis.

[ WSDM '22 ] On Sampling Collaborative Filtering Datasets

This Artificial Intelligence program can take a black and white/grayscale image and generate a realistic or plausible colorized version of the same picture.

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

Using multidimensional LSTM neural networks to create a forecast for Bitcoin price

Effective Use of Transformer Networks for Entity Tracking

Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals.

A "gym" style toolkit for building lightweight Neural Architecture Search systems

Bridging Vision and Language Model

Probabilistic Programming and Statistical Inference in PyTorch

[NeurIPS'21] Shape As Points: A Differentiable Poisson Solver

Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.

Build fully-functioning computer vision models with PyTorch

🐸STT integration examples

A deep learning network built with TensorFlow and Keras to classify gender and estimate age.

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

Efficient neural networks for analog audio effect modeling