Official Pytorch implementation of "DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network" (CVPR'21)

Last update: Nov 22, 2022

Related tags

Overview

DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network

Pytorch implementation for our DivCo. We propose a simple yet effective regularization term named latent-augmented contrastive loss that can be applied to arbitrary conditional generative adversarial networks in different tasks to alleviate the mode collapse issue and improve the diversity.

Contact: Rui Liu ([email protected])

Paper

DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network
Rui Liu, Yixiao Ge, Ching Lam Choi, Xiaogang Wang, and Hongsheng Li
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
[arxiv]

Citing DivCo

If you find DivCo useful in your research, please consider citing:

@inproceedings{Liu_DivCo,
  author = {Liu, Rui and Ge, Yixiao and Choi, Ching Lam and Wang, Xiaogang and Li, Hongsheng},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
  title = {DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network},
  year = {2021}
}

Framework

Usage

Prerequisites

Python >= 3.6
Pytorch >= 0.4.0 and corresponding torchvision (https://pytorch.org/)

Install

Clone this repo:

git clone https://github.com/ruiliu-ai/DivCo.git

Training Examples

Download datasets for each task into the dataset folder

mkdir datasets

Label-conditoned Image Generation

Dataset: CIFAR-10
Baseline: DCGAN

cd DivCo/DivCo-DCGAN
python train.py --dataroot ./datasets/Cifar10

Paried Image-to-image Translation

Paired Data: facades and maps
Baseline: BicycleGAN

You can download the facades and maps datasets from the BicycleGAN [Github Project].
We employ the network architecture of the BicycleGAN and follow its training process.

cd DivCo/DivCo-BicycleGAN
python train.py --dataroot ./datasets/facades

Unpaired Image-to-image Translation

Unpaired Data: Yosemite (summer <-> winter) and Cat2Dog (cat <-> dog)
Baseline: DRIT

You can download the datasets from the DRIT [Github Project].
Specify --concat 0 for Cat2Dog to handle large shape variation translation

cd DivCo/DivCo-DRIT
python train.py --dataroot ./datasets/cat2dog --concat 0 --lambda_contra 0.1
python train.py --dataroot ./datasets/yosemite --concat 1 --lambda_contra 1.0

Pre-trained Models

Download and save them into

./models/

Evaluation

For BicycleGAN, DRIT and MSGAN, please follow the instructions of corresponding github projects of the baseline frameworks for more evaluation details.

Testing Examples

DivCo-DCGAN

python test.py --dataroot ./datasets/Cifar10 --resume ./models/DivCo-DCGAN/00199.pth

DivCo-BicycleGAN

python test.py --dataroot ./datasets/facades --checkpoints_dir ./models/DivCo-BicycleGAN/facades --epoch 400

python test.py --dataroot ./datasets/maps --checkpoints_dir ./models/DivCo-BicycleGAN/maps --epoch 400

DivCo-DRIT

python test.py --dataroot ./datasets/yosemite --resume ./models/DivCo-DRIT/yosemite/01199.pth --concat 1

python test.py --dataroot ./datasets/cat2dog --resume ./models/DivCo-DRIT/cat2dog/01199.pth --concat 0

Official Pytorch implementation of "DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network" (CVPR'21)

Related tags

Overview

DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network

Paper

Citing DivCo

Framework

Usage

Prerequisites

Install

Training Examples

Label-conditoned Image Generation

Paried Image-to-image Translation

Unpaired Image-to-image Translation

Pre-trained Models

Evaluation

Testing Examples

Reference

Quantitative Evaluation Metrics

Owner

QI-Q RoboMaster2022 CV Algorithm

DNA-RECON { Automatic Web Reconnaissance Tool }

This is an open solution to the Home Credit Default Risk challenge 🏡

Tackling Obstacle Tower Challenge using PPO & A2C combined with ICM.

FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks

Official PyTorch Implementation of Rank & Sort Loss [ICCV2021]

Gradient representations in ReLU networks as similarity functions

Finite-temperature variational Monte Carlo calculation of uniform electron gas using neural canonical transformation.

Social Network Ads Prediction

통일된 DataScience 폴더 구조 제공 및 가상환경 작업의 부담감 해소

Code for ICCV2021 paper SPEC: Seeing People in the Wild with an Estimated Camera

PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in clustering (CVPR2021)

Distributed Asynchronous Hyperparameter Optimization in Python

Source code for the paper "Periodic Traveling Waves in an Integro-Difference Equation With Non-Monotonic Growth and Strong Allee Effect"

This repo contains the source code and a benchmark for predicting user's utilities with Machine Learning techniques for Computational Persuasion

CCCL: Contrastive Cascade Graph Learning.

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

Colab notebook for openai/glide-text2im.

Code for the paper: "On the Bottleneck of Graph Neural Networks and Its Practical Implications"

[CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search