A PyTorch implementation of the paper "Semantic Image Synthesis via Adversarial Learning" in ICCV 2017

Last update: Nov 25, 2022

Overview

Semantic Image Synthesis via Adversarial Learning

This is a PyTorch implementation of the paper Semantic Image Synthesis via Adversarial Learning.

Requirements

PyTorch 0.2
Torchvision
Pillow
fastText.py (Note: if you have a problem when loading a pretrained model, try my fixed code)
NLTK

Pretrained word vectors for fastText

Download the pretrained English word vectors. You can see the list of pretrained vectors on this page.

Datasets

Oxford-102 flowers: images and captions
Caltech-200 birds: images and captions

The caption data is from this repository. After downloading, modify the CONFIG file so that every dataset path points to the data you downloaded.

Run

scripts/train_text_embedding_[birds/flowers].sh
Train a visual-semantic embedding model using the method of Kiros et al.
scripts/train_[birds/flowers].sh
Train a GAN using a pretrained text embedding model.
scripts/test_[birds/flowers].sh
Generate some examples using original images and semantically relevant texts.
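
The three steps above depend on each other: the GAN training needs the text embedding model, and the test script needs the trained GAN. As a minimal sketch of that ordering, the Python helper below builds the script names for one dataset and runs them in sequence. The names `pipeline_scripts` and `run_pipeline` are hypothetical and not part of this repository, and the sketch assumes the scripts are launched with `bash` from the repository root.

```python
import subprocess

DATASETS = ("birds", "flowers")
# Script name prefixes from the list above, in the order they should run.
STAGES = ("train_text_embedding", "train", "test")


def pipeline_scripts(dataset, scripts_dir="scripts"):
    """Return the three script paths for one dataset, in run order."""
    if dataset not in DATASETS:
        raise ValueError(f"dataset must be one of {DATASETS}, got {dataset!r}")
    return [f"{scripts_dir}/{stage}_{dataset}.sh" for stage in STAGES]


def run_pipeline(dataset, dry_run=True):
    """Run (or, with dry_run=True, only print) each stage in order."""
    for script in pipeline_scripts(dataset):
        if dry_run:
            print("would run:", script)
        else:
            # Assumption: invoked from the repository root; stop at the first failure.
            subprocess.run(["bash", script], check=True)


if __name__ == "__main__":
    run_pipeline("birds")  # dry run: only prints the three script paths
```

Calling `run_pipeline("flowers", dry_run=False)` would execute the three flower scripts one after another and stop if any of them exits with an error.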

Results

Acknowledgements

We would like to thank Hao Dong, who is one of the first authors of the paper Semantic Image Synthesis via Adversarial Learning, for providing helpful advice for the implementation.

Owner

Seonghyeon Nam
