SMIS - Semantically Multi-modal Image Synthesis(CVPR 2020)

Last update: Dec 01, 2022

Related tags

Deep Learning SMIS

Overview

Semantically Multi-modal Image Synthesis

Project page / Paper / Demo

Semantically Multi-modal Image Synthesis(CVPR2020).
Zhen Zhu, Zhiliang Xu, Ansheng You, Xiang Bai

Requirements

torch>=1.0.0
torchvision
dominate
dill
scikit-image
tqdm
opencv-python

Getting Started

Data Preperation

DeepFashion
Note: We provide an example of the DeepFashion dataset. That is slightly different from the DeepFashion used in our paper due to the impact of the COVID-19.

Cityscapes
The Cityscapes dataset can be downloaded at here

ADE20K
The ADE20K dataset can be downloaded at here

Test/Train the models

Download the tar of the pretrained models from the Google Drive Folder. Save it in checkpoints/ and unzip it. There are deepfashion.sh, cityscapes.sh and ade20k.sh in the scripts folder. Change the parameters like --dataroot and so on, then comment or uncomment some code to test/train model. And you can specify the --test_mask for SMIS test.

Acknowledgments

Our code is based on the popular SPADE

SMIS - Semantically Multi-modal Image Synthesis(CVPR 2020)

Related tags

Overview

Semantically Multi-modal Image Synthesis

Project page / Paper / Demo

Requirements

Getting Started

Data Preperation

Test/Train the models

Acknowledgments

Owner

PyTorch implementation of the Pose Residual Network (PRN)

SE3 Pose Interp - Interpolate camera pose or trajectory in SE3, pose interpolation, trajectory interpolation

Differentiable simulation for system identification and visuomotor control

Paper Title: Heterogeneous Knowledge Distillation for Simultaneous Infrared-Visible Image Fusion and Super-Resolution

A3C LSTM Atari with Pytorch plus A3G design

A Simplied Framework of GAN Inversion

Automatic Differentiation Multipole Moment Molecular Forcefield

Face Detection and Alignment using Multi-task Cascaded Convolutional Networks (MTCNN)

A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.

Fine-tune pretrained Convolutional Neural Networks with PyTorch

Codes for 'Dual Parameterization of Sparse Variational Gaussian Processes'

Pytorch-Swin-Unet-V2 - a modified version of Swin Unet based on Swin Transfomer V2

Official repository for ABC-GAN

Official repository for the paper "Self-Supervised Models are Continual Learners" (CVPR 2022)

Neural network for recognizing the gender of people in photos

Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch

CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View

Code for a seq2seq architecture with Bahdanau attention designed to map stereotactic EEG data from human brains to spectrograms, using the PyTorch Lightning.

Unofficial Pytorch Lightning implementation of Contrastive Syn-to-Real Generalization (ICLR, 2021)