Spatially-Adaptive Pixelwise Networks for Fast Image Translation, CVPR 2021

Last update: Dec 28, 2022

Image Translation with ASAPNets

Webpage | Paper | Video

Installation

Install the requirements:

pip install -r requirements.txt

Code Structure

The code is heavily based on the official implementation of SPADE, and therefore has the same structure:

train.py, test.py: the entry point for training and testing.
trainers/pix2pix_trainer.py: harnesses and reports the progress of training.
models/pix2pix_model.py: creates the networks and computes the losses.
models/networks/: defines the architecture of all models.
options/: creates option lists using the argparse package. More individual options are dynamically added in other files as well. Please see the section below.
data/: defines the class for loading images and label maps.
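
The note above about options being added dynamically in other files can be illustrated with a small two-pass argparse sketch. This is not the repository's actual code: the registry DATASET_OPTION_SETTERS and the helper names are hypothetical, and only the flag names --name, --dataset_mode, --label_dir and --image_dir come from this README.

```python
import argparse


def add_base_options(parser):
    # Options that every run understands.
    parser.add_argument("--name", type=str, default="experiment")
    parser.add_argument("--dataset_mode", type=str, default="custom")
    return parser


def add_custom_dataset_options(parser):
    # Options that only make sense for the "custom" dataset mode.
    parser.add_argument("--label_dir", type=str, default="")
    parser.add_argument("--image_dir", type=str, default="")
    return parser


# Hypothetical registry: maps a dataset mode to a function that extends the parser.
DATASET_OPTION_SETTERS = {"custom": add_custom_dataset_options}


def parse_options(argv):
    parser = argparse.ArgumentParser()
    add_base_options(parser)
    # First pass: tolerate unknown flags, only to learn the dataset mode.
    known, _unknown = parser.parse_known_args(argv)
    setter = DATASET_OPTION_SETTERS.get(known.dataset_mode)
    if setter is not None:
        setter(parser)
    # Second pass: strict parsing with the extended parser.
    return parser.parse_args(argv)


opt = parse_options(["--name", "demo", "--dataset_mode", "custom", "--label_dir", "labels"])
print(opt)
```

The first pass uses parse_known_args so that flags belonging to a not-yet-registered group do not raise an error; the second pass is strict, so misspelled flags are still caught.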

The ASAPNets generator is implemented in:

models/networks/generator: defines the architecture of the ASAPNets generator.

Dataset Preparation

facades

Run:

cd data 
bash facadesHR_download_and_extract.sh

This extracts the full-resolution facades images into datasets/facadesHR.

cityscapes

Download the dataset into datasets/cityscapes and arrange it in folders: train_images, train_labels, val_images, val_labels.
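
Before training or testing, it can help to confirm that the folder layout described above is in place. The following is a minimal sketch, not part of the repository: the helper names are hypothetical, and the expectation that each image folder holds as many files as its label folder is an assumption.

```python
from pathlib import Path

# Folder names taken from the layout described in this README.
EXPECTED_FOLDERS = ("train_images", "train_labels", "val_images", "val_labels")


def missing_folders(root):
    """Return the expected sub-folders that do not exist under root."""
    root = Path(root)
    return [name for name in EXPECTED_FOLDERS if not (root / name).is_dir()]


def file_counts(root):
    """Count regular files in each expected sub-folder that exists."""
    root = Path(root)
    counts = {}
    for name in EXPECTED_FOLDERS:
        folder = root / name
        if folder.is_dir():
            counts[name] = sum(1 for p in folder.iterdir() if p.is_file())
    return counts


def check_dataset(root):
    """Return a list of human-readable problems; an empty list means the layout looks fine."""
    problems = [f"missing folder: {name}" for name in missing_folders(root)]
    counts = file_counts(root)
    for split in ("train", "val"):
        images = counts.get(f"{split}_images")
        labels = counts.get(f"{split}_labels")
        # Assumption: one label map per image.
        if images is not None and labels is not None and images != labels:
            problems.append(f"{split}: {images} images but {labels} labels")
    return problems
```

For example, check_dataset("datasets/cityscapes") returns an empty list when all four folders exist and the image and label counts agree.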

Generating Images Using Pretrained Models

Pretrained models can be downloaded from here. Save the models under the checkpoints/ folder. Images can be generated using the following commands:

# Facades 512
bash test_facades512.sh

# Facades 1024
bash test_facades1024.sh

# Cityscapes
bash test_cityscapes.sh

The output images will appear in the ./results/ folder.

Training New Models

New models can be trained with the following commands. Prepare the dataset in the ./datasets/ folder and arrange it in folders: train_images, train_labels, val_images, val_labels. For custom datasets, the easiest way is to use ./data/custom_dataset.py by specifying the option --dataset_mode custom, along with --label_dir [path_to_labels] --image_dir [path_to_images]. You also need to specify options such as --label_nc for the number of label classes in the dataset, --contain_dontcare_label to specify whether it has an unknown label, or --no_instance to denote that the dataset doesn't have instance maps.

Run:

python train.py --name [experiment_name] --dataset_mode custom --label_dir [path_to_labels] --image_dir [path_to_images] --label_nc [num_labels]

There are many additional options you can specify; please explore the ./options files. To specify the number of GPUs to utilize, use --gpu_ids.
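
When launching several experiments, it can be convenient to assemble the training command programmatically. The sketch below only builds the argument list from the flags named in this README. The experiment name, paths and class count are placeholder values, build_train_command is a hypothetical helper, and treating --no_instance as a value-less switch is an assumption.

```python
import shlex
import sys


def build_train_command(name, label_dir, image_dir, label_nc, extra_flags=()):
    """Build the argument list for a custom-dataset training run."""
    cmd = [
        sys.executable, "train.py",
        "--name", name,
        "--dataset_mode", "custom",
        "--label_dir", str(label_dir),
        "--image_dir", str(image_dir),
        "--label_nc", str(label_nc),
    ]
    # Extra flags are appended verbatim, e.g. ["--no_instance"].
    cmd.extend(extra_flags)
    return cmd


cmd = build_train_command(
    "my_experiment",                   # placeholder experiment name
    "datasets/my_data/train_labels",   # placeholder path
    "datasets/my_data/train_images",   # placeholder path
    12,                                # placeholder number of label classes
    extra_flags=["--no_instance"],
)
# Print a shell-quoted version of the command for inspection.
print(shlex.join(cmd))
```

The resulting list can be passed to subprocess.run(cmd, check=True) from the repository root to start training.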

Testing

Testing a newly trained model is similar to testing the pretrained models:

python test.py --name [name_of_experiment] --dataset_mode [dataset_mode] --dataroot [path_to_dataset]

You can load the parameters used during training by specifying --load_from_opt_file.
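
The idea behind reusing training parameters at test time can be sketched as follows. This illustrates the general pattern only and is not how the repository stores its options: the JSON file format, the file name opt.json and the helper names are assumptions made for the example.

```python
import argparse
import json
import tempfile
from pathlib import Path


def make_parser():
    parser = argparse.ArgumentParser()
    parser.add_argument("--name", default="experiment")
    parser.add_argument("--label_nc", type=int, default=0)
    return parser


def save_options(opt, path):
    # Store the parsed options as a plain JSON dictionary.
    Path(path).write_text(json.dumps(vars(opt), indent=2))


def load_options_as_defaults(parser, path):
    # Saved values become the new defaults; flags given on the command line still win.
    saved = json.loads(Path(path).read_text())
    parser.set_defaults(**saved)
    return parser


# Training time: parse and save the options.
train_opt = make_parser().parse_args(["--name", "demo", "--label_nc", "12"])

with tempfile.TemporaryDirectory() as tmp:
    opt_file = Path(tmp) / "opt.json"  # hypothetical file name
    save_options(train_opt, opt_file)

    # Test time: reuse the saved options, overriding only the name.
    test_parser = load_options_as_defaults(make_parser(), opt_file)
    test_opt = test_parser.parse_args(["--name", "demo_test"])

print(test_opt)
```

Here label_nc is restored from the saved file while --name is overridden on the command line, which is the behaviour one would typically want when testing a trained model.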

Acknowledgments

This code is heavily based on the official implementation of SPADE. We thank the authors for sharing their code publicly!

License

Attribution-NonCommercial-ShareAlike 4.0 International (see file).

Citation

@inproceedings{RottShaham2020ASAP,
  title={Spatially-Adaptive Pixelwise Networks for Fast Image Translation},
  author={Rott Shaham, Tamar and Gharbi, Michael and Zhang, Richard and Shechtman, Eli and Michaeli, Tomer},
  booktitle={Computer Vision and Pattern Recognition (CVPR)},
  year={2021}
}

Owner

Tamar Rott Shaham