[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

Last update: Dec 11, 2022

Related tags

Deep Learning CTSDG

Overview

CTSDG

Paper | Pre-trained Models | BibTex

Image Inpainting via Conditional Texture and Structure Dual Generation

Xiefan Guo, Hongyu Yang, Di Huang
In ICCV'2021

Introduction

Generator. Image inpainting is cast into two subtasks, i.e., structure-constrained texture synthesis (left, blue) and texture-guided structure reconstruction (right, red), and the two parallel-coupled streams borrow encoded deep features from each other. The Bi-GFF module and CFA module are stacked at the end of the generator to further refine the results.

Discriminator. The texture branch estimates the generated texture, while the structure branch guides structure reconstruction.

Prerequisites

Python >= 3.6
PyTorch >= 1.0
NVIDIA GPU + CUDA cuDNN

Getting Started

Installation

Clone this repo:

git clone https://github.com/Xiefan-Guo/CTSDG.git
cd CTSDG

Install PyTorch and dependencies from http://pytorch.org
Install python requirements:

pip install -r requirements.txt

Datasets

Image Dataset. We evaluate the proposed method on the CelebA, Paris StreetView, and Places2 datasets, which are widely adopted in the literature.

Mask Dataset. Irregular masks are obtained from Irregular Masks and classified based on their hole sizes relative to the entire image with an increment of 10%.

Training

Analogous to PConv by Liu et.al, initial training followed by finetuning are performed.

python train.py \
  --image_root [path to image directory] \
  --mask_root [path mask directory]

python train.py \
  --image_root [path to image directory] \
  --mask_root [path to mask directory] \
  --pre_trained [path to checkpoints] \
  --finetune True

Distributed training support. You can train model in distributed settings.

python -m torch.distributed.launch --nproc_per_node=N_GPU train.py

Testing

To test the model, you run the following code.

python test.py \
  --pre_trained [path to checkpoints] \
  --image_root [path to image directory] \
  --mask_root [path to mask directory] \
  --result_root [path to output directory] \
  --number_eval [number of images to test]

Citation

If any part of our paper and repository is helpful to your work, please generously cite with:

@InProceedings{Guo_2021_ICCV,
    author    = {Guo, Xiefan and Yang, Hongyu and Huang, Di},
    title     = {Image Inpainting via Conditional Texture and Structure Dual Generation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {14134-14143}
}

[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

Related tags

Overview

CTSDG

Paper | Pre-trained Models | BibTex

Introduction

Prerequisites

Getting Started

Installation

Datasets

Training

Testing

Citation

Owner

Xiefan Guo

Quadruped-command-tracking-controller - Quadruped command tracking controller (flat terrain)

Classifying cat and dog images using Kaggle dataset

A Python library created to assist programmers with complex mathematical functions

Json2Xml tool will help you convert from json COCO format to VOC xml format in Object Detection Problem.

Nvdiffrast - Modular Primitives for High-Performance Differentiable Rendering

competitions-v2

A state-of-the-art semi-supervised method for image recognition

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars, CVPR 2022.

A machine learning benchmark of in-the-wild distribution shifts, with data loaders, evaluators, and default models.

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

Human head pose estimation using Keras over TensorFlow.

Lab Materials for MIT 6.S191: Introduction to Deep Learning

Library for converting from RGB / GrayScale image to base64 and back.

Semi-Autoregressive Transformer for Image Captioning

A multi-scale unsupervised learning for deformable image registration

This is the official Pytorch-version code of FlatGCN (Flattened Graph Convolutional Networks for Recommendation).

In this project I played with mlflow, streamlit and fastapi to create a training and prediction app on digits

Python Classes: Medical Insurance Project using Object Oriented Programming Concepts

Real-Time Seizure Detection using EEG: A Comprehensive Comparison of Recent Approaches under a Realistic Setting

Hysterese plugin with two temperature offset areas