Intrinsic Image Harmonization

Last update: Dec 21, 2022

Overview

Intrinsic Image Harmonization [Paper]

Zonghui Guo, Haiyong Zheng, Yufeng Jiang, Zhaorui Gu, Bing Zheng

Here we provide the PyTorch implementation and the trained model of our framework.

Prerequisites

Linux
Python 3
CPU or NVIDIA GPU + CUDA CuDNN

Train/Test

Download the iHarmony4 dataset and our HVIDIT dataset from Google Drive or BaiduCloud (access code: akbi).
Train a model:

CUDA_VISIBLE_DEVICES=0 python train.py --model retinexltifpm  --name retinexltifpm_allihd  --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx

Test the model:

CUDA_VISIBLE_DEVICES=0 python test.py --model retinexltifpm  --name retinexltifpm_allihd  --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx

Apply a pre-trained model

Download the pretrained model from Google Drive or BaiduCloud (access code: 20m6), and put net_G.pth in the directory checkpoints/experiment. Run:

CUDA_VISIBLE_DEVICES=0 python test.py --model retinexltifpm  --name experiment  --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
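If you prefer to launch the pre-trained test run from Python, the command above can be assembled programmatically. The following is only a minimal sketch: the helper names `build_test_command`, `checkpoint_path`, and `run_test` are hypothetical and not part of this repository, and `batch_size` and `init_port` are left as required arguments because the README does not fix their values.

```python
import os
import subprocess
from pathlib import Path


def build_test_command(dataset_root, batch_size, init_port,
                       name="experiment", model="retinexltifpm",
                       dataset_name="IHD"):
    """Return the test.py invocation from the README as an argument list."""
    return [
        "python", "test.py",
        "--model", model,
        "--name", name,
        "--dataset_root", str(dataset_root),
        "--dataset_name", dataset_name,
        "--batch_size", str(batch_size),
        "--init_port", str(init_port),
    ]


def checkpoint_path(name="experiment", checkpoints_dir="checkpoints"):
    """Location where the README says net_G.pth should be placed."""
    return Path(checkpoints_dir) / name / "net_G.pth"


def run_test(dataset_root, batch_size, init_port, name="experiment", gpu=0):
    """Check that the checkpoint is in place, then run test.py on one GPU."""
    ckpt = checkpoint_path(name)
    if not ckpt.is_file():
        raise FileNotFoundError(f"expected pre-trained weights at {ckpt}")
    # Same effect as prefixing the shell command with CUDA_VISIBLE_DEVICES=<gpu>.
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=str(gpu))
    cmd = build_test_command(dataset_root, batch_size, init_port, name=name)
    return subprocess.run(cmd, env=env, check=True)
```

Calling `run_test(<dataset_dir>, batch_size, init_port)` from the repository root should behave like the shell command, with the added benefit of failing early when `net_G.pth` is missing.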

Evaluation

We provide the evaluation code in evaluation/ih_evaluation.py. Run:

CUDA_VISIBLE_DEVICES=0 python evaluation/ih_evaluation.py --dataroot <dataset_dir> --result_root  results/experiment/test_latest/images/ --evaluation_type our --dataset_name ALL
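For readers unfamiliar with the three reported metrics (PSNR, MSE, fMSE), here is a minimal NumPy sketch. It assumes the definitions commonly used in the image harmonization literature: MSE over the whole image on a 0-255 scale, fMSE as the MSE restricted to the foreground mask, and PSNR derived from the whole-image MSE. The function names are illustrative, and ih_evaluation.py may differ in details such as resizing or averaging.

```python
import numpy as np


def mse(pred, target):
    """Mean squared error over all pixels and channels (0-255 scale)."""
    diff = pred.astype(np.float64) - target.astype(np.float64)
    return float(np.mean(diff ** 2))


def fmse(pred, target, mask):
    """Foreground MSE: squared error averaged over masked pixels only.

    `mask` is an HxW array whose non-zero entries mark the foreground.
    """
    fg = mask.astype(bool)
    if not fg.any():
        return 0.0  # assumption: an empty foreground contributes no error
    diff = pred.astype(np.float64)[fg] - target.astype(np.float64)[fg]
    return float(np.mean(diff ** 2))


def psnr(pred, target, max_val=255.0):
    """Peak signal-to-noise ratio in dB, computed from the whole-image MSE."""
    err = mse(pred, target)
    if err == 0:
        return float("inf")
    return float(10.0 * np.log10(max_val ** 2 / err))
```

Because fMSE averages only over the composited region, it is usually much larger than MSE when the foreground is small, which matches the gap between the two rows in the results table.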

Quantitative Result

Dataset	Metric	Composite	Ours (iHarmony4)	Ours (iHarmony4+HVIDIT)
HCOCO	PSNR	33.99	37.61	37.77
HCOCO	MSE	69.37	23.25	21.84
HCOCO	fMSE	996.59	386.39	367.38
HAdobe5k	PSNR	28.52	36.20	36.49
HAdobe5k	MSE	345.54	42.21	39.53
HAdobe5k	fMSE	2051.61	296.76	266.49
HFlickr	PSNR	28.43	31.74	32.08
HFlickr	MSE	264.35	100.86	96.87
HFlickr	fMSE	1574.37	676.71	635.60
Hday2night	PSNR	34.36	36.48	36.60
Hday2night	MSE	109.65	50.64	50.37
Hday2night	fMSE	1409.98	755.88	763.33
HVIDIT	PSNR	38.72	-	41.83
HVIDIT	MSE	53.12	-	22.49
HVIDIT	fMSE	1604.41	-	691.06
ALL	PSNR	32.07	36.53	36.96
ALL	MSE	167.39	37.95	35.33
ALL	fMSE	1386.12	399.34	388.50
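To read the table as relative improvements, the short sketch below computes the fractional MSE reduction of the iHarmony4+HVIDIT model over the unharmonized composite. The numbers are copied from the table above; the helper `relative_reduction` is only illustrative.

```python
# MSE values copied from the table: (Composite, Ours trained on iHarmony4+HVIDIT)
MSE = {
    "HCOCO": (69.37, 21.84),
    "HAdobe5k": (345.54, 39.53),
    "HFlickr": (264.35, 96.87),
    "Hday2night": (109.65, 50.37),
    "HVIDIT": (53.12, 22.49),
    "ALL": (167.39, 35.33),
}


def relative_reduction(before, after):
    """Fraction of the original error removed, e.g. 0.75 means 75% lower."""
    return (before - after) / before


for name, (composite, ours) in MSE.items():
    print(f"{name:<11} MSE reduced by {100 * relative_reduction(composite, ours):5.1f}%")
```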

Bibtex

If you use this code for your research, please cite our paper.

@InProceedings{Guo_2021_CVPR,
    author    = {Guo, Zonghui and Zheng, Haiyong and Jiang, Yufeng and Gu, Zhaorui and Zheng, Bing},
    title     = {Intrinsic Image Harmonization},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {16367-16376}
}

Acknowledgement

We acknowledge the DoveNet and CycleGAN repositories for some of the data modules and model functions used in this source code.

Comments

Model Inference

Hello, is there a way to run inference by reading an image and passing the image and its mask directly to the model to get the harmonized output, without having to store the image's path in a text file, read it back from that file, and then load the image?

opened by AhmedHashish123 2

visdom interface is blank

First, thanks for your excellent work! When I execute the training code, the visdom interface does not display the result images or the training loss. It works when I execute the DoveNet code. Could you tell me how to solve this problem? Thanks again.

opened by Ligouhi 0

Releases (v1.0)

v1.0 (Feb 9, 2022)

Code release accompanying our CVPR 2021 paper [Paper].

Owner

VISION @ OUC
