Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm under Mixed Illumination

Last update: Jan 04, 2023

Overview

Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm under Mixed Illumination (ICCV 2021)

Dataset License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

About

[Project site] [Arxiv] [Download Dataset] [Video]

This is the official repository for "Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm under Mixed Illumination", which was accepted as a poster at ICCV 2021.

This repository provides:

Preprocessing code for the "Large Scale Multi-Illuminant (LSMI) Dataset"
Code for the pixel-level illumination inference U-Net
Pre-trained model parameters for testing the U-Net

Requirements

Our running environment is as follows:

Python version 3.8.3
PyTorch version 1.7.0
CUDA version 11.2

We provide a Docker image that includes the Python, PyTorch, and CUDA versions listed above, along with all extra requirements (e.g., dcraw, rawpy, tensorboard).

You can download the docker image here.

The following instructions assume you are working in a Docker container created from the image we provide.

Getting Started

Clone this repo

In the docker container, clone this repository first.

git clone https://github.com/DY112/LSMI-dataset.git

Download the LSMI dataset

First, download the LSMI dataset from here.

The dataset is composed of three sub-folders named "galaxy", "nikon", and "sony".

Each camera folder contains several scenes, and each scene folder contains full-resolution RAW files and JPG files that have been converted to the sRGB color space.

Move all three folders to the root of the cloned repository.
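
Before preprocessing, it can help to confirm that the three camera folders are in place. The following is a minimal sketch and is not part of this repository; the function name `find_missing_cameras` is hypothetical, and only the folder names come from the description above.

```python
from pathlib import Path

# Camera folder names as described above.
CAMERAS = ("galaxy", "nikon", "sony")


def find_missing_cameras(repo_root):
    """Return the camera folders that are not present under repo_root."""
    root = Path(repo_root)
    return [name for name in CAMERAS if not (root / name).is_dir()]


if __name__ == "__main__":
    # Run from the root of the cloned repository.
    missing = find_missing_cameras(".")
    if missing:
        print("Missing camera folders:", ", ".join(missing))
    else:
        print("All camera folders found.")
```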

Preprocess the LSMI dataset

Convert raw images to tiff files

To convert the original 1-channel Bayer-pattern images to 3-channel RGB TIFF images, run the following command:
```
python 0_cvt2tiff.py
```
Set the SOURCE and EXT variables in the script to match your data before running it.

The converted TIFF files are written to the same location as their source files.
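
To illustrate the output convention, here is a hedged sketch that lists which files a conversion pass would touch and where each output would land. It does not perform any conversion and does not reproduce 0_cvt2tiff.py; the helper name `plan_tiff_outputs` and the `.tiff` suffix are assumptions, and its two parameters only mirror the roles of SOURCE and EXT.

```python
from pathlib import Path


def plan_tiff_outputs(source, ext):
    """Map each file with extension `ext` under `source` to a .tiff path
    in the same folder (assumed naming; no conversion is performed)."""
    suffix = "." + ext.lstrip(".").lower()
    plan = {}
    for path in sorted(Path(source).rglob("*")):
        if path.is_file() and path.suffix.lower() == suffix:
            plan[path] = path.with_suffix(".tiff")
    return plan
```

Printing the returned mapping before running the real script is a cheap way to check that SOURCE and EXT select the files you expect.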

Make mixture map
```
python 1_make_mixture_map.py
```
Set the CAMERA variable to the camera directory you want to process.

Mixture map data in .npy format will be generated in each scene's directory.
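
As a rough illustration of how a mixture map can be used, the sketch below blends several illuminant colors per pixel. It is not taken from 1_make_mixture_map.py: the array layout, (H, W, K) weights that sum to 1 at each pixel, and the function name `illumination_map` are assumptions made for this example.

```python
import numpy as np


def illumination_map(mixture, illuminants):
    """Blend K illuminant colors per pixel.

    mixture:     (H, W, K) weights, assumed to sum to 1 at each pixel.
    illuminants: (K, 3) RGB color of each light source.
    Returns an (H, W, 3) per-pixel illuminant color map.
    """
    mixture = np.asarray(mixture, dtype=np.float64)
    illuminants = np.asarray(illuminants, dtype=np.float64)
    # Matrix product over the last axis: each pixel's weights times the K colors.
    return mixture @ illuminants
```

A saved map can be read back with `np.load`; check its shape first, since the layout assumed here may differ from what the script writes.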

Crop
```
python 2_preprocess_data.py
```
The image and the mixture map are resized to a SIZE x SIZE square, where SIZE is a variable inside the script, and the ground-truth image is also generated.

We set SIZE to 256 for testing the U-Net and to 512 for training it.

To test the pre-trained U-Net, set SIZE to 256.

The new dataset is created in a folder named CAMERA_SIZE (e.g., galaxy_256).
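
The naming rule and the square resize can be sketched as follows. This is an illustration only: the interpolation used by 2_preprocess_data.py is not stated here, so nearest-neighbor sampling is an assumption, and both function names are hypothetical.

```python
import numpy as np


def output_folder(camera, size):
    """Build the CAMERA_SIZE folder name, e.g. 'galaxy_256'."""
    return f"{camera}_{size}"


def resize_square_nearest(array, size):
    """Resize an (H, W, ...) array to (size, size, ...) by nearest-neighbor
    sampling (assumed method, chosen here for simplicity)."""
    array = np.asarray(array)
    h, w = array.shape[:2]
    rows = np.arange(size) * h // size  # source row for each output row
    cols = np.arange(size) * w // size  # source column for each output column
    return array[rows[:, None], cols]
```

Nearest-neighbor sampling avoids averaging neighboring mixture weights, but the real script may make a different choice.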

Use U-Net for pixel-level AWB

You can download the pre-trained model parameters here.

The pre-trained model was trained on 512x512 data with random-crop and random pixel-level relighting augmentation.

Place the downloaded models folder inside SVWB_Unet.
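
For readers new to pixel-level AWB, the sketch below shows the standard diagonal correction applied per pixel: each pixel is divided by its own illuminant color, normalized so the green channel is unchanged. It is a generic illustration, not the code in SVWB_Unet; the function name `apply_pixelwise_awb` and the (H, W, 3) layout of the illuminant map are assumptions.

```python
import numpy as np


def apply_pixelwise_awb(image, illum_map, eps=1e-8):
    """White-balance a linear RGB image with a per-pixel illuminant map.

    image, illum_map: arrays of shape (H, W, 3).
    Gains are G / illuminant per channel, so the green gain is 1.
    """
    image = np.asarray(image, dtype=np.float64)
    illum = np.asarray(illum_map, dtype=np.float64)
    # eps guards against division by zero in dark illuminant estimates.
    gains = illum[..., 1:2] / np.maximum(illum, eps)
    return image * gains
```

With this convention, a white surface lit by the estimated illuminant comes out with equal R, G, and B values.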

Test U-Net
```
cd SVWB_Unet
sh test.sh
```

Train U-Net
```
cd SVWB_Unet
sh train.sh
```

Owner

DongYoung Kim
