Official code for "Stereo Waterdrop Removal with Row-wise Dilated Attention (IROS2021)"

Last update: Oct 01, 2022

Related tags

Deep Learning Stereo-Waterdrop-Removal

Overview

Stereo-Waterdrop-Removal-with-Row-wise-Dilated-Attention

This repository includes official codes for "Stereo Waterdrop Removal with Row-wise Dilated Attention (IROS2021)".

Stereo Waterdrop Removal with Row-wise Dilated Attention
Zifan Shi, Na Fan, Dit-Yan Yeung, Qifeng Chen
HKUST

[Paper] [Datasets]

Introduction

Existing vision systems for autonomous driving or robots are sensitive to waterdrops adhered to windows or camera lenses. Most recent waterdrop removal approaches take a single image as input and often fail to recover the missing content behind waterdrops faithfully. Thus, we propose a learning-based model for waterdrop removal with stereo images. A real-world dataset that contains stereo images with and without waterdrops is provided to benefit the related research.

Installation

Clone this repo.

git clone https://github.com/VivianSZF/Stereo-Waterdrop-Removal.git
cd Stereo-Waterdrop-Removal/

We have tested our code on Ubuntu 18.04 LTS with PyTorch 1.6.0 and CUDA 10.2. Please install dependencies by

conda env create -f environment.yml

Datasets

The dataset can be downloaded from the link.

'train', 'val' and 'test' refer to training set, validation set and test set captured by ZED 2. 'test_mynt' contains test images from MYNT EYE camera. In each folder, '000' denotes the waterdrop-free image (Ground truth). 'xxx_0' is the left image while 'xxx_1' is the right image. The dataset can be put under the 'dataset' folder.

Training

The arguments for training are listed in train.py. To train the model, run with the following code

sh train.sh

The checkpoints and the validation ressults will be saved into ./result/{exp_name}/train/.

Test

Download the pretrained checkpoints and put them under ./result/{exp_name}/train/. The arguments for test are listed in test.py. You can specify them in test.sh and run the command

sh test.sh

The output images are available under ./result/{exp_name}/test/

Citation

@inproceedings{shi2021stereo,
  title = {Stereo Waterdrop Removal with Row-wise Dilated Attention},
  author = {Shi, Zifan and Fan, Na and Yeung, Dit-Yan and Chen, Qifeng},
  booktitle = {IROS},
  year = {2021}
}

Official code for "Stereo Waterdrop Removal with Row-wise Dilated Attention (IROS2021)"

Related tags

Overview

Stereo-Waterdrop-Removal-with-Row-wise-Dilated-Attention

Introduction

Installation

Datasets

Training

Test

Citation

Owner

A check for whether the dependency jobs are all green.

Shōgun

Convert game ISO and archives to CD CHD for emulation on Linux.

Official implementation of "CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding" (CVPR, 2022)

keyframes-CNN-RNN(action recognition)

Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch

A neuroanatomy-based augmented reality experience powered by computer vision. Features 3D visuals of the Atlas Brain Map slices.

Group-Free 3D Object Detection via Transformers

Car Parking Tracker Using OpenCv

Pytorch implementation of paper "Learning Co-segmentation by Segment Swapping for Retrieval and Discovery"

GAN-based Matrix Factorization for Recommender Systems

A curated list and survey of awesome Vision Transformers.

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

Efficient semidefinite bounds for multi-label discrete graphical models.

A TikTok-like recommender system for GitHub repositories based on Gorse

A simple image/video to Desmos graph converter run locally

ViSER: Video-Specific Surface Embeddings for Articulated 3D Shape Reconstruction

Towards Improving Embedding Based Models of Social Network Alignment via Pseudo Anchors

Activity image-based video retrieval

List some popular DeepFake models e.g. DeepFake, FaceSwap-MarekKowal, IPGAN, FaceShifter, FaceSwap-Nirkin, FSGAN, SimSwap, CihaNet, etc.