Implementation for "Exploiting Aliasing for Manga Restoration" (CVPR 2021)

Last update: Dec 15, 2022

Related tags

Overview

[CVPR Paper](To appear) | [Project Website](To appear) | BibTex

Introduction

As a popular entertainment art form, manga enriches the line drawings details with bitonal screentones. However, manga resources over the Internet usually show screentone artifacts because of inappropriate scanning/rescaling resolution. In this paper, we propose an innovative two-stage method to restore quality bitonal manga from degraded ones. Our key observation is that the aliasing induced by downsampling bitonal screentones can be utilized as informative clues to infer the original resolution and screentones. First, we predict the target resolution from the degraded manga via the Scale Estimation Network (SE-Net) with spatial voting scheme. Then, at the target resolution, we restore the region-wise bitonal screentones via the Manga Restoration Network (MR-Net) discriminatively, depending on the degradation degree. Specifically, the original screentones are directly restored in pattern-identifiable regions, and visually plausible screentones are synthesized in pattern-agnostic regions. Quantitative evaluation on synthetic data and visual assessment on real-world cases illustrate the effectiveness of our method.

Example Results

Belows shows an example of our restored manga image. The image comes from the Manga109 dataset.

Pretrained models

Download the models below and put it under release_model/.

MangaRestoration

Run

Requirements:
- Install python3.6
- Install pytorch (tested on Release 1.1.0)
Testing:
- Place your test images under datazip/manga1/test.
- Prepare images filelist using flist.py.
- Modify manga.json to set path to data.
- Run python testreal.py -c [config_file] -n [model_name] -s [image_size] .
- For example, python testreal.py -c configs/manga.json -n resattencv -s 256
- You can also use python testreal.py -c [config_file] -n [model_name] -s [image_size] -sl [scale] to specify the scale factor.
- Note that the Convex interpolation refinement requires large GPU memory, you can enable it by setting (bilinear=False) in MangaRestorator to restore images. Defaultly, we set bilinear=True.

Citation

If any part of our paper and code is helpful to your work, please generously cite with:

@inproceedings{xie2021exploiting,
  author = {Minshan Xie and Menghan Xia and Tien-Tsin Wong},
  title = {Exploiting Aliasing for Manga Restoration},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2021}
}

Implementation for "Exploiting Aliasing for Manga Restoration" (CVPR 2021)

Related tags

Overview

[CVPR Paper](To appear) | [Project Website](To appear) | BibTex

Introduction

Example Results

Pretrained models

Run

Citation

Reference

Owner

Arabic Car License Recognition. A solution to the kaggle competition Machathon 3.0.

For visualizing the dair-v2x-i dataset

Official implementation of "UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer"

Histocartography is a framework bringing together AI and Digital Pathology

Code for the paper "TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks"

Implementations of LSTM: A Search Space Odyssey variants and their training results on the PTB dataset.

📚 A collection of all the Deep Learning Metrics that I came across which are not accuracy/loss.

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

A naive ROS interface for visualDet3D.

A pytorch reproduction of { Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation }.

Help you understand Manual and w/ Clutch point while driving.

SemiNAS: Semi-Supervised Neural Architecture Search

Code for the paper "Learning-Augmented Algorithms for Online Steiner Tree"

Caffe: a fast open framework for deep learning.

A Protein-RNA Interface Predictor Based on Semantics of Sequences

Voice control for Garry's Mod

Google-drive-to-sqlite - Create a SQLite database containing metadata from Google Drive

Annotate datasets with a semi-trained or fully trained YOLOv5 model

6D Grasping Policy for Point Clouds

This repository contains the source code for the paper First Order Motion Model for Image Animation