Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

Last update: Dec 27, 2022

Related tags

Deep Learning vs-realesrgan

Overview

Real-ESRGAN

Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

Ported from https://github.com/xinntao/Real-ESRGAN

Dependencies

NumPy
PyTorch, preferably with CUDA. Note that torchvision and torchaudio are not required and hence can be omitted from the command.
VapourSynth

Installation

pip install --upgrade vsrealesrgan
python -m vsrealesrgan

Usage

from vsrealesrgan import RealESRGAN

ret = RealESRGAN(clip)

See __init__.py for the description of the parameters.

Comments

Installing on portable vapoursynth?
I'm getting this error:

` python -m pip install --upgrade vsrealesrgan Collecting vsrealesrgan Using cached vsrealesrgan-3.1.0-py3-none-any.whl (7.4 kB) Collecting tqdm Using cached tqdm-4.64.0-py2.py3-none-any.whl (78 kB) Requirement already satisfied: numpy in d:\vapoursynth\lib\site-packages (from vsrealesrgan) (1.22.3) Collecting VapourSynth>=55 Using cached VapourSynth-58.zip (558 kB) Preparing metadata (setup.py) ... error error: subprocess-exited-with-error

× python setup.py egg_info did not run successfully. │ exit code: 1 ╰─> [15 lines of output] Traceback (most recent call last): File "C:\Users*\AppData\Local\Temp\pip-install-2415kpn4\vapoursynth_712c69d39f4a4718a3f6b523a85b39eb\setup.py", line 64, in dll_path = query(winreg.HKEY_LOCAL_MACHINE, REGISTRY_PATH, REGISTRY_KEY) File "C:\Users*\AppData\Local\Temp\pip-install-2415kpn4\vapoursynth_712c69d39f4a4718a3f6b523a85b39eb\setup.py", line 38, in query reg_key = winreg.OpenKey(hkey, path, 0, winreg.KEY_READ) FileNotFoundError: [WinError 2] The system cannot find the file specified

During handling of the above exception, another exception occurred: Traceback (most recent call last): File "<string>", line 2, in <module> File "<pip-setuptools-caller>", line 34, in <module> File "C:\Users\**\AppData\Local\Temp\pip-install-2415kpn4\vapoursynth_712c69d39f4a4718a3f6b523a85b39eb\setup.py", line 67, in <module> raise OSError("Couldn't detect vapoursynth installation path") OSError: Couldn't detect vapoursynth installation path [end of output]

note: This error originates from a subprocess, and is likely not a problem with pip. error: metadata-generation-failed

× Encountered error while generating package metadata. ╰─> See above for output.

note: This is an issue with the package mentioned above, not pip. hint: See above for details. `
opened by manus693 8
'vapoursynth.VideoFrame' object is not subscriptable

Error on frame 15 request: 'vapoursynth.VideoFrame' object is not subscriptable

py3.6.4 vs.core.version: VapourSynth Video Processing Library\nCopyright (c) 2012-2018 Fredrik Mellbin\nCore R44\nAPI R3.5\nOptions: -\n torch.version: 1.10.0+cu111

vpy: import vapoursynth as vs import sys sys.path.append("C:\C\Transcoding\VapourSynth\core64\plugins\Scripts") import mvsfunc as mvf sys.path.append(r"C:\Users\liujing\AppData\Local\Programs\Python\Python36\Lib\site-packages\vsrealesrgan") from vsrealesrgan import RealESRGAN

core = vs.get_core(accept_lowercase=True) source = core.ffms2.Source(sourcename) source = mvf.ToRGB(source,depth=32) source = RealESRGAN(source) source= mvf.ToYUV(source,depth=16) source.set_output()

opened by splinter21 4

TensorRT "Ran out of input"?

Using:

# Imports
import vapoursynth as vs
# getting Vapoursynth core
core = vs.core
import site
import os
# Adding torch dependencies to PATH
path = site.getsitepackages()[0]+'/torch_dependencies/'
path = path.replace('\\', '/')
os.environ["PATH"] = path + os.pathsep + os.environ["PATH"]
# Loading Plugins
core.std.LoadPlugin(path="i:/Hybrid/64bit/vsfilters/Support/fmtconv.dll")
core.std.LoadPlugin(path="i:/Hybrid/64bit/vsfilters/SourceFilter/LSmashSource/vslsmashsource.dll")
# source: 'G:\TestClips&Co\files\test.avi'
# current color space: YUV420P8, bit depth: 8, resolution: 640x352, fps: 25, color matrix: 470bg, yuv luminance scale: limited, scanorder: progressive
# Loading G:\TestClips&Co\files\test.avi using LWLibavSource
clip = core.lsmas.LWLibavSource(source="G:/TestClips&Co/files/test.avi", format="YUV420P8", stream_index=0, cache=0, prefer_hw=0)
# Setting color matrix to 470bg.
clip = core.std.SetFrameProps(clip, _Matrix=5)
clip = clip if not core.text.FrameProps(clip,'_Transfer') else core.std.SetFrameProps(clip, _Transfer=5)
clip = clip if not core.text.FrameProps(clip,'_Primaries') else core.std.SetFrameProps(clip, _Primaries=5)
# Setting color range to TV (limited) range.
clip = core.std.SetFrameProp(clip=clip, prop="_ColorRange", intval=1)
# making sure frame rate is set to 25
clip = core.std.AssumeFPS(clip=clip, fpsnum=25, fpsden=1)
clip = core.std.SetFrameProp(clip=clip, prop="_FieldBased", intval=0)
original = clip
from vsrealesrgan import RealESRGAN
# adjusting color space from YUV420P8 to RGBH for VsRealESRGAN
clip = core.resize.Bicubic(clip=clip, format=vs.RGBH, matrix_in_s="470bg", range_s="limited")
# resizing using RealESRGAN
clip = RealESRGAN(clip=clip, device_index=0, trt=True, trt_cache_path="G:/Temp", num_streams=4) # 2560x1408
# resizing 2560x1408 to 640x352
# adjusting resizing
clip = core.resize.Bicubic(clip=clip, format=vs.RGBS, range_s="limited")
clip = core.fmtc.resample(clip=clip, w=640, h=352, kernel="lanczos", interlaced=False, interlacedd=False)
original = core.resize.Bicubic(clip=original, width=640, height=352)
# adjusting output color from: RGBS to YUV420P8 for x264Model
clip = core.resize.Bicubic(clip=clip, format=vs.YUV420P8, matrix_s="470bg", range_s="limited", dither_type="error_diffusion")
original = core.text.Text(clip=original,text="Original",scale=1,alignment=7)
clip = core.text.Text(clip=clip,text="Filtered",scale=1,alignment=7)
stacked = core.std.StackHorizontal([original,clip])
# Output
stacked.set_output()

I get

Failed to evaluate the script: Python exception: Ran out of input

Traceback (most recent call last):
File "src\cython\vapoursynth.pyx", line 2866, in vapoursynth._vpy_evaluate
File "src\cython\vapoursynth.pyx", line 2867, in vapoursynth._vpy_evaluate
File "C:\Users\Selur\Desktop\test_2.vpy", line 32, in 
clip = RealESRGAN(clip=clip, device_index=0, trt=True, trt_cache_path="G:/Temp", num_streams=4) # 2560x1408
File "I:\Hybrid\64bit\Vapoursynth\Lib\site-packages\torch\autograd\grad_mode.py", line 27, in decorate_context
return func(*args, **kwargs)
File "I:\Hybrid\64bit\Vapoursynth\Lib\site-packages\vsrealesrgan\__init__.py", line 284, in RealESRGAN
module = [torch.load(trt_engine_path) for _ in range(num_streams)]
File "I:\Hybrid\64bit\Vapoursynth\Lib\site-packages\vsrealesrgan\__init__.py", line 284, in 
module = [torch.load(trt_engine_path) for _ in range(num_streams)]
File "I:\Hybrid\64bit\Vapoursynth\Lib\site-packages\torch\serialization.py", line 795, in load
return _legacy_load(opened_file, map_location, pickle_module, **pickle_load_args)
File "I:\Hybrid\64bit\Vapoursynth\Lib\site-packages\torch\serialization.py", line 1002, in _legacy_load
magic_number = pickle_module.load(f, **pickle_load_args)
EOFError: Ran out of input

Works fine with trt=False.

->Any idea what is going wrong there?

opened by Selur 3

[REQ] SwinIR port

Hi there, 1st of all thanks for your great work in porting all those goods to VapourSynth !

Dunno if it's the right place to ask, but it would be great to have SwinIR by @JingyunLiang in VS too:

SwinIR: Image Restoration Using Swin Transformer

Hope that inspires !

opened by forart 1

Vapoursynth R58 support

When trying to install vs-realesrgan in Vapoursynth R58 I get:

I:\Hybrid\64bit\Vapoursynth>python -m pip install --upgrade vsrealesrgan
Collecting vsrealesrgan
  Using cached vsrealesrgan-2.0.0-py3-none-any.whl (12 kB)
Collecting VapourSynth>=55
  Using cached VapourSynth-57.zip (567 kB)
  Preparing metadata (setup.py) ... error
  error: subprocess-exited-with-error

  × python setup.py egg_info did not run successfully.
  │ exit code: 1
  ╰─> [15 lines of output]
      Traceback (most recent call last):
        File "C:\Users\Selur\AppData\Local\Temp\pip-install-7_na63f8\vapoursynth_4864864388024a95a1e8b4adda80b293\setup.py", line 64, in <module>
          dll_path = query(winreg.HKEY_LOCAL_MACHINE, REGISTRY_PATH, REGISTRY_KEY)
        File "C:\Users\Selur\AppData\Local\Temp\pip-install-7_na63f8\vapoursynth_4864864388024a95a1e8b4adda80b293\setup.py", line 38, in query
          reg_key = winreg.OpenKey(hkey, path, 0, winreg.KEY_READ)
      FileNotFoundError: [WinError 2] Das System kann die angegebene Datei nicht finden

      During handling of the above exception, another exception occurred:

      Traceback (most recent call last):
        File "<string>", line 2, in <module>
        File "<pip-setuptools-caller>", line 34, in <module>
        File "C:\Users\Selur\AppData\Local\Temp\pip-install-7_na63f8\vapoursynth_4864864388024a95a1e8b4adda80b293\setup.py", line 67, in <module>
          raise OSError("Couldn't detect vapoursynth installation path")
      OSError: Couldn't detect vapoursynth installation path
      [end of output]

  note: This error originates from a subprocess, and is likely not a problem with pip.
error: metadata-generation-failed

× Encountered error while generating package metadata.
╰─> See above for output.

note: This is an issue with the package mentioned above, not pip.
hint: See above for details.

any idea how to fix this?

opened by Selur 0

'vapoursynth.VideoFrame' object has no attribute 'get_read_array'

I have been trying to use this plugin, however I get the below error when trying to preview the video in VapourSynth Editor r19-mod-2-x86_64

Error on frame 0 request: 'vapoursynth.VideoFrame' object has no attribute 'get_read_array'

The code I am getting this error from is below

from vapoursynth import core
from vsrealesrgan import RealESRGAN
import havsfunc as haf
import vapoursynth as vs
video = core.ffms2.Source(source='EDIT.mkv')
video = haf.QTGMC(video, Preset="slow", MatchPreset="slow", MatchPreset2="slow", SourceMatch=3, TFF=True)
video = core.std.SelectEvery(clip=video, cycle=2, offsets=0)
video = core.std.Crop(clip=video, left=8, right=8, top=0, bottom=0)
video = core.resize.Spline36(clip=video, width=640, height=480)
video = core.resize.Bicubic(clip=video, format=vs.RGBS, matrix_in_s="470bg", range_s="limited")
video = RealESRGAN(clip=video, device_index=0)
video = core.resize.Bicubic(clip=video, format=vs.YUV420P10, matrix_s="470bg", range_s="limited")
video = core.resize.Spline36(clip=video, width=1440, height=1080)
video = core.std.AssumeFPS(clip=video, fpsnum=30000, fpsden=1001)
video.set_output()

opened by silentsudin 0

Releases(v4.0.1)

v4.0.1(Dec 4, 2022)
Switch to PyTorch again for inference. A few parameters are added and some parameters are removed.

Add official ESRGAN x4 model and realesr-general-x4v3 model.

See Discussions for benchmarks of some models.
Source code(tar.gz)
Source code(zip)
CUDA-11.7_cuDNN-8.6.0_TensorRT-8.5.1.7_win64.7z(447.57 MB)
v3.1.0(Apr 24, 2022)
Replace RealESRGANv2-animevideo model with realesr-animevideov3 model.

Change the default of model to 3.

Source code(tar.gz)
Source code(zip)
v3.0.0(Mar 27, 2022)
Switch to ONNX Runtime for inferencing.

Rename model_type parameter to model.

Rename tile_x, tile_y parameters to tile_w, tile_h.

Remove pre_pad parameter.

Source code(tar.gz)
Source code(zip)
v2.0.0(Dec 26, 2021)
Add a check to make sure model files have been downloaded.

Only VS API4 is supported now.

Add support for RealESRGANv2-animevideo models.

Remove scale and anime parameters.

Add model_type parameter.

Source code(tar.gz)
Source code(zip)
v1.2.0(Sep 8, 2021)
Change tile parameter to tile_x and tile_y.

Add support for VS API4.

Source code(tar.gz)
Source code(zip)
v1.1.0(Sep 1, 2021)
Add model optimized for anime.

Rename half parameter to fp16 for intuition.

Source code(tar.gz)
Source code(zip)
v1.0.0(Aug 16, 2021)
Initial release.

Source code(tar.gz)
Source code(zip)
model(Aug 16, 2021)

Source code(tar.gz)
Source code(zip)
ESRGAN_SRx4_DF2KOST_official-ff704c30.pth(63.82 MB)
realesr-animevideov3.onnx(2.37 MB)
realesr-animevideov3.pth(2.38 MB)
realesr-general-wdn-x4v3.pth(4.65 MB)
realesr-general-x4v3.pth(4.65 MB)
RealESRGANv2-animevideo-xsx2.pth(2.30 MB)
RealESRGANv2-animevideo-xsx4.pth(2.38 MB)
RealESRGAN_x2plus.onnx(63.89 MB)
RealESRGAN_x2plus.pth(63.95 MB)
RealESRGAN_x4plus.onnx(63.90 MB)
RealESRGAN_x4plus.pth(63.93 MB)
RealESRGAN_x4plus_anime_6B.onnx(17.09 MB)
RealESRGAN_x4plus_anime_6B.pth(17.10 MB)

Owner

Holy Wu

GitHub Repository

Learning Logic Rules for Document-Level Relation Extraction

LogiRE Learning Logic Rules for Document-Level Relation Extraction We propose to introduce logic rules to tackle the challenges of doc-level RE. Equip

41 Dec 26, 2022

Complex-Valued Neural Networks (CVNN)Complex-Valued Neural Networks (CVNN)

Complex-Valued Neural Networks (CVNN) Done by @NEGU93 - J. Agustin Barrachina Using this library, the only difference with a Tensorflow code is that y

1 Nov 12, 2021

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

English | 简体中文 | 繁體中文 | 한국어 State-of-the-art Natural Language Processing for Jax, PyTorch and TensorFlow 🤗 Transformers provides thousands of pretrai

77.4k Jan 05, 2023

Pytorch Implementation of "Desigining Network Design Spaces", Radosavovic et al. CVPR 2020.

RegNet Pytorch Implementation of "Desigining Network Design Spaces", Radosavovic et al. CVPR 2020. Paper | Official Implementation RegNet offer a very

2 Feb 11, 2022

Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR

UniSpeech The family of UniSpeech: UniSpeech (ICML 2021): Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR UniSpeech-

282 Jan 09, 2023

Towards Interpretable Deep Metric Learning with Structural Matching

DIML Created by Wenliang Zhao*, Yongming Rao*, Ziyi Wang, Jiwen Lu, Jie Zhou This repository contains PyTorch implementation for paper Towards Interpr

75 Nov 11, 2022

The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".

Codebase for learning control flow in transformers The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformer

24 Oct 15, 2022

PyTorch implementation of neural style randomization for data augmentation

README Augment training images for deep neural networks by randomizing their visual style, as described in our paper: https://arxiv.org/abs/1809.05375

84 Nov 23, 2022

PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"

Improving Visual-Semantic Embeddings with Hard Negatives Code for the image-caption retrieval methods from VSE++: Improving Visual-Semantic Embeddings

441 Dec 05, 2022

Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"

Easy-To-Hard The official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks". Gett

52 Sep 08, 2022

Re-implememtation of MAE (Masked Autoencoders Are Scalable Vision Learners) using PyTorch.

mae-repo PyTorch re-implememtation of "masked autoencoders are scalable vision learners". In this repo, it heavily borrows codes from codebase https:/

1 Dec 14, 2021

RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation

RIFE RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation Ported from https://github.com/hzwer/arXiv2020-RIFE Dependencies NumPy

49 Jan 07, 2023

Opinionated code formatter, just like Python's black code formatter but for Beancount

beancount-black Opinionated code formatter, just like Python's black code formatter but for Beancount Try it out online here Features MIT licensed - b

16 Oct 11, 2022

Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021

FCL-Taco2: Towards Fast, Controllable and Lightweight Text-to-Speech synthesis (ICASSP 2021) Paper | Demo Block diagram of FCL-taco2, where the decode

39 Sep 28, 2022

This repository contains the implementation of the following paper: Cross-Descriptor Visual Localization and Mapping

Cross-Descriptor Visual Localization and Mapping This repository contains the implementation of the following paper: "Cross-Descriptor Visual Localiza

81 Oct 06, 2022

The LaTeX and Python code for generating the paper, experiments' results and visualizations reported in each paper is available (whenever possible) in the paper's directory

This repository contains the software implementation of most algorithms used or developed in my research. The LaTeX and Python code for generating the

3 Jan 03, 2023

Hands-On Machine Learning for Algorithmic Trading, published by Packt

Hands-On Machine Learning for Algorithmic Trading Hands-On Machine Learning for Algorithmic Trading, published by Packt This is the code repository fo

981 Dec 29, 2022

Repository for open research on optimizers.

Open Optimizers Repository for open research on optimizers. This is a test in sharing research/exploration as it happens. If you use anything from thi

6 Jun 24, 2022

Apply a perspective transformation to a raster image inside Inkscape (no need to use an external software such as GIMP or Krita).

Raster Perspective Apply a perspective transformation to bitmap image using the selected path as envelope, without the need to use an external softwar

19 Dec 22, 2022

一个免费开源一键搭建的通用验证码识别平台，大部分常见的中英数验证码识别都没啥问题。

captcha_server 一个免费开源一键搭建的通用验证码识别平台，大部分常见的中英数验证码识别都没啥问题。使用方法 python = 3.8 以上环境 pip install -r requirements.txt -i https://pypi.douban.com/simple gun

189 Dec 02, 2022

Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

Related tags

Overview

Real-ESRGAN

Dependencies

Installation

Usage

Comments

Installing on portable vapoursynth?

'vapoursynth.VideoFrame' object is not subscriptable

TensorRT "Ran out of input"?

[REQ] SwinIR port

Vapoursynth R58 support

'vapoursynth.VideoFrame' object has no attribute 'get_read_array'

Releases(v4.0.1)

v4.0.1(Dec 4, 2022)

v3.1.0(Apr 24, 2022)

v3.0.0(Mar 27, 2022)

v2.0.0(Dec 26, 2021)

v1.2.0(Sep 8, 2021)

v1.1.0(Sep 1, 2021)

v1.0.0(Aug 16, 2021)

model(Aug 16, 2021)

Owner

Holy Wu

Learning Logic Rules for Document-Level Relation Extraction

Complex-Valued Neural Networks (CVNN)Complex-Valued Neural Networks (CVNN)

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Pytorch Implementation of "Desigining Network Design Spaces", Radosavovic et al. CVPR 2020.

Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR

Towards Interpretable Deep Metric Learning with Structural Matching

The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".

PyTorch implementation of neural style randomization for data augmentation

PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"

Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"

Re-implememtation of MAE (Masked Autoencoders Are Scalable Vision Learners) using PyTorch.

RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Opinionated code formatter, just like Python's black code formatter but for Beancount

Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021

This repository contains the implementation of the following paper: Cross-Descriptor Visual Localization and Mapping

The LaTeX and Python code for generating the paper, experiments' results and visualizations reported in each paper is available (whenever possible) in the paper's directory

Hands-On Machine Learning for Algorithmic Trading, published by Packt

Repository for open research on optimizers.

Apply a perspective transformation to a raster image inside Inkscape (no need to use an external software such as GIMP or Krita).

一个免费开源一键搭建的通用验证码识别平台，大部分常见的中英数验证码识别都没啥问题。