Using VapourSynth with super resolution models and speeding them up with TensorRT.

Last update: Jan 05, 2023

Overview

VSGAN-tensorrt-docker

Use image super resolution models with VapourSynth and speed them up with TensorRT. The project combines NVIDIA/Torch-TensorRT with rlaphoenix/VSGAN and makes tiling and ESRGAN models easy to use. Models can be found on the wiki page. Further model architectures are planned.
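
To make the tiling idea concrete, here is a minimal sketch of how a frame can be cut into padded tiles. It only illustrates the general technique and is not taken from this repo; the function name tile_boxes is hypothetical, and the parameter names simply mirror the tile_x, tile_y and tile_pad arguments that appear in the ESRGAN_inference call quoted further down this page.

```python
from typing import Iterator, Tuple

# (left, top, right, bottom); right and bottom are exclusive
Box = Tuple[int, int, int, int]


def tile_boxes(
    width: int, height: int, tile_x: int, tile_y: int, tile_pad: int
) -> Iterator[Tuple[Box, Box]]:
    """Yield (core, padded) boxes that cover a width x height frame.

    core   : the region whose upscaled pixels would be kept
    padded : the core grown by tile_pad on every side and clamped to the
             frame, so the model sees some context around each tile
    """
    for top in range(0, height, tile_y):
        for left in range(0, width, tile_x):
            right = min(left + tile_x, width)
            bottom = min(top + tile_y, height)
            padded = (
                max(left - tile_pad, 0),
                max(top - tile_pad, 0),
                min(right + tile_pad, width),
                min(bottom + tile_pad, height),
            )
            yield (left, top, right, bottom), padded


if __name__ == "__main__":
    for core, padded in tile_boxes(1920, 1080, 480, 480, 16):
        print(core, padded)
```

Smaller tiles lower peak VRAM use at the cost of more model calls; the padding exists so that tile borders do not show up as seams after upscaling.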

Currently working:

ESRGAN
RealESRGAN (adjust the model loading manually in inference.py; settings are currently not adjusted automatically)

Usage:

# install Docker (command for Arch Linux)
yay -S docker nvidia-docker nvidia-container-toolkit
# put the Dockerfile in a directory and run this inside that directory
docker build -t vsgan_tensorrt:latest .
# run with a mounted folder
docker run --privileged --gpus all -it --rm -v /home/Desktop/tensorrt:/workspace/tensorrt vsgan_tensorrt:latest
# you can use it in various ways, ffmpeg example
vspipe --y4m inference.py - | ffmpeg -i pipe: example.mkv
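
If you would rather drive the same pipe from a Python script (for example to batch several files), the following sketch shows one way to do it with the standard library. The function names build_pipeline and run_pipeline are hypothetical; the command lines are exactly the vspipe and ffmpeg calls shown above. Note that one of the comments below uses vspipe -c y4m instead of --y4m, so the flag may differ between vspipe versions.

```python
import subprocess
from typing import List, Tuple


def build_pipeline(
    script: str = "inference.py", output: str = "example.mkv"
) -> Tuple[List[str], List[str]]:
    """Return the vspipe and ffmpeg argument lists used in the usage example."""
    vspipe_cmd = ["vspipe", "--y4m", script, "-"]
    ffmpeg_cmd = ["ffmpeg", "-i", "pipe:", output]
    return vspipe_cmd, ffmpeg_cmd


def run_pipeline(script: str = "inference.py", output: str = "example.mkv") -> int:
    """Run vspipe | ffmpeg and return a non-zero code if either side failed."""
    vspipe_cmd, ffmpeg_cmd = build_pipeline(script, output)
    producer = subprocess.Popen(vspipe_cmd, stdout=subprocess.PIPE)
    consumer = subprocess.Popen(ffmpeg_cmd, stdin=producer.stdout)
    # Close our copy of the pipe so vspipe notices if ffmpeg exits early.
    producer.stdout.close()
    consumer_rc = consumer.wait()
    producer_rc = producer.wait()
    return producer_rc or consumer_rc


if __name__ == "__main__":
    print(build_pipeline())
```

Checking both return codes matters here: when the script fails, ffmpeg only reports "Invalid data found when processing input", while the real error comes from vspipe.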

If docker does not want to start, try this before you use docker:

# fixing docker errors
systemctl start docker
sudo chmod 666 /var/run/docker.sock

Windows is mostly similar, but the path needs to be changed slightly:

Example for C://path
docker run --privileged --gpus all -it --rm -v //c/path:/workspace/tensorrt vsgan_tensorrt:latest
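
The path rewrite can be automated. The sketch below converts a Windows path into the //c/path form used in the command above; the helper name to_docker_mount_source is hypothetical and the code only handles paths that start with a drive letter.

```python
from pathlib import PureWindowsPath


def to_docker_mount_source(windows_path: str) -> str:
    """Convert e.g. C:\\path into the //c/path form used for -v on Windows."""
    path = PureWindowsPath(windows_path)
    drive = path.drive  # "C:" for a drive-letter path, "" for a relative one
    if len(drive) != 2 or drive[1] != ":":
        raise ValueError(f"expected a path with a drive letter, got {windows_path!r}")
    # parts[0] is the anchor ("C:\\"); the remaining parts are the folders.
    folders = path.parts[1:]
    return "//" + "/".join([drive[0].lower(), *folders])


if __name__ == "__main__":
    source = to_docker_mount_source(r"C:\path")
    print(f"-v {source}:/workspace/tensorrt")
```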

If you don't want to use docker, vapoursynth install commands are here and a TensorRT example is here.

Set the input video path in inference.py and access videos with the mounted folder.

It is also possible to pipe the video directly into mpv, but you most likely won't be able to achieve real-time speed. Change the mounted folder path to your own video folder and use the mpv Dockerfile instead. With a very efficient model, it may be possible on a very good GPU. Only tested on Manjaro.

yay -S pulseaudio

# I am not sure whether this is needed, but go into the PulseAudio settings, check "Make PulseAudio network audio devices discoverable in the local network" and reboot

# start docker
docker run --rm -i -t \
    --network host \
    -e DISPLAY \
    -v /home/Schreibtisch/test/:/home/mpv/media \
    --ipc=host \
    --privileged \
    --gpus all \
    -e PULSE_COOKIE=/run/pulse/cookie \
    -v ~/.config/pulse/cookie:/run/pulse/cookie \
    -e PULSE_SERVER=unix:${XDG_RUNTIME_DIR}/pulse/native \
    -v ${XDG_RUNTIME_DIR}/pulse/native:${XDG_RUNTIME_DIR}/pulse/native \
    vsgan_tensorrt:latest
    
# run mpv
vspipe --y4m inference.py - | mpv -

Comments

Invalid data found when processing input

Hey, when I start the inference.py script, I get the error below.

Can someone help me?


> ffmpeg version N-62110-g4d45f5acbd-static https://johnvansickle.com/ffmpeg/  Copyright (c) 2000-2022 the FFmpeg developers
>   built with gcc 8 (Debian 8.3.0-6)
>   configuration: --enable-gpl --enable-version3 --enable-static --disable-debug --disable-ffplay --disable-indev=sndio --disable-outdev=sndio --cc=gcc --enable-fontconfig --enable-frei0r --enable-gnutls --enable-gmp --enable-libgme --enable-gray --enable-libaom --enable-libfribidi --enable-libass --enable-libvmaf --enable-libfreetype --enable-libmp3lame --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenjpeg --enable-librubberband --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libvorbis --enable-libopus --enable-libtheora --enable-libvidstab --enable-libvo-amrwbenc --enable-libvpx --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxml2 --enable-libdav1d --enable-libxvid --enable-libzvbi --enable-libzimg
>   libavutil      57. 26.100 / 57. 26.100
>   libavcodec     59. 33.100 / 59. 33.100
>   libavformat    59. 24.100 / 59. 24.100
>   libavdevice    59.  6.100 / 59.  6.100
>   libavfilter     8. 40.100 /  8. 40.100
>   libswscale      6.  6.100 /  6.  6.100
>   libswresample   4.  6.100 /  4.  6.100
>   libpostproc    56.  5.100 / 56.  5.100
> Information: Generating grammar tables from /usr/lib/python3.8/lib2to3/Grammar.txt
> Information: Generating grammar tables from /usr/lib/python3.8/lib2to3/PatternGrammar.txt
> Script evaluation failed:
> Python exception: libtorch_cuda_cu.so: cannot open shared object file: No such file or directory
> 
> Traceback (most recent call last):
>   File "src\cython\vapoursynth.pyx", line 2890, in vapoursynth._vpy_evaluate
>   File "src\cython\vapoursynth.pyx", line 2891, in vapoursynth._vpy_evaluate
>   File "inference.py", line 85, in <module>
>     clip = ESRGAN_inference(clip=clip, model_path="/workspace/RealESRGAN_x4plus_anime_6B.pth", tile_x=480, tile_y=480, tile_pad=16, fp16=False, tta=False, tta_mode=1)
>   File "/workspace/tensorrt/src/esrgan.py", line 680, in ESRGAN_inference
>     import torch_tensorrt
>   File "/usr/local/lib/python3.8/dist-packages/torch_tensorrt/__init__.py", line 11, in <module>
>     from torch_tensorrt._compile import *
>   File "/usr/local/lib/python3.8/dist-packages/torch_tensorrt/_compile.py", line 2, in <module>
>     from torch_tensorrt import _enums
>   File "/usr/local/lib/python3.8/dist-packages/torch_tensorrt/_enums.py", line 1, in <module>
>     from torch_tensorrt._C import dtype, DeviceType, EngineCapability, TensorFormat
> ImportError: libtorch_cuda_cu.so: cannot open shared object file: No such file or directory
> 
> pipe:: Invalid data found when processing input

opened by NeoBurgerYT 10
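
The ImportError in the log above comes from the dynamic loader: importing torch_tensorrt needs libtorch_cuda_cu.so and the loader cannot find it. A small check like the following can confirm that from inside the container before running vspipe. It is only a diagnostic sketch (the function name can_load_shared_library is hypothetical) and it does not say why the library is missing.

```python
import ctypes


def can_load_shared_library(name: str) -> bool:
    """Return True if the dynamic loader can find and open the library."""
    try:
        ctypes.CDLL(name)
    except OSError:
        # Raised when the file is missing or one of its dependencies is.
        return False
    return True


if __name__ == "__main__":
    for lib in ("libtorch_cuda_cu.so",):
        print(lib, "loadable" if can_load_shared_library(lib) else "NOT loadable")
```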

Module not found 'scipy'

I can't run my inference.py without getting this error message. Can someone direct me to where I can get the repo?

  File "/usr/local/lib/python3.8/dist-packages/mmedit/core/evaluation/metrics.py", line 7, in <module>
    from scipy.ndimage import convolve
ModuleNotFoundError: No module named 'scipy'

pipe:: Invalid data found when processing input
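
Errors of this kind (a Python module that is not installed in the environment) can be caught before the pipe is started. The sketch below lists which of a set of module names cannot be found; the helper name missing_modules is hypothetical, and the module list in the demo is just the one from this report.

```python
import importlib.util
from typing import Iterable, List


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that are not importable here."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(["scipy"])
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All modules found.")
```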

opened by terminatedkhla 8

Tutorial?

Hi! This is amazing technology! I’m blown away. I’d love to contact you directly on how to use it in colab, I’m quite confused with the process. I’ve tried running it but not sure I’m running it correctly. Thanks in advance!

opened by AIManifest 6

Trying On A M1 Mac

So I followed this tutorial (https://www.youtube.com/watch?v=B134jvhO8yk&t=0s), but when I run docker run --privileged --gpus all -it --rm -v /home/vsgan_path/:/workspace/tensorrt styler00dollar/vsgan_tensorrt:latest, it just gives me an error saying that it can't find the right amd64 or something. I rage-quit and deleted it without seeing the full error. Please help me :(

opened by Ghostkwebb 6
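
The mention of amd64 suggests an architecture mismatch: an M1 Mac is an arm64 machine, and it has no NVIDIA GPU, which --gpus all and TensorRT require. The sketch below compares the host architecture with the architecture an image was built for. That the image is amd64 is an assumption based on the error described above, and the helper names are hypothetical.

```python
import platform
from typing import Optional

# Map the names reported by platform.machine() to Docker's architecture names.
_ALIASES = {
    "x86_64": "amd64",
    "amd64": "amd64",
    "arm64": "arm64",
    "aarch64": "arm64",
}


def docker_arch(machine: Optional[str] = None) -> str:
    """Return the Docker-style architecture name for this (or a given) machine."""
    raw = (machine or platform.machine()).lower()
    return _ALIASES.get(raw, raw)


def host_matches_image(image_arch: str = "amd64", machine: Optional[str] = None) -> bool:
    """True if the host architecture equals the architecture of the image."""
    return docker_arch(machine) == image_arch


if __name__ == "__main__":
    print("host:", docker_arch(), "matches amd64 image:", host_matches_image())
```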

Crash when using RIFE ensemble models in vsmlrt

I get this error

vapoursynth.Error: operator (): expects 8 input planes

from this

import vapoursynth as vs
from vapoursynth import core
core = vs.core
import vsmlrt

clip = core.lsmas.LWLibavSource(source=r"R:\output.mkv",cache=1, prefer_hw=1)
clip = core.resize.Bicubic(clip, matrix_in_s="709", transfer_in_s='709', format=vs.RGBS)
clip = vsmlrt.RIFE(clip, multi=4, model=46, backend=vsmlrt.Backend.TRT(fp16=True), tilesize=[1920,1088])
clip = core.std.AssumeFPS(clip=clip, fpsnum=60, fpsden=1)
clip = core.resize.Bicubic(clip, format=vs.RGB24, matrix_in_s="709")
clip.set_output()

opened by banjaminicc 4

Support for AITemplate?

Something came out recently that looks promising in terms of performance/speed. Would it be possible to implement it for ESRGAN models? https://github.com/facebookincubator/AITemplate

opened by kodxana 4

CUDA out of Memory

System Specs: Ryzen 9 5900HX, NVIDIA 3070 Mobile, Arch Linux (EndeavourOS) on Kernel 5.17.2

Whenever I try to run a model that is relying on CUDA, for example cugan, the program exits with

Error: Failed to retrieve frame 0 with error: CUDA out of memory. Tried to allocate 148.00 MiB (GPU 0; 7.80 GiB total capacity; 5.53 GiB already allocated; 68.56 MiB free; 5.69 GiB reserved in total by PyTorch)

and stops after having output 4 frames.

However, TensorRT works fine for models that support it (like RealESRGAN for example).

Edit: Running nvidia-smi while the command is executing shows that vspipe is allocating GPU memory, but less than 2 GiB of VRAM, far from the 8 GiB my GPU has.
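
The numbers in the error message can be pulled apart to see what PyTorch itself reports. The sketch below parses a message in the format quoted above and returns the sizes in MiB; the function name parse_cuda_oom is hypothetical, and the message wording may differ between PyTorch versions, so treat the regular expressions as an assumption tied to this exact format.

```python
import re
from typing import Dict

# Conversion factors to MiB.
_TO_MIB = {"KiB": 1.0 / 1024.0, "MiB": 1.0, "GiB": 1024.0}

_REQUESTED = re.compile(r"Tried to allocate (\d+(?:\.\d+)?) ([KMG]iB)")
_LABELLED = re.compile(
    r"(\d+(?:\.\d+)?) ([KMG]iB) (total capacity|already allocated|free|reserved)"
)


def parse_cuda_oom(message: str) -> Dict[str, float]:
    """Return the sizes quoted in a PyTorch CUDA OOM message, in MiB."""
    sizes: Dict[str, float] = {}
    requested = _REQUESTED.search(message)
    if requested:
        sizes["requested"] = float(requested.group(1)) * _TO_MIB[requested.group(2)]
    for value, unit, label in _LABELLED.findall(message):
        sizes[label] = float(value) * _TO_MIB[unit]
    return sizes


if __name__ == "__main__":
    msg = (
        "CUDA out of memory. Tried to allocate 148.00 MiB (GPU 0; 7.80 GiB total "
        "capacity; 5.53 GiB already allocated; 68.56 MiB free; 5.69 GiB reserved "
        "in total by PyTorch)"
    )
    for label, mib in parse_cuda_oom(msg).items():
        print(f"{label}: {mib:.2f} MiB")
```

For the message in this report, the requested allocation is larger than the amount reported as free, while most of the card is already reserved by PyTorch.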

opened by mmkzer0 4

No module named 'vsbasicvsrpp'

Traceback (most recent call last):
  File "src\cython\vapoursynth.pyx", line 2832, in vapoursynth._vpy_evaluate
  File "src\cython\vapoursynth.pyx", line 2833, in vapoursynth._vpy_evaluate
  File "inference.py", line 12, in <module>
    from vsbasicvsrpp import BasicVSRPP
ModuleNotFoundError: No module named 'vsbasicvsrpp'

opened by xt851231 4

Google colab request?

I recently stumbled upon this VSGAN-tensorrt-docker and found it so incredible! Could anyone make a google colab notebook that features everything from this VSGAN-tensorrt-docker, so that we could experience the speed of TensorRT! Thanks in advance!

opened by mikebilly 3

model conversion from onnx to trt

@styler00dollar this is not an issue but a question. I read inference.py and found that Real-ESRGAN 2x is loaded from a TRT engine file. Since real-2x uses dynamic shapes as input, could you share any ideas on how to convert this model to TRT? Thanks!
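
One common route for ONNX models with dynamic input shapes is TensorRT's trtexec tool, which takes a minimum, optimal and maximum shape for each dynamic input. The sketch below only assembles such a command line; it is not the conversion used by this repo. The input tensor name "input", the shapes and the engine file name are assumptions for illustration and must be replaced with the values of the actual model.

```python
from typing import List, Sequence


def _shape_flag(flag: str, input_name: str, shape: Sequence[int]) -> str:
    """Format e.g. --minShapes=input:1x3x64x64."""
    return f"--{flag}={input_name}:" + "x".join(str(dim) for dim in shape)


def trtexec_command(
    onnx_path: str,
    engine_path: str,
    input_name: str,
    min_shape: Sequence[int],
    opt_shape: Sequence[int],
    max_shape: Sequence[int],
    fp16: bool = True,
) -> List[str]:
    """Build a trtexec call that creates an engine with a dynamic-shape profile."""
    for low, opt, high in zip(min_shape, opt_shape, max_shape):
        if not low <= opt <= high:
            raise ValueError("each dimension must satisfy min <= opt <= max")
    command = [
        "trtexec",
        f"--onnx={onnx_path}",
        f"--saveEngine={engine_path}",
        _shape_flag("minShapes", input_name, min_shape),
        _shape_flag("optShapes", input_name, opt_shape),
        _shape_flag("maxShapes", input_name, max_shape),
    ]
    if fp16:
        command.append("--fp16")
    return command


if __name__ == "__main__":
    print(" ".join(trtexec_command(
        "sudo_RealESRGAN2x_Dropout_3.799.042_opset16.onnx",  # from the release list
        "model.engine",            # hypothetical output name
        "input",                   # assumed input tensor name
        (1, 3, 64, 64), (1, 3, 720, 1280), (1, 3, 1080, 1920),
    )))
```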

opened by deism 3

ESRGAN with full episode

Hello,

I'm trying to upscale MKV files of full episodes with ESRGAN. I tried using vspipe -c y4m inference.py - | ffmpeg -i pipe: example.mkv, and it seems to run up to the point where it starts to give an ETA. Once there the time doesn't move and eventually, it says it was killed.

Can you give me some tips on how to make this work better? I'm not familiar with most of the tools I've been given.

opened by Ultramonte 2

[SUGGESTION] per-scene processing

Hi there, this project is awesome, so thanks for your (voluntary) work!

Since GAN-based processing is quite a heavy computing task, it could be very useful to split it into multiple "segments" to allow parallel/scalable/collaborative/resumable instances.

We suggest you check @master-of-zen's Av1an framework, which implements it.

Hope that inspires.

opened by forart 1
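
As a rough illustration of the segment idea, the sketch below cuts a clip into fixed-length frame ranges that could be processed (and resumed) independently. This is a simplification: it splits by frame count, not by detected scenes as Av1an does. The helper names are hypothetical, and the second function assumes that your vspipe build accepts --start and --end for selecting an inclusive frame range.

```python
from typing import List, Tuple


def split_frames(total_frames: int, segment_length: int) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) frame ranges covering the whole clip."""
    if total_frames < 0 or segment_length <= 0:
        raise ValueError("total_frames must be >= 0 and segment_length > 0")
    return [
        (start, min(start + segment_length, total_frames) - 1)
        for start in range(0, total_frames, segment_length)
    ]


def vspipe_segment_command(start: int, end: int, script: str = "inference.py") -> List[str]:
    """Build a vspipe call that outputs only frames start..end (inclusive)."""
    return ["vspipe", "--y4m", "--start", str(start), "--end", str(end), script, "-"]


if __name__ == "__main__":
    for first, last in split_frames(total_frames=10, segment_length=4):
        print(" ".join(vspipe_segment_command(first, last)))
```

Each segment would then be encoded to its own file and the files concatenated afterwards; a failed run only needs to redo the segments that are missing.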

Releases(models)

models(Feb 11, 2022)

Just a place to store models. Sources are in the README.

ffmpeg was compiled with markus-perl/ffmpeg-build-script.
Source code(tar.gz)
Source code(zip)
4x_fatal_Anime_500000_G.onnx(63.83 MB)
4x_fatal_Anime_500000_G.pth(63.85 MB)
compact2x_ncnn.tar(2.30 MB)
cugan_pro-conservative-up2x.pth(4.91 MB)
cugan_pro-conservative-up2x_opset13.onnx(4.92 MB)
cugan_pro-conservative-up2x_opset14.onnx(4.92 MB)
cugan_pro-conservative-up2x_opset15.onnx(4.92 MB)
cugan_pro-conservative-up2x_opset16.onnx(4.92 MB)
cugan_pro-conservative-up2x_opset17.onnx(4.92 MB)
cugan_pro-conservative-up3x.pth(4.92 MB)
cugan_pro-conservative-up3x_opset13.onnx(4.93 MB)
cugan_pro-conservative-up3x_opset14.onnx(4.93 MB)
cugan_pro-conservative-up3x_opset15.onnx(4.93 MB)
cugan_pro-conservative-up3x_opset16.onnx(4.93 MB)
cugan_pro-conservative-up3x_opset17.onnx(4.93 MB)
cugan_pro-denoise3x-up2x.pth(4.91 MB)
cugan_pro-denoise3x-up2x_opset13.onnx(4.92 MB)
cugan_pro-denoise3x-up2x_opset14.onnx(4.92 MB)
cugan_pro-denoise3x-up2x_opset15.onnx(4.92 MB)
cugan_pro-denoise3x-up2x_opset16.onnx(4.92 MB)
cugan_pro-denoise3x-up2x_opset17.onnx(4.92 MB)
cugan_pro-denoise3x-up3x.pth(4.92 MB)
cugan_pro-denoise3x-up3x_opset13.onnx(4.93 MB)
cugan_pro-denoise3x-up3x_opset14.onnx(4.93 MB)
cugan_pro-denoise3x-up3x_opset15.onnx(4.93 MB)
cugan_pro-denoise3x-up3x_opset16.onnx(4.93 MB)
cugan_pro-denoise3x-up3x_opset17.onnx(4.93 MB)
cugan_pro-no-denoise-up2x.pth(4.91 MB)
cugan_pro-no-denoise-up2x_opset13.onnx(4.92 MB)
cugan_pro-no-denoise-up2x_opset14.onnx(4.92 MB)
cugan_pro-no-denoise-up2x_opset15.onnx(4.92 MB)
cugan_pro-no-denoise-up2x_opset16.onnx(4.92 MB)
cugan_pro-no-denoise-up2x_opset17.onnx(4.92 MB)
cugan_pro-no-denoise-up3x.pth(4.92 MB)
cugan_pro-no-denoise-up3x_opset13.onnx(4.93 MB)
cugan_pro-no-denoise-up3x_opset14.onnx(4.93 MB)
cugan_pro-no-denoise-up3x_opset15.onnx(4.93 MB)
cugan_pro-no-denoise-up3x_opset16.onnx(4.93 MB)
cugan_pro-no-denoise-up3x_opset17.onnx(4.93 MB)
cugan_pro-no-denoise3x-up3x.pth(4.92 MB)
cugan_pro-no-denoise3x-up3x_opset13.onnx(4.93 MB)
cugan_pro-no-denoise3x-up3x_opset14.onnx(4.93 MB)
cugan_pro-no-denoise3x-up3x_opset15.onnx(4.93 MB)
cugan_pro-no-denoise3x-up3x_opset16.onnx(4.93 MB)
cugan_pro-no-denoise3x-up3x_opset17.onnx(4.93 MB)
cugan_up2x-latest-conservative.pth(4.90 MB)
cugan_up2x-latest-conservative_opset13.onnx(4.92 MB)
cugan_up2x-latest-conservative_opset14.onnx(4.92 MB)
cugan_up2x-latest-conservative_opset15.onnx(4.92 MB)
cugan_up2x-latest-conservative_opset16.onnx(4.92 MB)
cugan_up2x-latest-conservative_opset17.onnx(4.92 MB)
cugan_up2x-latest-denoise1x.pth(4.90 MB)
cugan_up2x-latest-denoise1x_opset13.onnx(4.92 MB)
cugan_up2x-latest-denoise1x_opset14.onnx(4.92 MB)
cugan_up2x-latest-denoise1x_opset15.onnx(4.92 MB)
cugan_up2x-latest-denoise1x_opset16.onnx(4.92 MB)
cugan_up2x-latest-denoise1x_opset17.onnx(4.92 MB)
cugan_up2x-latest-denoise2x.pth(4.90 MB)
cugan_up2x-latest-denoise2x_opset13.onnx(4.92 MB)
cugan_up2x-latest-denoise2x_opset14.onnx(4.92 MB)
cugan_up2x-latest-denoise2x_opset15.onnx(4.92 MB)
cugan_up2x-latest-denoise2x_opset16.onnx(4.92 MB)
cugan_up2x-latest-denoise2x_opset17.onnx(4.92 MB)
cugan_up2x-latest-denoise3x.pth(4.90 MB)
cugan_up2x-latest-denoise3x_opset13.onnx(4.92 MB)
cugan_up2x-latest-denoise3x_opset14.onnx(4.92 MB)
cugan_up2x-latest-denoise3x_opset15.onnx(4.92 MB)
cugan_up2x-latest-denoise3x_opset16.onnx(4.92 MB)
cugan_up2x-latest-denoise3x_opset17.onnx(4.92 MB)
cugan_up2x-latest-no-denoise.pth(4.90 MB)
cugan_up2x-latest-no-denoise_opset13.onnx(4.92 MB)
cugan_up2x-latest-no-denoise_opset14.onnx(4.92 MB)
cugan_up2x-latest-no-denoise_opset15.onnx(4.92 MB)
cugan_up2x-latest-no-denoise_opset16.onnx(4.92 MB)
cugan_up2x-latest-no-denoise_opset17.onnx(4.92 MB)
cugan_up3x-latest-conservative.onnx(4.91 MB)
cugan_up3x-latest-conservative.pth(4.91 MB)
cugan_up3x-latest-denoise3x.onnx(4.91 MB)
cugan_up3x-latest-denoise3x.pth(4.91 MB)
cugan_up3x-latest-no-denoise.onnx(4.91 MB)
cugan_up3x-latest-no-denoise.pth(4.91 MB)
cugan_up4x-latest-conservative.onnx(5.37 MB)
cugan_up4x-latest-conservative.pth(5.37 MB)
cugan_up4x-latest-denoise3x.onnx(5.37 MB)
cugan_up4x-latest-denoise3x.pth(5.37 MB)
cugan_up4x-latest-no-denoise.onnx(5.37 MB)
cugan_up4x-latest-no-denoise.pth(5.37 MB)
DF2K_JPEG_ncnn.tar.gz(29.46 MB)
DF2K_ncnn.tar.gz(29.46 MB)
dpir_drunet_color.onnx(124.52 MB)
dpir_drunet_deblocking_color.onnx(124.52 MB)
dpir_drunet_deblocking_grayscale.onnx(124.51 MB)
dpir_drunet_gray.onnx(124.51 MB)
EGVSR_iter420000.pth(9.89 MB)
eisai_anime_interp_full.ckpt(23.73 MB)
eisai_dtm.pt(56.88 KB)
eisai_ssl.pt(10.53 MB)
ffmpeg(69.71 MB)
ffmpeg_colab(72.76 MB)
FILM.tar.gz(366.17 MB)
GMFSS_union_fusionnet_vanilla.pkl(7.92 MB)
GMFSS_union_fusionnet_wgan.pkl(7.92 MB)
GMFSS_union_metric_vanilla.pkl(183.07 KB)
GMFSS_union_metric_wgan.pkl(183.07 KB)
GMFupSS_flownet.pkl(18.04 MB)
GMFupSS_fusionnet.pkl(7.88 MB)
GMFupSS_metric.pkl(158.82 KB)
IFRNet_GoPro.pth(18.94 MB)
IFRNet_L_GoPro.pth(75.16 MB)
IFRNet_L_Vimeo90K.pth(75.16 MB)
IFRNet_S_GoPro.pth(10.71 MB)
IFRNet_S_Vimeo90K.pth(10.71 MB)
IFRNet_Vimeo90K.pth(18.93 MB)
IFUNet.pth(123.46 MB)
M2M.pth(29.10 MB)
PANx2_DF2K.pth(1.02 MB)
PANx3_DF2K.pth(1.02 MB)
PANx4_DF2K.pth(1.06 MB)
RealBasicVSR_x4.pth(200.72 MB)
realesr-animevideov3.onnx(2.37 MB)
realesr-general-wdn-x4v3_opset13.onnx(4.63 MB)
realesr-general-wdn-x4v3_opset14.onnx(4.63 MB)
realesr-general-wdn-x4v3_opset15.onnx(4.63 MB)
realesr-general-wdn-x4v3_opset16.onnx(4.63 MB)
RealESRGANv2-animevideo-xsx2.pth(2.30 MB)
RealESRGANv2-animevideo-xsx2_opset13.onnx(2.29 MB)
RealESRGANv2-animevideo-xsx2_opset15.onnx(2.29 MB)
RealESRGANv2-animevideo-xsx2_opset16.onnx(2.29 MB)
RealESRGANv2-animevideo-xsx4.onnx(2.37 MB)
RealESRGANv2-animevideo-xsx4.pth(2.38 MB)
RealESRGAN_x4plus_anime_6B.pth(17.10 MB)
RealESRGAN_x4plus_anime_6B_opset13.onnx(17.08 MB)
RealESRGAN_x4plus_anime_6B_opset14.onnx(17.08 MB)
RealESRGAN_x4plus_anime_6B_opset15.onnx(17.08 MB)
RealESRGAN_x4plus_anime_6B_opset16.onnx(17.08 MB)
rife40.pth(32.15 MB)
rife40_ensembleFalse_fastTrue_opset16.onnx(19.76 MB)
rife40_ensembleTrue_fastFalse_opset16.onnx(32.29 MB)
rife41.pth(32.15 MB)
rife41_ensembleFalse_fastTrue_opset16.onnx(19.77 MB)
rife41_ensembleTrue_fastFalse_opset16.onnx(32.30 MB)
rife42.pth(32.11 MB)
rife42_ensembleFalse_fastTrue_opset16.onnx(19.74 MB)
rife42_ensembleTrue_fastFalse_opset16.onnx(32.27 MB)
rife43.pth(32.11 MB)
rife43_ensembleFalse_fastTrue_opset16.onnx(19.74 MB)
rife43_ensembleTrue_fastFalse_opset16.onnx(32.27 MB)
rife44.pth(32.11 MB)
rife44_ensembleFalse_fastTrue_opset16.onnx(19.74 MB)
rife44_ensembleTrue_fastFalse_opset16.onnx(32.27 MB)
rife45.pth(20.16 MB)
rife45_ensembleFalse_opset16.onnx(20.21 MB)
rife45_ensembleTrue_opset16.onnx(20.25 MB)
rife46.pth(20.28 MB)
rife46_ensembleFalse_opset16.onnx(20.31 MB)
rife46_ensembleFalse_opset17.onnx(20.32 MB)
rife46_ensembleTrue_opset16.onnx(20.34 MB)
rife46_ensembleTrue_opset17.onnx(20.37 MB)
rvpV1_105661_G.pt(68.09 MB)
rvpV1_105661_G.pth(67.61 MB)
scunet_color_15.pth(68.64 MB)
scunet_color_25.pth(68.64 MB)
scunet_color_50.pth(68.64 MB)
scunet_color_real_gan.pth(68.64 MB)
scunet_color_real_psnr.pth(68.64 MB)
sc_efficientformerv2_s0+rife46_84119_224.pth(12.92 MB)
sc_efficientformerv2_s0_12263_224.pth(12.91 MB)
sc_efficientformerv2_s0_29735_224.pth(12.91 MB)
sc_efficientnetv2b0+rife46_flow_1362_256.pth(22.75 MB)
sc_efficientnetv2b0_17957_256.pth(22.73 MB)
sc_efficientnetv2b0_int8_18964_256.pth(23.21 MB)
sc_maxvit_small+rife46_1512_224.pth(258.29 MB)
sc_maxvit_small_9072_224.pth(258.25 MB)
sc_regnetz_005_33142_256.pth(23.64 MB)
sc_repvgg_b0_7575_256.pth(55.76 MB)
sc_resnetrs50_4840_256.pth(128.69 MB)
sc_resnetv2_50_1815_256.pth(89.97 MB)
sc_rexnet_100_7264_256.pth(13.72 MB)
sc_swinv2_small_window16+rife46_1814_256.pth(192.04 MB)
sc_swinv2_small_window16_10412_256.pth(191.94 MB)
sc_TimeSformer_2592_224.pth(241.98 MB)
sc_uniformerv2_b16_36288_224.pth(513.07 MB)
sepconv.pth(51.76 MB)
stmfnet.pth(80.68 MB)
sudo_RealESRGAN2x_Dropout_3.799.042_opset13.onnx(16.94 MB)
sudo_RealESRGAN2x_Dropout_3.799.042_opset14.onnx(16.94 MB)
sudo_RealESRGAN2x_Dropout_3.799.042_opset15.onnx(16.94 MB)
sudo_RealESRGAN2x_Dropout_3.799.042_opset16.onnx(16.94 MB)
sudo_rife4_269.662_testV1_ensembleFalse_fastTrue.bin(9.86 MB)
sudo_rife4_269.662_testV1_ensembleFalse_fastTrue.param(19.76 KB)
sudo_rife4_269.662_testV1_ensembleTrue_fastFalse.bin(26.49 MB)
sudo_rife4_269.662_testV1_ensembleTrue_fastFalse.param(61.86 KB)
sudo_rife4_269.662_testV1_ensembleTrue_fastTrue.bin(19.72 MB)
sudo_rife4_269.662_testV1_ensembleTrue_fastTrue.param(41.15 KB)
sudo_rife4_269.662_testV1_scale1.pth(32.15 MB)
sudo_UltraCompact_2x_1.121.175_G.pth(1.16 MB)
sudo_UltraCompact_2x_1.121.175_G_opset13.onnx(1.16 MB)
sudo_UltraCompact_2x_1.121.175_G_opset14.onnx(1.16 MB)
sudo_UltraCompact_2x_1.121.175_G_opset15.onnx(1.16 MB)
sudo_UltraCompact_2x_1.121.175_G_opset16.onnx(1.16 MB)
vapsr2x_opset16.onnx(1.35 MB)
vapsr3x_opset16.onnx(1.38 MB)
vapsr4x_opset16.onnx(1.41 MB)
vs_precompiled_colab.7z(112.06 MB)
waifu2x_anime_style_art_noise1_model.onnx(1.09 MB)
waifu2x_anime_style_art_noise2_model.onnx(1.09 MB)
waifu2x_anime_style_art_noise3_model.onnx(1.09 MB)
waifu2x_anime_style_art_rgb_noise0_model.onnx(1.11 MB)
waifu2x_anime_style_art_rgb_noise1_model.onnx(1.11 MB)
waifu2x_anime_style_art_rgb_noise2_model.onnx(1.11 MB)
waifu2x_anime_style_art_rgb_noise3_model.onnx(1.11 MB)
waifu2x_anime_style_art_rgb_scale2.0x_model.onnx(1.11 MB)
waifu2x_anime_style_art_scale2.0x_model.onnx(1.09 MB)
waifu2x_cunet_noise0_model.onnx(4.90 MB)
waifu2x_cunet_noise0_scale2.0x_model.onnx(4.91 MB)
waifu2x_cunet_noise1_model.onnx(4.90 MB)
waifu2x_cunet_noise1_scale2.0x_model.onnx(4.91 MB)
waifu2x_cunet_noise2_model.onnx(4.90 MB)
waifu2x_cunet_noise2_scale2.0x_model.onnx(4.91 MB)
waifu2x_cunet_noise3_model.onnx(4.90 MB)
waifu2x_cunet_noise3_scale2.0x_model.onnx(4.91 MB)
waifu2x_cunet_scale2.0x_model.onnx(4.91 MB)
waifu2x_photo_noise0_model.onnx(1.11 MB)
waifu2x_photo_noise1_model.onnx(1.11 MB)
waifu2x_photo_noise2_model.onnx(1.11 MB)
waifu2x_photo_noise3_model.onnx(1.11 MB)
waifu2x_photo_scale2.0x_model.onnx(1.11 MB)
waifu2x_ukbench_scale2.0x_model.onnx(1.11 MB)
waifu2x_upconv_7_anime_style_art_rgb_noise0_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_anime_style_art_rgb_noise1_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_anime_style_art_rgb_noise2_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_anime_style_art_rgb_noise3_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_anime_style_art_rgb_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_photo_noise0_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_photo_noise1_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_photo_noise2_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_photo_noise3_scale2.0x_model.onnx(2.10 MB)
waifu2x_upconv_7_photo_scale2.0x_model.onnx(2.10 MB)
waifu2x_upresnet10_noise0_scale2.0x_model.onnx(1.61 MB)
waifu2x_upresnet10_noise1_scale2.0x_model.onnx(1.61 MB)
waifu2x_upresnet10_noise2_scale2.0x_model.onnx(1.61 MB)
waifu2x_upresnet10_noise3_scale2.0x_model.onnx(1.61 MB)
waifu2x_upresnet10_scale2.0x_model.onnx(1.61 MB)

Owner

I like Google Colab and Python.
