CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement

Last update: Nov 04, 2022

Related tags

Overview

CBREN

This is the Pytorch implementation for our IEEE TCSVT paper : CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement.

Note: different from the paper, this code adds residual blocks to the pixel-domain branch of DRM module, but it has little impact on the effect of the network.

Because the DCN compilation in Windows environment may cause problems, this code may only run in Linux environment.

Requirements

Python 3.8
PyTorch 1.6.0
Numpy 1.19.2
Pillow 7.2.0
OpenCV 4.4.0.44

Prepare

Build

Deformable convolution is used in this code from: https://github.com/chengdazhi/Deformable-Convolution-V2-PyTorch
Run sh make.sh to compile the deformable convolution. If there are compilation errors, delete the 'build/' directory before recompiling.

Datasets

The directories used in the project need to be created manually.
Download HEVC standard test sequence: https://pan.baidu.com/s/1m0jZfkhX_cjaoFrHlMp0Xg Extraction code: 88n9
Hm16.0 is used to compress the standard test sequence.
The original video and compressed video in YUV format are converted to MP4 format by ffmpeg.
We provided the BasketballPass video in MP4 format as a demonstration in this project.
Run tools/video_get_frames.py to obtain the image sequence in PNG format from the video.

Pretrained models

Pretrained models are available: https://pan.baidu.com/s/1sszHgZ1tYVEu8toyUkFaUw Extraction code: i0zs

Run

Runrun.py, and the generated images are saved in results/.
If the size of GPU memory is not large enough to run the sequences A and B, please run run_group_A&B.py.

CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement

Related tags

Overview

CBREN

Requirements

Prepare

Build

Datasets

Pretrained models

Run

Owner

Zhao Hengrun

Implementation of our recent paper, WOOD: Wasserstein-based Out-of-Distribution Detection.

MoCap-Solver: A Neural Solver for Optical Motion Capture Data

TLDR: Twin Learning for Dimensionality Reduction

Level Based Customer Segmentation

Deep Reinforcement Learning based Trading Agent for Bitcoin

An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions

MAGMA - a GPT-style multimodal model that can understand any combination of images and language

A synthetic texture-invariant dataset for object detection of UAVs

Transfer style api - An API to use with Tranfer Style App, where you can use two image and transfer the style

This is official implementaion of paper "Token Shift Transformer for Video Classification".

Weight estimation in CT by multi atlas techniques

A Simulated Optimal Intrusion Response Game

Code for our CVPR 2021 Paper "Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes".

Using contrastive learning and OpenAI's CLIP to find good embeddings for images with lossy transformations

Unit-Convertor - Unit Convertor Built With Python

Implementation for Panoptic-PolarNet (CVPR 2021)

Train CPPNs as a Generative Model, using Generative Adversarial Networks and Variational Autoencoder techniques to produce high resolution images.

Adversarial vulnerability of powerful near out-of-distribution detection

A Genetic Programming platform for Python with TensorFlow for wicked-fast CPU and GPU support.

This is a clean and robust Pytorch implementation of DQN and Double DQN.