Blind Video Temporal Consistency via Deep Video Prior

Last update: Dec 21, 2022

Overview

deep-video-prior (DVP)

Code for NeurIPS 2020 paper: Blind Video Temporal Consistency via Deep Video Prior

PyTorch implementation | paper | project website

Introduction

Our method is a general framework for improving the temporal consistency of videos processed by image algorithms. For example, combining an image colorization or image dehazing algorithm with our framework yields video colorization or video dehazing.

Dependency

Environment

This code is based on TensorFlow. It has been tested on Ubuntu 18.04 LTS.

Anaconda is recommended: Ubuntu 18.04 | Ubuntu 16.04

After installing Anaconda, you can set up the environment with:

conda env create -f environment.yml
conda activate deep-video-prior
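
Once the environment is active, a quick check that the expected packages can be found may save a failed run later. The following is a minimal sketch and not part of the repository: the helper name `missing_modules` is hypothetical, and the only module it is asked about here, `tensorflow`, comes from the statement above that the code is based on TensorFlow.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    # find_spec locates a module without importing it, so the check is cheap.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # `tensorflow` is taken from the Environment note; extend the list as needed.
    missing = missing_modules(["tensorflow"])
    print("missing:", missing if missing else "none")
```

If the list is not empty, re-running `conda env create -f environment.yml` and re-activating the environment is the first thing to try.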

Download VGG model

cd deep-video-prior
python download_VGG.py
unzip VGG_Model.zip
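
On systems without the `unzip` tool, the extraction step can be done from Python instead. This is a sketch under the assumption that `download_VGG.py` leaves a regular zip archive named `VGG_Model.zip` in the current directory, as the commands above suggest; the helper name `extract_archive` is hypothetical.

```python
import zipfile
from pathlib import Path


def extract_archive(zip_path, dest_dir="."):
    """Extract `zip_path` into `dest_dir` and return the archive member names."""
    zip_path = Path(zip_path)
    if not zip_path.is_file():
        raise FileNotFoundError(f"{zip_path} not found; run download_VGG.py first")
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest_dir)
        return archive.namelist()


if __name__ == "__main__":
    # File name taken from the commands above.
    if Path("VGG_Model.zip").is_file():
        print(extract_archive("VGG_Model.zip"))
```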

Inference

Demo

bash test.sh

The results are placed in ./result

Use your own data

For a video with unimodal inconsistency:

python dvp_video_consistency.py --input PATH_TO_YOUR_INPUT_FOLDER --processed PATH_TO_YOUR_PROCESSED_FOLDER --task NAME_OF_YOUR_MODEL --output ./result/OWN_DATA

For a video with multimodal inconsistency:

python dvp_video_consistency.py --input PATH_TO_YOUR_INPUT_FOLDER --processed PATH_TO_YOUR_PROCESSED_FOLDER --task NAME_OF_YOUR_MODEL --with_IRT 1 --IRT_initialization 1 --output ./result/OWN_DATA
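
When running several videos, it can help to assemble these commands programmatically. The sketch below only rebuilds the two command lines shown above. The helper name `build_dvp_command` and the folder and task names in the usage lines are hypothetical placeholders, not names from the repository.

```python
import shlex


def build_dvp_command(input_dir, processed_dir, task, output_dir, multimodal=False):
    """Assemble the argument list for dvp_video_consistency.py."""
    cmd = [
        "python", "dvp_video_consistency.py",
        "--input", input_dir,
        "--processed", processed_dir,
        "--task", task,
    ]
    if multimodal:
        # Extra flags used above for multimodal inconsistency.
        cmd += ["--with_IRT", "1", "--IRT_initialization", "1"]
    cmd += ["--output", output_dir]
    return cmd


if __name__ == "__main__":
    # Placeholder paths and task name; replace them with your own.
    cmd = build_dvp_command("frames/input", "frames/processed", "my_task",
                            "./result/OWN_DATA", multimodal=True)
    # Print the command; pass `cmd` to subprocess.run(cmd, check=True) to execute it.
    print(shlex.join(cmd))
```

Keeping the command as a list rather than a single string avoids quoting problems when folder names contain spaces.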

Other information

  -h, --help            show this help message and exit
  --task TASK           Name of task
  --input INPUT         Dir of input video
  --processed PROCESSED
                        Dir of processed video
  --output OUTPUT       Dir of output video
  --use_gpu USE_GPU     Use gpu or not
  --loss {perceptual,l1,l2}
                        Chooses which loss to use. perceptual, l1, l2
  --network {unet}      Chooses which model to use. unet, fcn
  --coarse_to_fine_speedup COARSE_TO_FINE_SPEEDUP
                        Use coarse_to_fine_speedup for training
  --with_IRT WITH_IRT   Use IRT or not, set this to 1 if you want to solve
                        multimodal inconsistency
  --IRT_initialization IRT_INITIALIZATION
                        Use initialization for IRT
  --large_video LARGE_VIDEO
                        Set this to 1 when the number of video frames is
                        large, e.g., more than 1000 frames
  --save_freq SAVE_FREQ
                        Save frequency of epochs
  --max_epoch MAX_EPOCH
                        The max number of epochs for training
  --format FORMAT       Format of output image
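
For readers who want to wrap or script these options, the listing above can be mirrored with `argparse`. This is a reconstruction from the help text only, not the repository's actual parser. The integer types are an assumption based on the `--with_IRT 1` usage, and every default is deliberately left as `None` because the help text does not state the defaults.

```python
import argparse


def build_parser():
    """Sketch of a parser mirroring the options listed above (defaults unknown)."""
    p = argparse.ArgumentParser(description="Sketch of the DVP command-line options")
    p.add_argument("--task", type=str, help="Name of task")
    p.add_argument("--input", type=str, help="Dir of input video")
    p.add_argument("--processed", type=str, help="Dir of processed video")
    p.add_argument("--output", type=str, help="Dir of output video")
    # Assumption: on/off switches take 0 or 1, as in `--with_IRT 1`.
    p.add_argument("--use_gpu", type=int, help="Use gpu or not")
    p.add_argument("--loss", choices=["perceptual", "l1", "l2"],
                   help="Chooses which loss to use")
    # The listing shows only `unet` as an accepted choice.
    p.add_argument("--network", choices=["unet"], help="Chooses which model to use")
    p.add_argument("--coarse_to_fine_speedup", type=int,
                   help="Use coarse_to_fine_speedup for training")
    p.add_argument("--with_IRT", type=int,
                   help="Set to 1 to solve multimodal inconsistency")
    p.add_argument("--IRT_initialization", type=int, help="Use initialization for IRT")
    p.add_argument("--large_video", type=int,
                   help="Set to 1 when the video has many frames")
    p.add_argument("--save_freq", type=int, help="Save frequency of epochs")
    p.add_argument("--max_epoch", type=int, help="The max number of epochs for training")
    p.add_argument("--format", type=str, help="Format of output image")
    return p


if __name__ == "__main__":
    # Parse a fixed example rather than sys.argv so the sketch runs anywhere.
    args = build_parser().parse_args(["--task", "my_task", "--with_IRT", "1"])
    print(args.task, args.with_IRT)
```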

Citation

If you find this work useful for your research, please cite:

@inproceedings{lei2020dvp,
  title={Blind Video Temporal Consistency via Deep Video Prior},
  author={Lei, Chenyang and Xing, Yazhou and Chen, Qifeng},
  booktitle={Advances in Neural Information Processing Systems},
  year={2020}
}

Contact

Please contact me if you have any questions (Chenyang Lei, [email protected]).

Beyond the tasks we evaluated

Researchers have found that blind temporal consistency methods (e.g., DVP) can be applied to many more tasks!

Video segmentation: "AuxAdapt: Stable and Efficient Test-Time Adaptation for Temporally Consistent Video Semantic Segmentation"
Video denoising: "Neural Radiance Flow for 4D View Synthesis and Video Processing"
Low-light video enhancement: "Learning Temporal Consistency for Low Light Video Enhancement from Single Images"

Owner

Chenyang LEI
