GitHub repository for "Improving Video Generation for Multi-functional Applications"

Last update: Dec 07, 2022

Improving Video Generation for Multi-functional Applications

Paper Link

For more information please refer to our homepage.

Requirements

TensorFlow 1.2.1
Python 2.7
ffmpeg

Data Format

Videos are stored as JPEGs of vertically stacked frames. Every frame must be at least 64x64 pixels, and each video must contain between 16 and 32 frames. For an example dataset, see: http://carlvondrick.com/tinyvideo/#data
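The constraints above can be checked before training. The following is a minimal sketch, not part of the repository: the function name count_frames and its parameters are hypothetical, and it assumes you already know the pixel dimensions of the stacked JPEG and the height of a single frame.

```python
def count_frames(image_width, image_height, frame_height,
                 min_side=64, min_frames=16, max_frames=32):
    """Return the number of frames in a vertically stacked clip image.

    Hypothetical helper: raises ValueError if the clip violates the
    size or length limits described in the Data Format section.
    """
    # Each frame spans the full image width and frame_height rows.
    if image_width < min_side or frame_height < min_side:
        raise ValueError("frames must be at least %dx%d pixels"
                         % (min_side, min_side))
    # Frames are stacked vertically, so the total height must be
    # an exact multiple of the single-frame height.
    if image_height % frame_height != 0:
        raise ValueError("image height is not a whole number of frames")
    frames = image_height // frame_height
    if not min_frames <= frames <= max_frames:
        raise ValueError("expected between %d and %d frames, got %d"
                         % (min_frames, max_frames, frames))
    return frames
```

For example, a 128-pixel-wide image that is 4096 pixels tall with 128-pixel frames holds 32 frames, which is within the allowed range.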

Training

python main_train.py

Important Parameters:

mode: one of 'generate', 'predict', 'bw2rgb', 'inpaint', depending on whether you want to generate videos, predict future frames, colorize videos, or do inpainting.
batch_size: recommended value is 64; for colorization use 32 to avoid memory issues.
root_dir: root directory of the dataset.
index_file: file listing all training data clips; it must be located in root_dir, and the clip paths it contains are relative to root_dir.
experiment_name: name of the experiment.
output_every: output the loss to stdout and write a TensorBoard summary every xx steps.
sample_every: generate a visual sample every xx steps.
save_model_very: save the model every xx steps.
recover_model: if true, recover the saved model and continue training.
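An index file as described for index_file can be generated from a directory of clips. The sketch below is an illustration only and is not part of the repository: the function name write_index_file, the default file name index.txt, and the one-path-per-line layout are all assumptions, so check the format against what main_train.py actually reads.

```python
import os


def write_index_file(root_dir, index_name="index.txt",
                     extensions=(".jpg", ".jpeg")):
    """Write clip paths relative to root_dir, one per line (assumed format).

    Hypothetical helper: returns the sorted list of relative paths.
    """
    clips = []
    # Walk the dataset tree and collect every JPEG clip.
    for dirpath, _dirnames, filenames in os.walk(root_dir):
        for name in filenames:
            if name.lower().endswith(extensions):
                full_path = os.path.join(dirpath, name)
                # Store the path relative to root_dir.
                clips.append(os.path.relpath(full_path, root_dir))
    clips.sort()
    # The index file itself is placed inside root_dir.
    with open(os.path.join(root_dir, index_name), "w") as handle:
        for path in clips:
            handle.write(path + "\n")
    return clips
```

After running this on the dataset directory, root_dir would point at that directory and index_file at the generated file.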

Owner

Bernhard Kratzwald
