[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Last update: Dec 30, 2022

Overview

Source code of the CVPR'2022 paper "Thin-Plate Spline Motion Model for Image Animation"

Example animation

PS: The paper trains the model for 100 epochs for a fair comparison. You can use more data and train for more epochs to get better performance.

Web demo for animation

Try the web demo for animation here:
Google Colab:

Pre-trained models

Installation

Python 3 is supported (Python 3.9 is recommended). To install the dependencies, run:

pip install -r requirements.txt

YAML configs

The config folder contains one configuration file per dataset, named config/dataset_name.yaml.

See config/taichi-256.yaml for a description of the parameters.
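
As a small illustration of the naming convention above, the following Python sketch maps a dataset name to its config path and lists the configs found in a folder. The helper names `config_path` and `available_datasets` are hypothetical and not part of this repository; only the config/dataset_name.yaml pattern comes from the text above.

```python
from pathlib import Path


def config_path(dataset_name, config_dir="config"):
    """Return <config_dir>/<dataset_name>.yaml (hypothetical helper)."""
    return Path(config_dir) / f"{dataset_name}.yaml"


def available_datasets(config_dir="config"):
    """List dataset names that have a .yaml file in config_dir, sorted by name."""
    return sorted(p.stem for p in Path(config_dir).glob("*.yaml"))
```

For example, `config_path("taichi-256")` yields the path config/taichi-256.yaml mentioned above.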

Datasets

MGif. Follow the instructions from Monkey-Net.
TaiChiHD and VoxCeleb. Follow instructions from video-preprocessing.
TED-talks. Follow instructions from MRAA.

Training

To train a model on a specific dataset, run:

CUDA_VISIBLE_DEVICES=0,1 python run.py --config config/dataset_name.yaml --device_ids 0,1

A log folder named after the timestamp will be created. Checkpoints, loss values, and reconstruction results will be saved to this folder.
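
When launching training from a script rather than a shell, the same invocation can be assembled in Python. This is a minimal sketch, not part of the repository: `build_train_command` is a hypothetical helper, and it assumes the first `num_gpus` GPUs are used, so that CUDA_VISIBLE_DEVICES and --device_ids carry the same list, as in the command above.

```python
import os
import sys


def build_train_command(config, num_gpus):
    """Return (argv, env) for a training run on GPUs 0..num_gpus-1.

    Hypothetical helper: it only uses the --config and --device_ids
    flags shown in the training command above.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    ids = ",".join(str(i) for i in range(num_gpus))
    # Copy the current environment and restrict the visible GPUs.
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=ids)
    argv = [sys.executable, "run.py", "--config", config, "--device_ids", ids]
    return argv, env
```

From the repository root, passing the result to `subprocess.run(argv, env=env, check=True)` should launch the same run as the shell command, using the current Python interpreter.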

Training AVD network

To train the AVD network on a specific dataset, run:

CUDA_VISIBLE_DEVICES=0 python run.py --mode train_avd --checkpoint '{checkpoint_folder}/checkpoint.pth.tar' --config config/dataset_name.yaml

Checkpoints, loss values, and reconstruction results will be saved to {checkpoint_folder}.

Evaluation on video reconstruction

To evaluate the reconstruction performance, run:

CUDA_VISIBLE_DEVICES=0 python run.py --mode reconstruction --config config/dataset_name.yaml --checkpoint '{checkpoint_folder}/checkpoint.pth.tar'

A reconstruction subfolder will be created in {checkpoint_folder}, and the generated videos will be stored there. The generated videos will also be saved in a png subfolder in lossless '.png' format for evaluation. To compute metrics, follow the instructions from pose-evaluation.
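
Before computing metrics, it can help to confirm that the evaluation run actually produced PNG output. The sketch below is a hypothetical helper, not repository code, and it assumes the png subfolder sits inside the reconstruction subfolder; adjust the path if your layout differs.

```python
from pathlib import Path


def reconstruction_pngs(checkpoint_folder):
    """Return the sorted .png files written by a reconstruction run.

    Assumption: the files live in <checkpoint_folder>/reconstruction/png.
    Returns an empty list if that folder does not exist.
    """
    png_dir = Path(checkpoint_folder) / "reconstruction" / "png"
    if not png_dir.is_dir():
        return []
    return sorted(png_dir.glob("*.png"))
```

An empty result suggests the reconstruction step has not been run for that checkpoint folder, or that the output layout differs from the assumption above.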

Image animation demo

Notebook: open demo.ipynb, edit the config cell, and run it to animate an image.
Python:

CUDA_VISIBLE_DEVICES=0 python demo.py --config config/vox-256.yaml --checkpoint checkpoints/vox.pth.tar --source_image ./source.jpg --driving_video ./driving.mp4
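
The demo takes four input files, and a missing one is easier to diagnose before the model loads. The following sketch checks the inputs and assembles the argument list; `build_demo_command` is a hypothetical helper, and it only uses the flags shown in the command above.

```python
import sys
from pathlib import Path


def build_demo_command(config, checkpoint, source_image, driving_video):
    """Check that all demo inputs exist, then return the argv list for demo.py."""
    inputs = (config, checkpoint, source_image, driving_video)
    missing = [p for p in inputs if not Path(p).is_file()]
    if missing:
        raise FileNotFoundError("missing input file(s): " + ", ".join(missing))
    return [
        sys.executable, "demo.py",
        "--config", config,
        "--checkpoint", checkpoint,
        "--source_image", source_image,
        "--driving_video", driving_video,
    ]
```

The returned list can be passed to `subprocess.run` from the repository root, with CUDA_VISIBLE_DEVICES set in the environment as in the shell command.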

Acknowledgments

The main code is based on FOMM and MRAA.

Thanks for the excellent work!

Thanks to iperov, this work has been integrated into DeepFaceLive.

Owner

yoyo-nb