This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Last update: Dec 29, 2022

Related tags

Deep Learning AD-NeRF

Overview

AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis

| Project Page | Paper |

PyTorch implementation for the paper "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis"

Prerequisites

You can create an anaconda environment called adnerf with:

conda env create -f environment.yml
conda activate adnerf

PyTorch3D

Recommend install from a local clone

git clone https://github.com/facebookresearch/pytorch3d.git
cd pytorch3d && pip install -e .

Basel Face Model 2009

Put "01_MorphableModel.mat" to data_util/face_tracking/3DMM/; cd data_util/face_tracking; run
```
python convert_BFM.py
```

Train AD-NeRF

Data Preprocess ($id Obama for example)
```
bash process_data.sh Obama
```
- Input: A portrait video at 25fps containing voice audio. (dataset/vids/$id.mp4)
- Output: folder dataset/$id that contains all files for training
Train Two NeRFs (Head-NeRF and Torso-NeRF)
- Train Head-NeRF with command
```
python NeRFs/HeadNeRF/run_nerf.py --config dataset/$id/HeadNeRF_config.txt
```
- Copy latest trainied model from dataset/$id/logs/$id_head to dataset/$id/logs/$id_com
- Train Torso-NeRF with command
```
python NeRFs/TorsoNeRF/run_nerf.py --config dataset/$id/TorsoNeRF_config.txt
```

Run AD-NeRF for rendering

Reconstruct original video with audio input

python NeRFs/TorsoNeRF/run_nerf.py --config dataset/$id/TorsoNeRFTest_config.txt --aud_file=dataset/$id/aud.npy --test_size=300

Drive the target person with another audio input

python NeRFs/TorsoNeRF/run_nerf.py --config dataset/$id/TorsoNeRFTest_config.txt --aud_file=${deepspeechfile.npy} --test_size=-1

Acknowledgments

We use face-parsing.PyTorch for parsing head and torso maps, and DeepSpeech for audio feature extraction. The NeRF model is implemented based on NeRF-pytorch.

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Related tags

Overview

AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis

| Project Page | Paper |

Prerequisites

Train AD-NeRF

Run AD-NeRF for rendering

Acknowledgments

Owner

Speech Emotion Recognition with Fusion of Acoustic- and Linguistic-Feature-Based Decisions

This repository is the official implementation of Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning (NeurIPS21).

Unofficial pytorch implementation of 'Image Inpainting for Irregular Holes Using Partial Convolutions'

CLIPort: What and Where Pathways for Robotic Manipulation

A list of awesome PyTorch scholarship articles, guides, blogs, courses and other resources.

🧠 A PyTorch implementation of 'Deep CORAL: Correlation Alignment for Deep Domain Adaptation.', ECCV 2016

Get 2D point positions (e.g., facial landmarks) projected on 3D mesh

How Do Adam and Training Strategies Help BNNs Optimization? In ICML 2021.

"Neural Turing Machine" in Tensorflow

PyTorch implementation of paper "StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement" (ICCV 2021 Oral)

Cave Generation using metaballs in Blender. Originally created by sdfgeoff, Edited by Myself (Archie Jaskowicz).

Sdf sparse conv - Deep Learning on SDF for Classifying Brain Biomarkers

This is an easy python software which allows to sort images with faces by gender and after by age.

Code basis for the paper "Camera Condition Monitoring and Readjustment by means of Noise and Blur" (2021)

Deep learning library featuring a higher-level API for TensorFlow.

NAS-HPO-Bench-II is the first benchmark dataset for joint optimization of CNN and training HPs.

FAMIE is a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction (IE)

Official Code for "Non-deep Networks"

Machine Learning Framework for Operating Systems - Brings ML to Linux kernel

Online-compatible Unsupervised Non-resonant Anomaly Detection Repository