3D Avatar Lip Syncronization from speech (JALI based face-rigging)

Last update: Dec 20, 2022

Overview

visemenet-inference

Inference Demo of "VisemeNet-tensorflow"
- VisemeNet is an audio-driven animator centric speech animation driving a JALI or standard FACS-based face-rigging from input audio.
- The original repo is outdated and difficult to setup the environment for testing the pretrained model. This code is to provide a super-clean inference module based on the original author's repo.

How to freeze graph

This repo does not need bazel-build for "freeze-graph" function
Thanks to https://github.com/lighttransport/VisemeNet-infer for giving some examples.

Requirements

Python 3.6.x using "pyenv"
Tensorflow 1.1.0

Setup the envs and packages

# Install Virtualenv using pyenv
pyenv install 3.6.5
pyenv virtualenv 3.6.5 visemenet-freeze
pyenv activate visemenet-freeze

# Install packages
pip install tensorflow==1.1.0

Clone the repo

# Clone Visemenet repo and the pretrained model
git clone https://github.com/yzhou359/VisemeNet_tensorflow.git
curl -L https://www.dropbox.com/sh/7nbqgwv0zz8pbk9/AAAghy76GVYDLqPKdANcyDuba?dl=0 > pretrained_model.zip
unzip prtrained_model.zip -d VisemeNet_tensorflow/data/ckpt/pretrain_biwi/

Freeze Graph and Save as pb

# Freeze Graph
python freeze_graph.py

Model Inference

Colab Demo

This code provides the simple and clean inference code without any needless ones
It's compatible with TF 2.0 Version

Requirements

Tensorflow 2.x
numpy
scipy
python_speech_features

How to run inference

import numpy as np
from inference import VisemeRegressor

pb_filepath = "./visemenet_frozen.pb"
wav_file_path = "./test_audio.wav"
out_txt_path = "./maya_viseme_outputs.txt"

viseme_regressor = VisemeRegressor(pb_filepath=pb_filepath)

viseme_outputs = viseme_regressor.predict_outputs(wav_file_path=wav_file_path)

np.savetxt(out_txt_path, viseme_outputs, '%.4f')

3D Avatar Lip Syncronization from speech (JALI based face-rigging)

Related tags

Overview

visemenet-inference

How to freeze graph

Requirements

Model Inference

Requirements

How to run inference

Owner

Junhwan Jang

A booklet on machine learning systems design with exercises

Point detection through multi-instance deep heatmap regression for sutures in endoscopy

Source code, datasets and trained models for the paper Learning Advanced Mathematical Computations from Examples (ICLR 2021), by François Charton, Amaury Hayat (ENPC-Rutgers) and Guillaume Lample

Tensors and neural networks in Haskell

RuleBERT: Teaching Soft Rules to Pre-Trained Language Models

Code for our paper Domain Adaptive Semantic Segmentation with Self-Supervised Depth Estimation

MvtecAD unsupervised Anomaly Detection

Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering

Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

DI-HPC is an acceleration operator component for general algorithm modules in reinforcement learning algorithms

Pytorch implementation of BRECQ, ICLR 2021

ISBI 2022: Cross-level Contrastive Learning and Consistency Constraint for Semi-supervised Medical Image.

A real world application of a Recurrent Neural Network on a binary classification of time series data

This is the pytorch implementation of the paper - Axiomatic Attribution for Deep Networks.

Lua-parser-lark - An out-of-box Lua parser written in Lark

Repositorio oficial del curso IIC2233 Programación Avanzada 🚀✨

🦕 NanoSaur is a little tracked robot ROS2 enabled, made for an NVIDIA Jetson Nano

TensorLight - A high-level framework for TensorFlow

Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier

机器学习、深度学习、自然语言处理等人工智能基础知识总结。