An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".

Last update: Jun 16, 2022

Overview

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

This is an unofficial implementation of AutoVC, based on the official implementation.

The repository is still under construction, so some details may be missing or incomplete.

Preprocessing

python preprocess.py <data_path> <save_path> <encoder_path> [--seg_len seg] [--n_workers workers]
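
For readers who want to see how the argument layout above maps onto a parser, here is a minimal sketch using `argparse`. It only mirrors the positional arguments and flags shown in the usage line; the integer types, the `None` defaults, and the example paths are assumptions for illustration, and the repository's actual `preprocess.py` may define them differently.

```python
import argparse


def build_preprocess_parser():
    """Build a parser mirroring the usage line of preprocess.py (sketch only)."""
    parser = argparse.ArgumentParser(prog="preprocess.py")
    # Positional arguments, in the order given in the usage line.
    parser.add_argument("data_path")
    parser.add_argument("save_path")
    parser.add_argument("encoder_path")
    # Optional flags; the int type and None default are assumptions.
    parser.add_argument("--seg_len", type=int, default=None)
    parser.add_argument("--n_workers", type=int, default=None)
    return parser


# Hypothetical paths, used only to show how the arguments are parsed.
args = build_preprocess_parser().parse_args(
    ["data/wavs", "data/features", "encoder.pt", "--seg_len", "128"]
)
print(args.data_path, args.save_path, args.encoder_path, args.seg_len, args.n_workers)
```

Flags that are omitted on the command line (here `--n_workers`) simply keep their default in this sketch.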

Training

python train.py <config> <data_path> <save_path> [--n_steps steps] [--save_steps save] [--log_steps log] [--batch_size batch] [--seg_len seg]
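
When launching training from another script, it can help to assemble the command programmatically. The sketch below builds the argument list for the usage line above; `build_train_command` is a hypothetical helper that is not part of the repository, and the file names and option values are placeholders.

```python
import sys

# Optional flags listed in the usage line of train.py.
TRAIN_OPTIONS = {"n_steps", "save_steps", "log_steps", "batch_size", "seg_len"}


def build_train_command(config, data_path, save_path, **options):
    """Return the argument list for train.py (hypothetical helper, sketch only)."""
    unknown = set(options) - TRAIN_OPTIONS
    if unknown:
        raise ValueError(f"unknown options: {sorted(unknown)}")
    command = [sys.executable, "train.py", config, data_path, save_path]
    for name, value in options.items():
        if value is not None:  # skip options left unset
            command += [f"--{name}", str(value)]
    return command


# Placeholder values; substitute your own config, data, and output paths.
command = build_train_command(
    "config.yaml", "data/features", "checkpoints", n_steps=100000, batch_size=2
)
print(" ".join(command))
```

The resulting list can be passed to `subprocess.run(command, check=True)` from the repository root. Rejecting unknown option names early avoids starting a run with a mistyped flag.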

Reference

Please cite the paper if you find this work useful.

@InProceedings{pmlr-v97-qian19c,
  title = {{A}uto{VC}: Zero-Shot Voice Style Transfer with Only Autoencoder Loss},
  author = {Qian, Kaizhi and Zhang, Yang and Chang, Shiyu and Yang, Xuesong and Hasegawa-Johnson, Mark},
  pages = {5210--5219},
  year = {2019},
  editor = {Kamalika Chaudhuri and Ruslan Salakhutdinov},
  volume = {97},
  series = {Proceedings of Machine Learning Research},
  address = {Long Beach, California, USA},
  month = {09--15 Jun},
  publisher = {PMLR},
  pdf = {http://proceedings.mlr.press/v97/qian19c/qian19c.pdf},
  url = {http://proceedings.mlr.press/v97/qian19c.html}
}

Owner

Chien-yu Huang
