Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Last update: Jan 01, 2023

Overview

RAVE: Realtime Audio Variational autoEncoder

Official implementation of RAVE: A variational autoencoder for fast and high-quality neural audio synthesis (article link) by Antoine Caillon and Philippe Esling.

If you use RAVE as part of a music performance or installation, be sure to cite either this repository or the article!

Installation

RAVE requires Python 3.9. Install the dependencies using:

pip install -r requirements.txt

Detailed instructions for setting up a training station for this project are available here.

Preprocessing

RAVE comes with two command-line utilities, resample and duration. resample pre-processes (silence removal, loudness normalization) and augments (compression) an entire directory of audio files (.mp3, .aiff, .opus, .wav, .aac). duration prints the total duration of a folder of .wav files.
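To make the role of duration concrete, here is a minimal sketch of how the total length of a folder of .wav files could be computed with the Python standard library alone. It is not the code of the duration utility itself; the function names wav_duration and total_duration are hypothetical, and the wave module only reads PCM-encoded files.

```python
import wave
from pathlib import Path


def wav_duration(path):
    """Return the duration of one PCM .wav file in seconds."""
    with wave.open(str(path), "rb") as f:
        # duration = number of audio frames / frames per second
        return f.getnframes() / f.getframerate()


def total_duration(folder):
    """Sum the durations of every .wav file found under folder (recursively)."""
    return sum(wav_duration(p) for p in sorted(Path(folder).rglob("*.wav")))
```

Calling total_duration("/path/to/wav/folder") returns the length in seconds, which gives a quick check on how much audio a dataset contains before training.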

Training

Both RAVE and the prior model are available in this repo. For most users we recommend using the cli_helper.py script, which generates a set of instructions for training and exporting both RAVE and the prior model on a specific dataset.

python cli_helper.py

However, if you want to customize your training further, you can use the provided train_{rave, prior}.py and export_{rave, prior}.py scripts manually.

Reconstructing audio

Once the model is trained, you can reconstruct an entire folder of .wav files using:

python reconstruct.py --ckpt /path/to/checkpoint --wav-folder /path/to/wav/folder

You can also export RAVE to a TorchScript file using export_rave.py and call its encode and decode methods on tensors.
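As an illustration, the following sketch shows what an encode/decode round trip on an exported file might look like from Python. Only the encode and decode method names come from the text above; the file name rave.ts, the (batch, channels, samples) input layout, and the helper names reconstruct and main are assumptions made for this example.

```python
def reconstruct(model, x):
    """Encode then decode x with any object exposing encode() and decode()."""
    z = model.encode(x)     # audio -> latent representation
    return model.decode(z)  # latent representation -> audio


def main(path="rave.ts"):  # hypothetical file name for the exported model
    # torch is imported here so that reconstruct() itself has no hard dependency
    import torch

    model = torch.jit.load(path).eval()
    # Assumed input layout: (batch, channels, samples); one mono clip of silence
    x = torch.zeros(1, 1, 2 ** 16)
    with torch.no_grad():
        y = reconstruct(model, x)
    print(x.shape, y.shape)
```

Run main("/path/to/exported/model") with the path produced by export_rave.py; the latent tensor z can also be inspected or modified between the two calls.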

Realtime usage

UPDATE

If you want to use the realtime mode, you should update your dependencies:

pip install -r requirements.txt

RAVE and the prior model can be used in realtime on live audio streams, allowing creative interactions with both models.

nn~

RAVE is compatible with the nn~ external for Max/MSP and PureData.

An audio example of the prior sampling patch is available in the docs/ folder.

RAVE vst

You can also use RAVE as a VST audio plugin with RAVE vst!

Discussion

If you have questions, or want to share your experience with RAVE or musical pieces made with the model, you can use the Discussion tab!

Owner

ACIDS
