Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Last update: Jan 02, 2023

Overview

RAVE: Realtime Audio Variational autoEncoder

Official implementation of RAVE: A variational autoencoder for fast and high-quality neural audio synthesis (article link)

Installation

RAVE requires Python 3.9. Install the dependencies with:

pip install -r requirements.txt
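Since the dependencies are tied to a specific interpreter, it can help to confirm which Python is active in the environment where you run pip. The snippet below is a minimal sketch and not part of the repository: the helper name `python_version_ok` is hypothetical, and it assumes that "Python 3.9" means an exact major.minor match rather than "3.9 or newer".

```python
import sys

# Assumption: the requirement is read as exactly Python 3.9 (major.minor).
REQUIRED = (3, 9)


def python_version_ok(version_info=None, required=REQUIRED):
    """Return True when the major.minor version matches `required`.

    `version_info` defaults to the running interpreter; passing a tuple
    such as (3, 9, 7) makes the check easy to try out by hand.
    """
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) == tuple(required)


if __name__ == "__main__":
    found = "{}.{}".format(*sys.version_info[:2])
    if python_version_ok():
        print(f"Python {found} detected: matches the stated requirement")
    else:
        print(f"Python {found} detected: the README asks for 3.9")
```

Run it with the same interpreter that pip belongs to; if the versions differ, creating a dedicated Python 3.9 environment before installing requirements.txt is the usual remedy.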

Training

Both RAVE and the prior model are available in this repo. For most users, we recommend the cli_helper.py script: it generates a set of instructions for training and exporting both RAVE and the prior model on a specific dataset.

python cli_helper.py

However, if you want finer control over training, you can run the provided train_rave.py, train_prior.py, export_rave.py and export_prior.py scripts manually.

Realtime usage

[NOT AVAILABLE YET]

RAVE and the prior model can be used in realtime inside Max/MSP, allowing creative interactions with both models. Code and details about this part of the project are not available yet; we are currently working on the corresponding article.

An audio example of the prior sampling patch is available in the docs/ folder.

Owner

Antoine Caillon
