User-friendly Voice Cloning Application

Last update: Dec 30, 2022

Overview

Multi-Language-RTVC stands for Multi-Language Real-Time Voice Cloning. It is a voice cloning tool that extracts speaker-specific audio features from just a few seconds of previously unseen audio and uses them to synthesize speech in that speaker's voice.

License

This code is licensed under the MIT License. For more information on the license model and the associated rights and duties, see the full license text.

Project History

This project was started in 2021 with the goal of continuing Corentin Jemine's Real-Time-Voice-Cloning. It grew out of the wish for multi-language support in voice cloning models and is now maintained and enhanced by volunteer contributors.

Contributing

We welcome everyone interested in the project, from beginners to experts. The MLRTVC community aims for a friendly, open-minded and efficient working climate. We encourage anyone with ideas to take part in the project by sharing them.
There are multiple meaningful ways of contributing:

Developing code (new features, fixes, enhancements)
Writing documentation
Raising issues (bugs, feature requests, enhancement proposals, code refactoring, etc.)
Providing pre-trained models
Participating in community tasks (code reviews, discussions, maintenance, etc.)

For transparency reasons, we ask you to engage with this project through the official channels (issues, pull requests) so that knowledge and questions are shared publicly. Other communication channels (email, chat, etc.) are accepted only where privacy or confidentiality is of great importance.

Further information can be found in the Contributing Guidelines.

Owner

Sven Eschlbeck
