In this project we can see how we can generate automatic music using character RNN.

Overview

Automatic Music Genaration

Table of Contents

  1. Project Description
  2. Approach towards the problem
  3. Limitations
  4. Libraries Used
  5. Summary
  6. Applications
  7. References


1. Project Description

  • Natural Language Processing in Artificial Intelligence is the application of computational techniques to the analysis and synthesis of natural language and speech.

  • For a hypothetical scenario it was assumed that ABC music prod. pvt.ltd is a reknowned audio-video production house based out of Mumbai, India

  • As COVID-19 cases are increasing day by day it is almost impossible for the musicians to coop up with real time studio work.

  • Hence, the company wants you to make an AI based music genaration system.

  • The goal of this project is to make an AI based music genaration system.

  • The key contraint to the problem is accruracy.

  • You have been hired as a freelance data scientist for ABC music prod. pvt.ltd

  • The model should read a text file in abc format.

  • The model should genarate the corresponding music framed out of that note sequence.


2. Approach towards the problem

  • The approach to this project was to make a chat bot from scratch.

  • Initially all the necessary libraries were imported and installed.

  • Then, the data corpus was formed.

  • A data corpus is a collection of linguistic data.

  • Then while performing preprocessing we used lemitization technique in order to process the data into desirable format.

  • Then after preprocessing, we genarated various batches to fit into the model.

  • After that we build a charecter RNN model and trained the model for 100 epochs with a batch size of 16 and a sequence length of 64

  • For every 10 epochs the model would save the updated weights in the main directory

  • We recieved a validation accuracy of 92%

  • Now we genarated the music code which is nothing but the predicted vocabulary

  • After that we went here and we were successfully able to genarate music.

3. Limitation(s)

  • The only limitation of this model is that it is being trained with very less data.

  • However, while getting trained on various intrument data this model can be further enhanced for different instruments as well.

  • We have trained this model for only 100 epochs. As the number of epochs increase it is expected that the accuracy of the model will increase.

4. Libraries Used

Following are the list of libraries that were used for making this project.

  • Python was used as the general purposae programming language

  • Keras was used to perform all Deep Learning operation(s) such as model bulding, compliation and training.

  • numpy was used in order to calcucate numercal operations.

5. Summary

  • In this project an Automatic music genaration system was made from scratch.

  • Here, we recieved a validation accuracy of 92%

  • This project can be widely used for music production systems

  • The only limitation of this model is that it is being trained with very less data.

  • However, while getting trained on various intrument data this model can be further enhanced for different instruments as well.

  • This model can be used for in house music production systems.

  • This can be widely used to automate manual instruments.

  • This can be also used to make automatic VST(virtual studio toolkit) plugins

6. Applications

  • This model can be used for in house music production systems.

  • This can be widely used to automate manual instruments.

  • This can be also used to make automatic VST(virtual studio toolkit) plugins

7. References

Owner
Pronay Ghosh
A data-driven professional who grows day by day. Currently working as a Data Science Researcher at The International School of AI and Data Science.
Pronay Ghosh
An app made in Python using the PyTube and Tkinter libraries to download videos and MP3 audio.

yt-dl (GUI Edition) An app made in Python using the PyTube and Tkinter libraries to download videos and MP3 audio. How do I download this? Windows: Fi

1 Oct 23, 2021
Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

MediumVC MediumVC is an utterance-level method towards any-to-any VC. Before that, we propose SingleVC to perform A2O tasks(Xi → Ŷi) , Xi means utter

谷下雨 47 Dec 25, 2022
a library for audio and music analysis

aubio aubio is a library to label music and sounds. It listens to audio signals and attempts to detect events. For instance, when a drum is hit, at wh

aubio 2.9k Dec 30, 2022
A library for augmenting annotated audio data

muda A library for Musical Data Augmentation. muda package implements annotation-aware musical data augmentation, as described in the muda paper. The

Brian McFee 214 Nov 22, 2022
:notes: Cross-platform music player

Exaile Exaile is a music player with a simple interface and powerful music management capabilities. Features include automatic fetching of album art,

Exaile 327 Dec 19, 2022
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Basic Pitch is a Python library for Automatic Music Transcription (AMT), using lightweight neural network developed by Spotify's Audio Intelligence La

Spotify 1.4k Jan 01, 2023
SomaFM Plugin for Kodi

SomaFM XBMC Plugin This description is a bit outdated. You can simply install this addon by browsing the official repositories from within Kodi. Insta

7 Jan 21, 2022
Python audio and music signal processing library

madmom Madmom is an audio signal processing library written in Python with a strong focus on music information retrieval (MIR) tasks. The library is i

Institute of Computational Perception 1k Dec 26, 2022
Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.

LPC_for_TTS Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm. 基于Levinson-Durbin

Zewang ZHANG 58 Nov 17, 2022
Powerful, simple, audio tag editor for GNU/Linux

puddletag puddletag is an audio tag editor (primarily created) for GNU/Linux similar to the Windows program, Mp3tag. Unlike most taggers for GNU/Linux

341 Dec 26, 2022
User-friendly Voice Cloning Application

Multi-Language-RTVC stands for Multi-Language Real Time Voice Cloning and is a Voice Cloning Tool capable of transfering speaker-specific audio featur

Sven Eschlbeck 19 Dec 30, 2022
Voicefixer aims at the restoration of human speech regardless how serious its degraded.

Voicefixer aims at the restoration of human speech regardless how serious its degraded.

Leo 324 Dec 26, 2022
Deep learning transformer model that generates unique music sequences.

music-ai Deep learning transformer model that generates unique music sequences. Abstract In 2017, a new state-of-the-art was published for natural lan

xacer 6 Nov 19, 2022
BART aids transcribe tasks by taking a source audio file and creating automatic repeated loops, allowing transcribers to listen to fragments multiple times

BART (Beyond Audio Replay Technology) aids transcribe tasks by taking a source audio file and creating automatic repeated loops, allowing transcribers to listen to fragments multiple times (with poss

2 Feb 04, 2022
The project aims to develop a personal-assistant for Windows & Linux-based systems

The project aims to develop a personal-assistant for Windows & Linux-based systems. Samiksha draws its inspiration from virtual assistants like Cortana for Windows, and Siri for iOS. It has been desi

SHUBHANSHU RAI 1 Jan 16, 2022
A python library for working with praat, textgrids, time aligned audio transcripts, and audio files.

praatIO Questions? Comments? Feedback? A library for working with praat, time aligned audio transcripts, and audio files that comes with batteries inc

Tim 224 Dec 19, 2022
Real-Time Spherical Microphone Renderer for binaural reproduction in Python

ReTiSAR Implementation of the Real-Time Spherical Microphone Renderer for binaural reproduction in Python [1][2]. Contents: | Requirements | Setup | Q

Division of Applied Acoustics at Chalmers University of Technology 51 Dec 17, 2022
Xbot-Music - Bot Play Music and Video in Voice Chat Group Telegram

XBOT-MUSIC A Telegram Music+video Bot written in Python using Pyrogram and Py-Tg

Fariz 2 Jan 20, 2022
Datamoshing with FFmpeg

ffmosher Datamoshing with FFmpeg Drag and drop video onto mosh.bat to create a datamoshed video. To datamosh an image, please ensure the file is in a

18 Sep 11, 2022
This is an AI that runs in the terminal. It is a voice assistant that can do common activities and can also help in your coding doubts like

This is an AI that runs in the terminal. It is a voice assistant that can do common activities and can also help in your coding doubts like

OneBit 1 Nov 05, 2021