Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

Last update: Nov 17, 2022

Overview

Building Shazam from scratch

In this repository we tried to implement a simplified copy of the Shazam application able to tell you the name of a song listening to a short sample.

Overview

Converting the songs from mp3 to wav with Librosa and extraction of the peaks
MinHashing with permutations on the shingles matrix
Locality sensitive hashing to divide the songs in buckets
Shazam!

pickle is a folder that contains the songs peaks, the shingles array and the shingle matrix in pickle format.
ShazamLSH.ipynb is the main notebook that only contains the explanation of the steps and some comments
function.py contains all the implemented function needed to execute the notebook

Resources

This is the dataset we used and processed:

https://www.kaggle.com/dhrumil140396/mp3s32k

We also share some useful links can help to understand what is the process behind Min Hashing and LSH in order to recognise song:

Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

Related tags

Overview

Building Shazam from scratch

Overview

Contents

Resources

Owner

Arturo Ghinassi

The (Official) PyTorch Implementation of the paper "Deep Extraction of Manga Structural Lines"

Code for paper Decoupled Dynamic Spatial-Temporal Graph Neural Network for Traffic Forecasting

Personals scripts using ageitgey/face_recognition

Generative Flow Networks for Discrete Probabilistic Modeling

PyImpetus is a Markov Blanket based feature subset selection algorithm that considers features both separately and together as a group in order to provide not just the best set of features but also the best combination of features

PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision

PyTorch implementation of our method for adversarial attacks and defenses in hyperspectral image classification.

Generic Event Boundary Detection: A Benchmark for Event Segmentation

[WWW 2021] Source code for "Graph Contrastive Learning with Adaptive Augmentation"

Classic Papers for Beginners and Impact Scope for Authors.

GAN Image Generator and Characterwise Image Recognizer with python

2020 CCF大数据与计算智能大赛-非结构化商业文本信息中隐私信息识别-第7名方案

An off-line judger supporting distributed problem repositories

details on efforts to dump the Watermelon Games Paprium cart

Companion repository to the paper accepted at the 4th ACM SIGSPATIAL International Workshop on Advances in Resilient and Intelligent Cities

Official implementation for paper Render In-between: Motion Guided Video Synthesis for Action Interpolation

Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection

Only valid pull requests will be allowed. Use python only and readme changes will not be accepted.

SynNet - synthetic tree generation using neural networks

Code samples for my book "Neural Networks and Deep Learning"