A fast implementation of bss_eval metrics for blind source separation

Last update: Dec 13, 2022

Overview

fast_bss_eval

Do you have a zillion BSS audio files to process and it is taking days ? Is your simulation never ending ?

Fear no more! fast_bss_eval is here to help you!

fast_bss_eval is a fast implementation of the bss_eval metrics for the evaluation of blind source separation. Our implementation of the bss_eval metrics has the following advantages compared to other existing ones.

seamlessly works with both numpy arrays and pytorch tensors
very fast
can be even faster by using an iterative solver (add use_cg_iter=10 option to the function call)
differentiable via pytorch
can run on GPU via pytorch

Author

Robin Scheibler

Quick Start

Install

# from pypi
pip install fast-bss-eval

# or from source
git clone https://github.com/fakufaku/fast_bss_eval
cd fast_bss_eval
pip install -e .

Use

Assuming you have multichannel signals for the estmated and reference sources stored in wav format files names my_estimate_file.wav and my_reference_file.wav, respectively, you can quickly evaluate the bss_eval metrics as follows.

from scipy.io import wavfile
import fast_bss_eval

# open the files, we assume the sampling rate is known
# to be the same
fs, ref = wavfile.read("my_reference_file.wav")
_, est = wavfile.read("my_estimate_file.wav")

# compute the metrics
sdr, sir, sar, perm = fast_bss_eval.bss_eval_sources(ref.T, est.T)

Benchmark

This package is significantly faster than other packages that also allow to compute bss_eval metrics such as mir_eval or sigsep/bsseval. We did a benchmark using numpy/torch, single/double precision floating point arithmetic (fp32/fp64), and using either Gaussian elimination or a conjugate gradient descent (solve/CGD10).

Citation

If you use this package in your own research, please cite our paper describing it.

@misc{scheibler_sdr_2021,
  title={SDR --- Medium Rare with Fast Computations},
  author={Robin Scheibler},
  year={2021},
  eprint={2110.06440},
  archivePrefix={arXiv},
  primaryClass={eess.AS}
}

License

2021 (c) Robin Scheibler, LINE Corporation

This code is released under MIT License.

A fast implementation of bss_eval metrics for blind source separation

Related tags

Overview

fast_bss_eval

Author

Quick Start

Install

Use

Benchmark

Citation

License

Owner

Robin Scheibler

[CVPR 2021] MiVOS - Scribble to Mask module

Non-stationary GP package written from scratch in PyTorch

🐤 Nix-TTS: An Incredibly Lightweight End-to-End Text-to-Speech Model via Non End-to-End Distillation

Lucid Sonic Dreams syncs GAN-generated visuals to music.

Use evolutionary algorithms instead of gridsearch in scikit-learn

[IEEE TPAMI21] MobileSal: Extremely Efficient RGB-D Salient Object Detection [PyTorch & Jittor]

Pytorch Implementation of PointNet and PointNet++++

SAMO: Streaming Architecture Mapping Optimisation

Implementation of CVPR 2021 paper "Spatially-invariant Style-codes Controlled Makeup Transfer"

Companion repo of the UCC 2021 paper "Predictive Auto-scaling with OpenStack Monasca"

Jittor 64*64 implementation of StyleGAN

This is a simple backtesting framework to help you test your crypto currency trading. It includes a way to download and store historical crypto data and to execute a trading strategy.

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Github for the conference paper GLOD-Gaussian Likelihood OOD detector

[CVPR 2022] Back To Reality: Weak-supervised 3D Object Detection with Shape-guided Label Enhancement

CryptoFrog - My First Strategy for freqtrade

Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch

This code is a near-infrared spectrum modeling method based on PCA and pls

Get started with Machine Learning with Python - An introduction with Python programming examples

Chinese clinical named entity recognition using pre-trained BERT model