This is the source code for the experiments related to the paper Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

Last update: Oct 19, 2022

Related tags

Deep Learning umss

Overview

Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

This is the source code for the experiments related to the paper Unsupervised Audio Source Separation Using Differentiable Parametric Source Models.

It contains a re-implementation of parts of the DDSP library in PyTorch. We added a differentiable all-pole filter which can be parameterized by line spectral frequencies or reflection coefficients.

Please cite the paper, if you use parts of the code in your work.

Links

🔊 Audio examples

📄 Paper

Requirements

The following packages are required:

pytorch==1.6.0
matplotlib==3.3.1
python-sounddevice==0.4.0
scipy==1.5.2
torchaudio=0.6.0
tqdm==4.49.0
pysoundfile==0.10.3
librosa==0.8.0
scikit-learn==0.23.2
tensorboard==2.3.0
resampy==0.2.2
pandas==1.2.3
tensorboard==2.3.0

Training

python train.py -c config.txt

python train_u_nets.py -c unet_config.txt

Evaluation

python eval.py --tag 'TAG' --f0-from-mix --test-set 'CSD'

Acknowledgment

This project has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 765068.

This is the source code for the experiments related to the paper Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

Related tags

Overview

Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

Links

Requirements

Training

Evaluation

Acknowledgment

Copyright

Owner

Code for AutoNL on ImageNet (CVPR2020)

ScaleNet: A Shallow Architecture for Scale Estimation

Python implementation of 3D facial mesh exaggeration using the techniques described in the paper: Computational Caricaturization of Surfaces.

Vision Transformer for 3D medical image registration (Pytorch).

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

BlueFog Tutorials

This repository contains a PyTorch implementation of the paper Learning to Assimilate in Chaotic Dynamical Systems.

NeROIC: Neural Object Capture and Rendering from Online Image Collections

Unsupervised captioning - Code for Unsupervised Image Captioning

EMNLP 2021 Findings' paper, SCICAP: Generating Captions for Scientific Figures

A certifiable defense against adversarial examples by training neural networks to be provably robust

Bagua is a flexible and performant distributed training algorithm development framework.

This repository gives an example on how to preprocess the data of the HECKTOR challenge

This repository contains the code for EMNLP-2021 paper "Word-Level Coreference Resolution"

Breaking the Dilemma of Medical Image-to-image Translation

YOLO-v5 기반 단안 카메라의 영상을 활용해 차간 거리를 일정하게 유지하며 주행하는 Adaptive Cruise Control 기능 구현

Quantized tflite models for ailia TFLite Runtime

Code for the paper "M2m: Imbalanced Classification via Major-to-minor Translation" (CVPR 2020)

Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval

Reinforcement learning models in ViZDoom environment