Optimizing synthesizer parameters using gradient approximation

NASH 2021 Hackathon!

These are some experiments I conducted during NASH 2021, the Neural Audio Synthesis Hackathon that took place on the 18th & 19th of December.

Over the weekend I explored implementing gradient approximation for torchsynth, so that synthesizers could be included in deep learning models & training without having to have the full synth be differentiable. It uses simultaneous perturbation stochastic approximation (SPSA) to estimate the gradients for synthesizer parameters. This technique was used by Marco A. Martínez Ramírez et al. in their work on Differentiable Signal Processing With Black-Box Audio Effects.

I was able to start optimizing on a few parameters for a simple synthesizer, but ran into issues as soon as oscillator tuning or FM was introduced. There is a known issue with audio loss functions for calculating loss with pitch (Turian and Henry, 2020), so this is not surprising.

Nonetheless, techniques like SPSA seem promising for including traditional DSP synthesis into neural nets and deep learning!

Fun weekend puttering around with this! Thank you to Ben Hayes for organing the event.

Optimizing synthesizer parameters using gradient approximation

Related tags

Overview

Optimizing synthesizer parameters using gradient approximation

NASH 2021 Hackathon!

Owner

Jordie Shier

InferPy: Deep Probabilistic Modeling with Tensorflow Made Easy

Keras implementations of Generative Adversarial Networks.

Lightweight stereo matching network based on MobileNetV1 and MobileNetV2

Kohei's 5th place solution for xview3 challenge

Implementation supporting the ICCV 2017 paper "GANs for Biological Image Synthesis"

Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)

Using knowledge-informed machine learning on the PRONOSTIA (FEMTO) and IMS bearing data sets. Predict remaining-useful-life (RUL).

exponential adaptive pooling for PyTorch

The full training script for Enformer (Tensorflow Sonnet) on TPU clusters

🥇 LG-AI-Challenge 2022 1위 솔루션 입니다.

BuildingNet: Learning to Label 3D Buildings

On the model-based stochastic value gradient for continuous reinforcement learning

A simple Neural Network that predicts the label for a series of handwritten digits

a delightful machine learning tool that allows you to train, test and use models without writing code

Source code related to the article submitted to the International Conference on Computational Science ICCS 2022 in London

DataCLUE: 国内首个以数据为中心的AI测评（含模型分析报告）

Self-Supervised Monocular DepthEstimation with Internal Feature Fusion(arXiv), BMVC2021

Official PyTorch Implementation of paper "NeLF: Neural Light-transport Field for Single Portrait View Synthesis and Relighting", EGSR 2021.

OpenVINO黑客松比赛项目

Visual dialog agents with pre-trained vision-and-language encoders.