E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Last update: Dec 15, 2022

Overview

End-to-end Music Remastering System

This repository includes source code and pre-trained models of the work End-to-end Music Remastering System Using Self-supervised and Adversarial Training by Junghyun Koo, Seungryeol Paik, and Kyogu Lee.

We provide inference code of the proposed system, which targets to alter the mastering style of a song to desired reference track.

Pre-trained Models

Model	Number of Epochs Trained	Details
Music Effects Encoder	1000	Trained with MTG-Jamendo Dataset
Mastering Cloner	1000	Trained with the above pre-trained Music Effects Encoder and Projection Discriminator

Inference

To run the inference code,

Download pre-trained models above and place them under the folder named 'model_checkpoints' (default)
Prepare input and reference tracks under the folder named 'inference_samples' (default).
Target files should be organized as follow:

    "path_to_data_directory"/"song_name_#1"/input.wav
    "path_to_data_directory"/"song_name_#1"/reference.wav
    ...
    "path_to_data_directory"/"song_name_#n"/input.wav
    "path_to_data_directory"/"song_name_#n"/reference.wav

Run 'inference.py'

python inference.py \
    --ckpt_dir "path_to_checkpoint_directory" \
    --data_dir_test "path_to_directory_containing_inference_samples"

Outputs will be stored under the folder 'inference_samples' (default)

Note: The system accepts WAV files of stereo-channeled, 44.1kHZ, and 16-bit rate. Target files shold be named "input.wav" and "reference.wav".

Configurations of each sub-networks

A detailed configuration of each sub-networks can also be found at

Self_Supervised_Music_Remastering_System/configs.yaml

E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Related tags

Overview

End-to-end Music Remastering System

Pre-trained Models

Inference

Configurations of each sub-networks

Owner

Junghyun (Tony) Koo

Deep Learning Theory

CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images

Improving Query Representations for DenseRetrieval with Pseudo Relevance Feedback:A Reproducibility Study.

An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come

PSPNet in Chainer

LieTransformer: Equivariant Self-Attention for Lie Groups

The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021

A annotation of yolov5-5.0

This repository collects project-relevant Isabelle/HOL formalizations.

Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.

A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.

Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR

Object detection GUI based on PaddleDetection

Functional TensorFlow Implementation of Singular Value Decomposition for paper Fast Graph Learning

Pairwise learning neural link prediction for ogb link prediction

Human annotated noisy labels for CIFAR-10 and CIFAR-100.

A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.

[3DV 2021] A Dataset-Dispersion Perspective on Reconstruction Versus Recognition in Single-View 3D Reconstruction Networks

This is a TensorFlow implementation for C2-Rec

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition