A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK

Last update: Dec 28, 2022

Related tags

Deep Learning Pytorch-MBNet

Overview

Pytorch-MBNet

A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK

Training

To train a new model, please run train.py, the input arguments are:

--data_path: The path of the directory containing all .wav files of VCC-2018 and the train/dev/test split files (the files in ./data).
--save_dir: The path of the directory to save the trained models. Please create the directory before training.
--total_steps: The total #training step in the training.
--valid_steps: Do the validation every #(valid_steps) of training update.
--log_steps: Log the tensorboard every #(log_steps) of training update.
--update_freq: Gradient accumulation, the default value is 1 (no accumulation).

Testing

To test on VCC-2018, please run test.py, the input arguments are:

--model_path: The path to the saved model.
--idtable_path: The path to the "judge id-number" mapping table file used during training.
--step: The time step for tensorboard log, which can be the same as the training steps.
--split: The valid/test split of data to be used in the testing.

Inference

After training on the VCC data, the model can be utilized to inference on other data. The input arguments are --data_path, --model_path, --save_dir, which are similar to the above. Notice that the bias-net is not used since in this code the ground-truth judge ids are assumed to be unavailable.

The pre-trained model can be found in ./pre_trained.

A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK

Related tags

Overview

Pytorch-MBNet

Training

Testing

Inference

Owner

code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"

Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search

Code for the paper "Learning-Augmented Algorithms for Online Steiner Tree"

A time series processing library

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives

A simple software for capturing human body movements using the Kinect camera.

The aim of the game, as in the original one, is to find a specific image from a group of different images of a person's face

Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)

This package implements THOR: Transformer with Stochastic Experts.

Virtual Dance Reality Stage is a feature that offers you to share a stage with another user virtually.

mPose3D, a mmWave-based 3D human pose estimation model.

The (Official) PyTorch Implementation of the paper "Deep Extraction of Manga Structural Lines"

Deep Latent Force Models

A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.

duralava is a neural network which can simulate a lava lamp in an infinite loop.

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

Computing Shapley values using VAEAC

Code for reproducing our analysis in the paper titled: Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency

Generic image compressor for machine learning. Pytorch code for our paper "Lossy compression for lossless prediction".

CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer