HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Last update: Dec 29, 2022

Related tags

Overview

HiFi++ : a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

This is the unofficial implementation of Vocoder part of HiFi++ : a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement.

Currently, this repo is WIP but you can start your training without any error.

Training:

python train.py --config config_v2.json

Citations:

@misc{https://doi.org/10.48550/arxiv.2203.13086,
  doi = {10.48550/ARXIV.2203.13086},
  
  url = {https://arxiv.org/abs/2203.13086},
  
  author = {Andreev, Pavel and Alanov, Aibek and Ivanov, Oleg and Vetrov, Dmitry},
  
  keywords = {Sound (cs.SD), Machine Learning (cs.LG), Audio and Speech Processing (eess.AS), FOS: Computer and information sciences, FOS: Computer and information sciences, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering},
  
  title = {HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement},
  
  publisher = {arXiv},
  
  year = {2022},
  
  copyright = {arXiv.org perpetual, non-exclusive license}
}

References:

https://github.com/jik876/hifi-gan

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Related tags

Overview

HiFi++ : a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Training:

Citations:

References:

Owner

Rishikesh (ऋषिकेश)

Do you like Quick, Draw? Well what if you could train/predict doodles drawn inside Streamlit? Also draws lines, circles and boxes over background images for annotation.

N-HiTS: Neural Hierarchical Interpolation for Time Series Forecasting

Pytorch and Torch testing code of CartoonGAN

Negative Interactions for Improved Collaborative Filtering:

Framework for Spectral Clustering on the Sparse Coefficients of Learned Dictionaries

Research shows Google collects 20x more data from Android than Apple collects from iOS. Block this non-consensual telemetry using pihole blocklists.

realsense d400 -> jpg + csv

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

[NeurIPS 2020] Code for the paper "Balanced Meta-Softmax for Long-Tailed Visual Recognition"

The Python code for the paper A Hybrid Quantum-Classical Algorithm for Robust Fitting

It is an open dataset for object detection in remote sensing images.

Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue

VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection (ICCV 2021)

A PyTorch Implementation of Gated Graph Sequence Neural Networks (GGNN)

Official PyTorch implementation of "ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows"

Awesome Transformers in Medical Imaging

The world's largest toxicity dataset.

基于pytorch构建cyclegan示例

Hypercomplex Neural Networks with PyTorch