AEC_DeepModel

基于深度学习的声学回声消除基线代码

具体解析见我的博客：https://www.cnblogs.com/LXP-Never/p/14779360.html

数据准备

按照以下文件结构，放好语音，我直接使用的是AEC-Challenge 数据集中的合成数据集

└─Synthetic
    ├─TEST
    │  ├─echo_signal
    │  ├─farend_speech
    │  ├─nearend_mic_signal
    │  └─nearend_speech
    ├─TRAIN
    │  ├─echo_signal
    │  ├─farend_speech
    │  ├─nearend_mic_signal
    │  └─nearend_speech
    └─VAL
        ├─echo_signal
        ├─farend_speech
        ├─nearend_mic_signal
        └─nearend_speech

数据处理脚本为 data_preparation.py

如果想要自己生成回声的话建议使用 RIR-Generator 方法，毕竟很多论文中使用的也是这个方法

运行

python train.py

具体的命令行解析参数见train.py脚本

估计近端语音

python test/model_test.py

点赞，关注，不迷路

我以后还会开源更有价值的内容

参考

语音数据增强及python实现

AEC_DeepModel - Deep learning based acoustic echo cancellation baseline code

Related tags

Overview

AEC_DeepModel

数据准备

运行

估计近端语音

参考

Owner

凌逆战

ByT5: Towards a token-free future with pre-trained byte-to-byte models

🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web.

NeoDays-based tileset for the roguelike CDDA (Cataclysm Dark Days Ahead)

SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples

A natural language modeling framework based on PyTorch

Checking spelling of form elements

Backend for the Autocomplete platform. An AI assisted coding platform.

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

A NLP program: tokenize method, PoS Tagging with deep learning

p-tuning for few-shot NLU task

Training code of Spatial Time Memory Network. Semi-supervised video object segmentation.

自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

CredData is a set of files including credentials in open source projects

Repo for Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization

Prompt-learning is the latest paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks

Flaxformer: transformer architectures in JAX/Flax

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

STonKGs is a Sophisticated Transformer that can be jointly trained on biomedical text and knowledge graphs

Repository to hold code for the cap-bot varient that is being presented at the SIIC Defence Hackathon 2021.