ConferencingSpeech 2021 challenge
This repository contains the datasets list and scripts required for the ConferencingSpeech challenge. For more details about the challenge, please see our website.
Details
-
baseline, this folder contains baseline system include inference model exported by onnx and inference scripts; -
eval, this folder contains evaluation scripts to calculate PESQ, STOI and SI-SNR; -
selected_lists, the selected wave about train speech and noise wave name from aishell-1, aishell-3, librispeech-360, VCTK, MUSAN, Audioset. Each participant is only allowed to use the selected speech and noise data below :selected_lists/dev/circle.namecircle RIR wave utt name of dev setselected_lists/dev/linear.namelinear RIR wave utt name of dev setselected_lists/dev/non_uniform.namenon uniform linear RIR wave utt name of dev setselected_lists/dev/clean.namewave utt name of dev set used clean setselected_lists/dev/noise.namewave utt name of dev set used noise setselected_lists/train/aishell_1.namewave utt name from aishell-1 set used in train setselected_lists/train/aishell_3.namewave utt name from aishell-3 set used in train setselected_lists/train/librispeech_360.namewave utt name from librispeech-360 set used in train setselected_lists/train/vctk.namewave utt name from VCTK set used in train setselected_lists/train/audioset.namewave utt name from Audioset used in train setselected_lists/train/musan.namewave utt name from MUSAN used in train setselected_lists/train/circle.namecircle wave RIR name of train setselected_lists/train/linear.namelinear wave RIR name of train setselected_lists/train/non_uniform.namenon unifrom linear RIR utt name of train set
-
simulation, about simulation scripts, how to use to see ReadMesimulation/mix_wav.pysimulate dev set and train setsimulation/prepare.shuseselected_lists/*/*nameto select used wave from downloaded raw data, or you can select them by yourself scripts.simulation/quick_select.pyquickly select the name by a name list instead ofgrep -r -fsimulation/challenge_rirgenerator.pythe script to simulate RIRs in train and dev setsimulation/data/dev_circle_simu_mix.configdev circle set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start pointsimulation/data/dev_linear_simu_mix.configdev linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start pointsimulation/data/dev_non_uniform_linear_simu_mix.configdev non uniform linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start pointsimulation/data/train_simu_circle.configtrain circle set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point; please download it from dropbox.simulation/data/train_simu_linear.configtrain linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point; please download it from dropbox.simulation/data/train_simu_non_uniform.configtrain non uniform linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point; please download it from dropbox.
-
requirements.txt, dependency
Notes:
1. \*.config file should be replaced with correct path of audio files.
2. Training config files have been released together with challenge data.
Requirements
python3.6 or above
pip install -r requirements.txt
if you simulation RIRs by yourself with our scripts, you may better install this: