ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

Last update: Dec 02, 2022

Related tags

Text Data & NLP ConferencingSpeech2022

Overview

ConferencingSpeech 2022 challenge

This repository contains the datasets list and scripts required for the ConferencingSpeech 2022 challenge. For more details about the challenge, please see our website.

Details

baseline, this folder contains baseline system include inference model exported by inference scripts;
eval, this folder contains evaluation scripts to calculate PLCC, RMSE and SRCC;
data-sets, this folder contains training and development test data-sets provied to the participant;
- Tencent Corpus, this dataset includes about 14,000 speech chinese speech clips with simulated (e.g. codecs, packet-loss, background noise) and live conditions.
- NISQA Corpus, the NISQA Corpus includes more than 14,000 speech samples with simulated (e.g. codecs, packet-loss, background noise) and live (e.g. mobile phone, Zoom, Skype, WhatsApp) conditions.
- IU Bloomington Corpus, there are 10,000 speech signals extracted from COSINE and VOiCESdatasets, each truncated between 3 to 6 seconds long.
- PSTN Corpus, there are about 80,000 speech clips through classic public switched telephone networks, each truncated 10 seconds long.

Requirements

To install requirements install Anaconda and then use:

conda env create -f envs.yml

This will create a new environment with the name "conferencingSpeech". Activate this environment to go on:

conda activate conferencingSpeech

Code license

Apache 2.0

ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

Related tags

Overview

ConferencingSpeech 2022 challenge

Details

Requirements

Code license

Owner

Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.

Quick insights from Zoom meeting transcripts using Graph + NLP

Two-stage text summarization with BERT and BART

This is a project built for FALLABOUT2021 event under SRMMIC, This project deals with NLP poetry generation.

Sentence Embeddings with BERT & XLNet

A workshop with several modules to help learn Feast, an open-source feature store

ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost

A BERT-based reverse dictionary of Korean proverbs

Code for text augmentation method leveraging large-scale language models

Visual Automata is a Python 3 library built as a wrapper for Caleb Evans' Automata library to add more visualization features.

API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend

Natural Language Processing with transformers

Conversational text Analysis using various NLP techniques

OpenAI CLIP text encoders for multiple languages!

Syntax-aware Multi-spans Generation for Reading Comprehension (TASLP 2022)

Twitter-NLP-Analysis - Twitter Natural Language Processing Analysis

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

Training code for Korean multi-class sentiment analysis

Train and use generative text models in a few lines of code.

Repository for Graph2Pix: A Graph-Based Image to Image Translation Framework