Speech Rankings

This project mimics CSRankings to generate an ordered list of researchers in speech/spoken language processing along with their possible research topics, based on recent publications on important venues of the field, so as to help students seeking for PhD studies to find desirable advisors.

How to use

The pre-generated report is available at here. To build it by yourself,

Run prepare_data.py to build publications.json and authors.json, or simply use the data provided, covering those from 2011 to 2021.
Run export.py to generate the report.

How does it work

We scrape author metadata and publication data of the following three types of venues from DBLP, including:

Speech venues: Interspeech, Speech Communications, SLT, SSW, ASRU, IWSLT
Mixed venues: ICASSP, TASLP
General venues: NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL, KDD, AAAI, IJCAI

All publications in Speech venues are included. Paricularly for Interspeech, section/field of each paper are collected from ISCA Archive to show possible research topics of each researcher. So are the keywords from IEEE Xplore for papers published on IEEE-held venues. Keywords (as well as titles) are also used to filter out non-speech papers in Mixed venues by a set of rules. Titles are used to identify speech papers in General venues. Researchers are sorted by the total number of publications.

The collected data contain errors, and the project is neither intended to index speech-related papers nor to compare researchers in the field.

A CSRankings-like index for speech researchers

Related tags

Overview

Speech Rankings

How to use

How does it work

Owner

Mutian He

無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXの音声合成エンジン

A framework for cleaning Chinese dialog data

The code for two papers: Feedback Transformer and Expire-Span.

FireFlyer Record file format, writer and reader for DL training samples.

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

使用pytorch+transformers复现了SimCSE论文中的有监督训练和无监督训练方法

PyTorch implementation of NATSpeech: A Non-Autoregressive Text-to-Speech Framework

Pretty-doc - Composable text objects with python

Voice Assistant inspired by Google Assistant, Cortana, Alexa, Siri, ...

Python generation script for BitBirds

Community and sentiment analysis based on tweets

GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Rich Prosody Diversity Modelling with Phone-level Mixture Density Network

BMInf (Big Model Inference) is a low-resource inference package for large-scale pretrained language models (PLMs).

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP. Democratize AI for everyone.

Unsupervised Language Model Pre-training for French

Lyrics generation with GPT2-based Transformer

💛 Code and Dataset for our EMNLP 2021 paper: "Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes"

A natural language processing model for sequential sentence classification in medical abstracts.

A Semi-Intelligent ChatBot filled with statistical and economical data for the Premier League.