This repository contains the data used in the NAACL 2021 paper "Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems".

Last update: Dec 04, 2022

Overview

Proteno

This is the data release for the NAACL 2021 paper "Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems" (https://arxiv.org/abs/2104.07777).

Security

See CONTRIBUTING for more information.

License

This project is released under CC-BY-NC-4.0 and other licenses:

English: CC-BY-SA
Spanish: CC-BY-SA
Tamil: CC-BY-NC-SA

Citation

If you use our data, please cite the following paper:

@inproceedings{tyagi-etal-2021-proteno,
    title = "Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems",
    author = "Tyagi, Shubhi  and
      Bonafonte, Antonio  and
      Lorenzo-Trueba, Jaime  and
      Latorre, Javier",
    booktitle = "Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers",
    month = jun,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2021.naacl-industry.10",
    pages = "72--79",
}

