STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Last update: Oct 18, 2021

Related tags

Text Data & NLP st3

Overview

st3

STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Currently it supports converting pbmm models to pt scripts with integrated beam search.

Check out the first pre-release: https://github.com/proger/st3/releases

PyTorch impelementations of BERT-based Spelling Error Correction Models

59 Jun 29, 2021

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

VAENAR-TTS - PyTorch Implementation PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

67 Nov 14, 2022

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

WaveGlow A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis Quick Start: Install requirements: pip install

204 Jul 14, 2022

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Deepvoice3_pytorch PyTorch implementation of convolutional networks-based text-to-speech synthesis models: arXiv:1710.07654: Deep Voice 3: Scaling Tex

1.8k Dec 30, 2022

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

730 Jan 9, 2023

A Word Level Transformer layer based on PyTorch and 🤗 Transformers.

Transformer Embedder A Word Level Transformer layer based on PyTorch and 🤗 Transformers. How to use Install the library from PyPI: pip install transf

27 Nov 20, 2022

The PyTorch based implementation of continuous integrate-and-fire (CIF) module.

CIF-PyTorch This is a PyTorch based implementation of continuous integrate-and-fire (CIF) module for end-to-end (E2E) automatic speech recognition (AS

24 Dec 29, 2022

An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

pl_prompt_sst An example project using OpenPrompt under the framework of pytorch-lightning for a training prompt-based text classification model on SS

5 Oct 21, 2022

PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding This repository contains the official PyTorch implementation of th

26 Dec 14, 2022

Releases(english1)

english1(Sep 13, 2021)
This is a conversion of Coqui English STT v0.9.3 model to TorchScript, allowing to deploy a speech recognizer as a single file. The TorchScript bundle is self-contained and runs DeepSpeech frontend and beam search returning 10 best results. LM Scorer is not supported at the moment.

To run, download the pt file and save the following code to recognize.py and make sure you have torchaudio installed using pip3 install torchaudio:

import torch, torchaudio, sys waveform, sr = torchaudio.load(sys.argv[1], normalize=True) assert sr == 16000 model = torch.jit.load('coqui-stt-0.9.3-models.pt') for transcript, scores in model(waveform.squeeze()): print(transcript, scores)

Now you can run the model on English recordings like below. Any format supported by TorchAudio backend should work.

python3 recognize.py sample.wav
Source code(tar.gz)
Source code(zip)
coqui-stt-0.9.3-models.pt(180.26 MB)

Owner

Vlad Ki

GitHub Repository

Source code for the paper "TearingNet: Point Cloud Autoencoder to Learn Topology-Friendly Representations"

TearingNet: Point Cloud Autoencoder to Learn Topology-Friendly Representations Created by Jiahao Pang, Duanshun Li, and Dong Tian from InterDigital In

21 Dec 29, 2022

Topic Inference with Zeroshot models

zeroshot_topics Table of Contents Installation Usage License Installation zeroshot_topics is distributed on PyPI as a universal wheel and is available

55 Nov 28, 2022

A Semi-Intelligent ChatBot filled with statistical and economical data for the Premier League.

MONEYBALL - ChatBot Module: 4006CEM, Class: B, Group: 5 Contributors: Jonas Djondo Roshan Kc Cole Samson Daniel Rodrigues Ihteshaam Naseer Kind remind

1 Nov 18, 2021

NLP topic mdel LDA - Gathered from New York Times website

1 Oct 14, 2021

Behavioral Testing of Clinical NLP Models

Behavioral Testing of Clinical NLP Models This repository contains code for testing the behavior of clinical prediction models based on patient letter

2 Sep 20, 2022

ProteinBERT is a universal protein language model pretrained on ~106M proteins from the UniRef90 dataset.

ProteinBERT is a universal protein language model pretrained on ~106M proteins from the UniRef90 dataset. Through its Python API, the pretrained model can be fine-tuned on any protein-related task in

241 Jan 04, 2023

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

740 Dec 24, 2022

Research code for the paper "Fine-tuning wav2vec2 for speaker recognition"

Fine-tuning wav2vec2 for speaker recognition This is the code used to run the experiments in https://arxiv.org/abs/2109.15053. Detailed logs of each t

103 Dec 26, 2022

A minimal code for fairseq vq-wav2vec model inference.

vq-wav2vec inference A minimal code for fairseq vq-wav2vec model inference. Runs without installing the fairseq toolkit and its dependencies. Usage ex

7 Nov 15, 2022

Translators - is a library which aims to bring free, multiple, enjoyable translation to individuals and students in Python

907 Dec 27, 2022

STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Related tags

Overview

st3

You might also like...

PyTorch impelementations of BERT-based Spelling Error Correction Models

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

A Word Level Transformer layer based on PyTorch and 🤗 Transformers.

The PyTorch based implementation of continuous integrate-and-fire (CIF) module.

An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Releases(english1)

english1(Sep 13, 2021)

Owner

Vlad Ki

Source code for the paper "TearingNet: Point Cloud Autoencoder to Learn Topology-Friendly Representations"

Topic Inference with Zeroshot models

A Semi-Intelligent ChatBot filled with statistical and economical data for the Premier League.

NLP topic mdel LDA - Gathered from New York Times website

Behavioral Testing of Clinical NLP Models

ProteinBERT is a universal protein language model pretrained on ~106M proteins from the UniRef90 dataset.

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

Research code for the paper "Fine-tuning wav2vec2 for speaker recognition"

A minimal code for fairseq vq-wav2vec model inference.

Translators - is a library which aims to bring free, multiple, enjoyable translation to individuals and students in Python

Python SDK for working with Voicegain Speech-to-Text

Large-scale Knowledge Graph Construction with Prompting

SDL: Synthetic Document Layout dataset

Words-per-minute - A terminal app written in python utilizing the curses module that tests the user's ability to type

DataCLUE: 国内首个以数据为中心的AI测评（含模型分析报告）

Finally, some decent sample sentences

simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.

This repository implements a brute-force spellchecker utilizing the Damerau-Levenshtein edit distance.

The (extremely) naive sentiment classification function based on NBSVM trained on wisesight_sentiment

Rhyme with AI