基于百度的语音识别，用python实现，pyaudio+pyqt

Last update: Jan 03, 2022

Related tags

Text Data & NLP Speech-recognition

Overview

Speech-recognition

基于百度的语音识别，python3.8(conda)+pyaudio+pyqt+baidu-aip

百度有面向python的语音识别框架，用pip 直接安装

pip install baidu-aip

安装完成后，在百度智能云完成登录，在控制台创建一个应用。

在下面填入自己的 APP_ID,API_KEY,SECRET_KEY后，能运行即可

def main():
    APP_ID = 'your id'
    API_KEY = 'your key'
    SECRET_KEY = 'your secret keys'

    client = AipSpeech(APP_ID, API_KEY, SECRET_KEY)
    outfile = "./audio/16k.wav"

    app = QApplication(sys.argv)
    form = ExampleApp(outfile, client)
    form.show()

    app.exec_()


if __name__ == '__main__':
    main()

Releases(V1.0)

V1.0(Dec 29, 2021)

speech_recognition v1.0
Source code(tar.gz)
Source code(zip)

Owner

J-L

Delayed enjoyment

GitHub Repository

Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge

Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge This is an implementation of the paper,

19 Oct 14, 2022

A Python/Pytorch app for easily synthesising human voices

Voice Cloning App A Python/Pytorch app for easily synthesising human voices Documentation Discord Server Video guide Voice Sharing Hub FAQ's System Re

840 Jan 04, 2023

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

This repository contains code for the following two papers: VisualBERT: A Simple and Performant Baseline for Vision and Language (arxiv) with a short

464 Jan 04, 2023

Chinese NewsTitle Generation Project by GPT2.带有超级详细注释的中文GPT2新闻标题生成项目。

GPT2-NewsTitle 带有超详细注释的GPT2新闻标题生成项目 UpDate 01.02.2021 从网上收集数据，将清华新闻数据、搜狗新闻数据等新闻数据集，以及开源的一些摘要数据进行整理清洗，构建一个较完善的中文摘要数据集。数据集清洗时，仅进行了简单地规则清洗。

785 Dec 29, 2022

This repository contains the code for "Generating Datasets with Pretrained Language Models".

Datasets from Instructions (DINO 🦕 ) This repository contains the code for Generating Datasets with Pretrained Language Models. The paper introduces

154 Jan 01, 2023

Translation to python of Chris Sims' optimization function

pycsminwel This is a locol minimization algorithm. Uses a quasi-Newton method with BFGS update of the estimated inverse hessian. It is robust against

1 Mar 21, 2022

Code for evaluating Japanese pretrained models provided by NTT Ltd.

japanese-dialog-transformers 日本語の説明文はこちら This repository provides the information necessary to evaluate the Japanese Transformer Encoder-decoder dialo

216 Dec 22, 2022

Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering

Disfl-QA is a targeted dataset for contextual disfluencies in an information seeking setting, namely question answering over Wikipedia passages. Disfl-QA builds upon the SQuAD-v2 (Rajpurkar et al., 2

52 Jun 21, 2022

Th2En & Th2Zh: The large-scale datasets for Thai text cross-lingual summarization

Th2En & Th2Zh: The large-scale datasets for Thai text cross-lingual summarization 📥 Download Datasets 📥 Download Trained Models INTRODUCTION TH2ZH (

5 Jan 03, 2022

A spaCy wrapper of OpenTapioca for named entity linking on Wikidata

spaCyOpenTapioca A spaCy wrapper of OpenTapioca for named entity linking on Wikidata. Table of contents Installation How to use Local OpenTapioca Vizu

80 Jan 03, 2023

this repository has datasets containing information of Uber pickups in NYC from April 2014 to September 2014 and January to June 2015. data Analysis , virtualization and some insights are gathered here

uber-pickups-analysis Data Source: https://www.kaggle.com/fivethirtyeight/uber-pickups-in-new-york-city Information about data set The dataset contain

1 Nov 02, 2021

基于百度的语音识别，用python实现，pyaudio+pyqt

Related tags

Overview

Speech-recognition

You might also like...

Releases(V1.0)

V1.0(Dec 29, 2021)

Owner

J-L

Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge

A Python/Pytorch app for easily synthesising human voices

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

Chinese NewsTitle Generation Project by GPT2.带有超级详细注释的中文GPT2新闻标题生成项目。

This repository contains the code for "Generating Datasets with Pretrained Language Models".

Translation to python of Chris Sims' optimization function

Code for evaluating Japanese pretrained models provided by NTT Ltd.

Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering

Th2En & Th2Zh: The large-scale datasets for Thai text cross-lingual summarization

A spaCy wrapper of OpenTapioca for named entity linking on Wikidata

this repository has datasets containing information of Uber pickups in NYC from April 2014 to September 2014 and January to June 2015. data Analysis , virtualization and some insights are gathered here

Train GPT-3 model on V100(16GB Mem) Using improved Transformer.

Addon for adding subtitle files to blender VSE as Text sequences. Using pysub2 python module.

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

Türkçe küfürlü içerikleri bulan bir yapay zeka kütüphanesi / An ML library for profanity detection in Turkish sentences

Associated Repository for "Translation between Molecules and Natural Language"

SAVI2I: Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors

SentimentArcs: a large ensemble of dozens of sentiment analysis models to analyze emotion in text over time

Longformer: The Long-Document Transformer

Simple and efficient RevNet-Library with DeepSpeed support