WeKws

Production First and Production Ready End-to-End Keyword Spotting Toolkit.

The goal of this toolkit is to...

Small-footprint keyword spotting (KWS), or more specifically wake-up word (WuW) detection, is a typical and important module in Internet of Things (IoT) devices. It gives users a hands-free way to control those devices. A WuW detection system usually runs locally and persistently on the device, so it needs low power consumption, few model parameters, and low computational complexity. It must also detect the predefined keyword in a streaming fashion, i.e., with low latency.
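To make the streaming requirement concrete, here is a minimal sketch of a frame-by-frame detector. It is not the WeKws implementation: it assumes some model already emits one keyword posterior per audio frame, and the class name `StreamingDetector`, the helper `detect`, and the `window`, `threshold` and `cooldown` parameters are all hypothetical names chosen for illustration.

```python
from collections import deque
from typing import Iterable, List


class StreamingDetector:
    """Fires when the moving average of per-frame keyword posteriors
    reaches a threshold. Illustrative only; not the WeKws API."""

    def __init__(self, window: int = 5, threshold: float = 0.8,
                 cooldown: int = 10) -> None:
        self.window = window
        self.threshold = threshold
        self.cooldown = cooldown
        # Only the last `window` posteriors are kept, so memory is constant.
        self._recent = deque(maxlen=window)
        # Start at `cooldown` so the first detection is not suppressed.
        self._frames_since_fire = cooldown

    def accept(self, posterior: float) -> bool:
        """Consume one frame's posterior; return True on a detection."""
        self._recent.append(posterior)
        self._frames_since_fire += 1
        if len(self._recent) < self.window:
            return False  # not enough context yet
        smoothed = sum(self._recent) / self.window
        if smoothed >= self.threshold and self._frames_since_fire > self.cooldown:
            # Reset state so one utterance does not trigger repeatedly.
            self._frames_since_fire = 0
            self._recent.clear()
            return True
        return False


def detect(posteriors: Iterable[float], **kwargs) -> List[int]:
    """Return the frame indices at which the detector fires."""
    detector = StreamingDetector(**kwargs)
    return [i for i, p in enumerate(posteriors) if detector.accept(p)]


if __name__ == "__main__":
    # Synthetic posteriors: silence, a burst of keyword evidence, silence.
    stream = [0.1] * 5 + [0.95] * 5 + [0.1] * 5
    print(detect(stream, window=3, threshold=0.8, cooldown=3))
```

Each frame is handled as it arrives and only a fixed-size window is stored, which is what keeps latency and memory low. With the synthetic stream above, the detector fires once, at frame index 7, as soon as three consecutive high posteriors fill the window.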

Typical Scenario

We are going to support the following typical wake-up word applications:

Single wake-up word
Multiple wake-up words
Customizable wake-up word
Personalized wake-up word, i.e., a combination of wake-up word detection and voiceprint

Installation

Clone the repo

git clone https://github.com/wenet-e2e/wekws.git

Install Conda: please see https://docs.conda.io/en/latest/miniconda.html
Create Conda env:

conda create -n wenet python=3.8
conda activate wenet
pip install -r requirements.txt
conda install pytorch=1.10.0 torchaudio=0.10.0 cudatoolkit=11.1 -c pytorch -c conda-forge
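After running the commands above, a quick check that the expected packages landed in the active environment can save debugging time later. The sketch below uses only the Python standard library; the helper name `installed_versions` is hypothetical, and it assumes PyTorch and torchaudio register their metadata under the distribution names `torch` and `torchaudio`.

```python
from importlib import metadata
from typing import Dict, Iterable, Optional


def installed_versions(names: Iterable[str]) -> Dict[str, Optional[str]]:
    """Map each distribution name to its installed version, or None if absent."""
    versions: Dict[str, Optional[str]] = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Assumed distribution names; adjust if your environment differs.
    for name, version in installed_versions(["torch", "torchaudio"]).items():
        print(f"{name}: {version or 'not installed'}")
```

Run it inside the activated Conda environment. A package reported as not installed usually means the install command ran in a different environment.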

Dataset

We plan to support a variety of open-source wake-up word datasets, including but not limited to:

All well-trained models on these datasets will be made publicly available.

Runtime

We plan to support a variety of hardware and platforms, including:

Web browser
x86
Android
Raspberry Pi
