kaldi-asr/kaldi is the official location of the Kaldi project.

Last update: Jan 05, 2023

Overview

Kaldi Speech Recognition Toolkit

To build the toolkit: see ./INSTALL. These instructions are valid for UNIX systems including various flavors of Linux; Darwin; and Cygwin (has not been tested on more "exotic" varieties of UNIX). For Windows installation instructions (excluding Cygwin), see windows/INSTALL.

To run the example system builds, see egs/README.txt

If you encounter problems (and you probably will), please do not hesitate to contact the developers (see below). In addition to specific questions, please let us know if there are specific aspects of the project that you feel could be improved, that you find confusing, etc., and which missing features you most wish it had.

Kaldi information channels

For HOT news about Kaldi see the project site.

Documentation of Kaldi:

Info about the project, description of techniques, tutorial for C++ coding.
Doxygen reference of the C++ code.

Kaldi forums and mailing lists:

We have two different lists

User list kaldi-help
Developer list kaldi-developers:

To sign up to any of those mailing lists, go to http://kaldi-asr.org/forums.html:

Development pattern for contributors

Create a personal fork of the main Kaldi repository in GitHub.
Make your changes in a named branch different from master, e.g. you create a branch my-awesome-feature.
Generate a pull request through the Web interface of GitHub.
As a general rule, please follow Google C++ Style Guide. There are a few exceptions in Kaldi. You can use the Google's cpplint.py to verify that your code is free of basic mistakes.

Platform specific notes

PowerPC 64bits little-endian (ppc64le)

Kaldi is expected to work out of the box in RHEL >= 7 and Ubuntu >= 16.04 with OpenBLAS, ATLAS, or CUDA.
CUDA drivers for ppc64le can be found at https://developer.nvidia.com/cuda-downloads.
An IBM Redbook is available as a guide to install and configure CUDA.

Android

Kaldi supports cross compiling for Android using Android NDK, clang++ and OpenBLAS.
See this blog post for details.

kaldi-asr/kaldi is the official location of the Kaldi project.

Related tags

Overview

Kaldi Speech Recognition Toolkit

Kaldi information channels

Development pattern for contributors

Platform specific notes

PowerPC 64bits little-endian (ppc64le)

Android

Owner

Kaldi

code for our ICCV 2021 paper "DeepCAD: A Deep Generative Network for Computer-Aided Design Models"

2 telegram-bots: for image recognition and for text generation

🖺 OCR using tensorflow with attention

A tool to enhance your old/damaged pictures built using python & opencv.

scene-linear test images

An application of high resolution GANs to dewarp images of perturbed documents

The code of "Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes"

WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

[BMVC'21] Official PyTorch Implementation of Grounded Situation Recognition with Transformers

A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

Msos searcher - A half-hearted attempt at finding a magic square of squares

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

~1000 book pages + OpenCV + python = page regions identified as paragraphs, lines, images, captions, etc.

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

A list of hyperspectral image super-solution resources collected by Junjun Jiang

A selectional auto-encoder approach for document image binarization

Forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE

第一届西安交通大学人工智能实践大赛（2018AI实践大赛--图片文字识别）第一名；仅采用densenet识别图中文字

Script para controlar o movimento do mouse usando Python e openCV com câmera em tempo real que detecta pontos de referência da mão, rastreia padrões de gestos em vez de um mouse físico.

Code for paper "Role-based network embedding via structural features reconstruction with degree-regularized constraint"