- Source: Kaldi (software)
Kaldi is an open-source speech recognition toolkit written in C++ for speech recognition and signal processing, freely available under the Apache License v2.0.
Kaldi aims to provide software that is flexible and extensible, and is intended for use by automatic speech recognition (ASR) researchers for building a recognition system.
It supports linear transforms, MMI, boosted MMI and MCE discriminative training, feature-space discriminative training, and deep neural networks.
Kaldi is capable of generating features like mfcc, fbank, fMLLR, etc. Hence in recent deep neural network research, a popular usage of Kaldi is to pre-process raw waveform into acoustic feature for end-to-end neural models.
Kaldi has been incorporated as part of the CHiME Speech Separation and Recognition Challenge over several successive events. The software was initially developed as part of a 2009 workshop at Johns Hopkins University.
Kaldi is named after the legendary Ethiopian goat herder Kaldi who was said to have discovered the coffee plant.
See also
fMLLR
List of speech recognition software
References
External links
Official website
Kaldi – The official GitHub project
Kaldi paper - The Kaldi Speech Recognition Toolkit
VOSK – open source and commercial models from Alpha Cephei on Kaldi foundations
Kata Kunci Pencarian:
- Kaldi (software)
- Speech recognition software for Linux
- List of speech recognition software
- Speech recognition
- TAI Anka
- Emerging technologies
- Gottfried Wilhelm Leibniz
- Index Theologicus