Port of OpenAI's Whisper model in C/C++
kaldi-asr/kaldi is the official location of the Kaldi project
Foundational Models for State-of-the-Art Speech and Text Translation
TEN, a voice agent framework for creating conversational AI
A library for audio and music analysis and feature extraction
Local AI file organization with categorization and rename suggestions
Drop In the Bucket Neural Networks
Improved JPEG encoder
Fork of the Cuneiform OCR software
Speech recognition software for the English and Polish languages
A Torch implementation of the object detection network
Speech recognition for Ubuntu