Robust Speech Recognition via Large-Scale Weak Supervision
Speech recognition module for Python
Open-source industrial-grade ASR models
Audio foundation model excelling in audio understanding
kaldi-asr/kaldi is the official location of the Kaldi project
A PyTorch-based Speech Toolkit
StreamSpeech is a seamless model for offline speech recognition
Multilingual Automatic Speech Recognition with word-level timestamps
Toolkit for conversational AI
Voice Recognition to Text Tool
Underthesea - Vietnamese NLP Toolkit
Repo of Qwen2-Audio chat & pretrained large audio language model
Translate the video from one language to another and embed dubbing
Training data (data labeling, annotation, workflow) for all data types
Capable of understanding text, audio, vision, video
The behavior guidance framework for customer-facing LLM agents
Omnilingual ASR Open-Source Multilingual SpeechRecognition
End-to-end speech processing toolkit
Replace OpenAI GPT with another LLM in your app
Real-time voice interactive digital human
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Han Language Processing
Qwen3-ASR is an open-source series of ASR models
NLP Cloud serves high performance pre-trained or custom models for NER
Persian NLP Toolkit