Robust Speech Recognition via Large-Scale Weak Supervision
Comprehensive Gradio WebUI for audio processing
Speech recognition module for Python
A robust, efficient, low-latency speech-to-text library
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
A deep learning toolkit for Text-to-Speech, battle-tested in research
The behavior guidance framework for customer-facing LLM agents
Toolkit for conversational AI
Stanford NLP Python library for many human languages
Models for the spaCy Natural Language Processing (NLP) library
Han Language Processing
A generative speech model for daily dialogue
High-quality multi-lingual text-to-speech library by MyShell.ai
Open source personal AI Assistant for Linux, Windows and Mac
A speech-text foundation model for real time dialogue
NLP Cloud serves high performance pre-trained or custom models for NER
Transforming Multimodal Content into Captivating Multilingual Audio
SoTA open-source TTS
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Build AI-powered semantic search applications
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
A very simple framework for state-of-the-art NLP
ChatGPT extension for scientific research work
Persian NLP Toolkit