Audiocraft is a library for audio processing and generation
Generate audiobooks from e-books, voice cloning & 1107+ languages
Qwen3-omni is a natively end-to-end, omni-modal LLM
A high-quality rapid TTS voice cloning model
text and image to video generation: CogVideoX (2024) and CogVideo
Offline inference engine for art, real-time voice conversations
Go efficient multilingual NLP and text segmentation
Python binding to the Apache Tika™ REST services
Speech-AI-Forge is a project developed around TTS generation model
Industrial-level controllable zero-shot text-to-speech system
Label Studio is a multi-type data labeling and annotation tool
Persian NLP Toolkit
High-performance, multiplayer code editor from the creators of Atom
Open source no-code system for text annotation and building of text
Offline Text To Speech synthesis for python
A Unified Framework for Text-to-3D and Image-to-3D Generation
Chat & pretrained large vision language model
Stanford CoreNLP, a Java suite of core NLP tools
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Lightning-fast, on-device TTS, running natively via ONNX
Connect MATLAB to LLM APIs, including OpenAI® Chat Completions
Clone a voice in 5 seconds to generate arbitrary speech in real-time
General natural language facilities for node
A persistent, network resilient, full text search library
A sound cloning tool with a web interface, using your voice