TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
EPUB to audiobook converter, optimized for Audiobookshelf
Windows GUI Automation with Python (based on text properties)
Framework for building realtime multimodal voice AI agents apps
State-of-the-art (SoTA) text-to-video pre-trained model
High accuracy RAG for answering questions from scientific documents
A high-quality rapid TTS voice cloning model
Unifying 3D Mesh Generation with Language Models
Speech-AI-Forge is a project developed around TTS generation model
Open source healthcare AI
Free, high-quality text-to-speech API endpoint to replace OpenAI
Offline inference engine for art, real-time voice conversations
A 0.1B Omni model trained from scratch
An open-source toolkit for monitoring Language Learning Models (LLMs)
Faster Whisper transcription with CTranslate2
Generate audiobooks from e-books, voice cloning & 1107+ languages
Qwen3-omni is a natively end-to-end, omni-modal LLM
ASCII art library for Python
Snippet solution for Vim
Use Microsoft Edge's online text-to-speech service from Python
SoTA open-source TTS
A Sublime Text 2/3 plugin to see git diff in gutter
Reading book source
Foundation model for image generation
A text-to-speech, speech-to-text and speech-to-speech library