Reading book source
Open source no-code system for text annotation and building of text
Speech-AI-Forge is a project developed around TTS generation model
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Industrial-level controllable zero-shot text-to-speech system
Generate audiobooks from e-books, voice cloning & 1107+ languages
CLIP, Predict the most relevant text snippet given an image
text and image to video generation: CogVideoX (2024) and CogVideo
Controllable & emotion-expressive zero-shot TTS
Controllable and fast Text-to-Speech for over 7000 languages
TTS with kokoro and onnx runtime
Synchronized Translation for Videos
Parse files for optimal RAG
A sound cloning tool with a web interface, using your voice
A nearly-live implementation of OpenAI's Whisper
Implementation of Imagen, Google's Text-to-Image Neural Network
A simple native web interface that uses ChatTTS to synthesize text
Offline Text To Speech synthesis for python
Open Source Document Management System for Digital Archives
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Offline inference engine for art, real-time voice conversations
VITS2 backbone with multilingual-bert
Generate blog articles from video or audio
Speech recognition module for Python
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning