Use Microsoft Edge's online text-to-speech service from Python
Python implementation of TextRank algorithms
Offline inference engine for art, real-time voice conversations
Speech recognition module for Python
Official inference repo for FLUX.2 models
Python tool for converting files and office documents to Markdown
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Ready-to-use OCR with 80+ supported languages
Official MiniMax Model Context Protocol (MCP) server
EPUB to audiobook converter, optimized for Audiobookshelf
The python library for real-time communication
State-of-the-art TTS model under 25MB
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
CLIP, Predict the most relevant text snippet given an image
Multi-Voice and Prompt-Controlled TTS Engine
Audiocraft is a library for audio processing and generation
A generative speech model for daily dialogue
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Converts text to speech in realtime
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Generating Immersive, Explorable, and Interactive 3D Worlds
Web interface for generating images using Stable Diffusion models
TTS with kokoro and onnx runtime
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A simple native web interface that uses ChatTTS to synthesize text