Use Microsoft Edge's online text-to-speech service from Python
Dramatron uses large language models to generate coherent scripts
Official inference repo for FLUX.2 models
Models for object and human mesh reconstruction
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Open-Sora: Democratizing Efficient Video Production for All
Multi-Voice and Prompt-Controlled TTS Engine
TTS with kokoro and onnx runtime
RGBD video generation model conditioned on camera input
A robust, efficient, low-latency speech-to-text library
Powerful AI language model (MoE) optimized for efficiency/performance
Build GenAI application quick and easy
Offline inference engine for art, real-time voice conversations
Self-hosted AI coding assistant
An undetectable, powerful, flexible, high-performance Python library
Open-source, high-performance AI model with advanced reasoning
Plug-and-play library to enable agents to call MCP and UTCP tools
Python library for defining and optimizing mathematical expressions
The React for Voice and Chat, build apps for Alexa, Google Assistant
The python library for real-time communication
Access large language models from the command-line
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A nearly-live implementation of OpenAI's Whisper
DeepSeek Coder: Let the Code Write Itself
An API standard for single-agent reinforcement learning environments