ComfyUI wrapper nodes for HunyuanVideo
Generating Immersive, Explorable, and Interactive 3D Worlds
Crowdsourcing platform for full text transcription and tagging
Qwen-Image is a powerful image generation foundation model
Translate a video from one language to another and add dubbing
Extensions for Python Markdown
Agent harness to make your slop code well-engineered and beautiful
Unifying 3D Mesh Generation with Language Models
Rich is a Python library for rich text and beautiful formatting
Speech recognition module for Python
Collection of Gemma 3 variants that are trained for performance
High accuracy RAG for answering questions from scientific documents
Free, high-quality text-to-speech API endpoint to replace OpenAI
A Python parametric CAD scripting framework based on OCCT
Framework for building real-time voice and multimodal AI agents
Python binding to the Apache Tika™ REST services
Faster Whisper transcription with CTranslate2
Speech-AI-Forge is a project developed around TTS generation models
SoTA open-source TTS
A high-quality rapid TTS voice cloning model
The official Python SDK for the ElevenLabs API
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
Generate blog articles from video or audio
A voice cloning tool with a web interface that uses your own voice
Python & command-line tool to gather text on the Web