A Python parametric CAD scripting framework based on OCCT
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A high-quality rapid TTS voice cloning model
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Implementation of Imagen, Google's Text-to-Image Neural Network
Python binding to the Apache Tika™ REST services
Easily compute CLIP embeddings and build a CLIP retrieval system
Generate audiobooks from e-books
Full Git and GitHub integration with Sublime Text
Generate blog articles from video or audio
Foundation model for image generation
PersonaPlex code
Lightweight Markdown-only skills for autonomous ML research
A sound cloning tool with a web interface, using your voice
Generating Immersive, Explorable, and Interactive 3D Worlds
Rich is a Python library for rich text and beautiful formatting
MTEB: Massive Text Embedding Benchmark
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Extensions for Python Markdown
World's first open-source, agentic video production system
Open Source Speech Language Model
Tools to ease the creation of snippets, syntax definitions, etc.
Voice Recognition to Text Tool
Get free HTTPS certificates forever from Let's Encrypt
General-purpose image editing model that delivers high-fidelity