A Multi-Modal World Model for Reconstructing, Generating, Simulation
The open-source voice synthesis studio powered by Qwen3-TTS
Autoregressive Model Beats Diffusion
Audiocraft is a library for audio processing and generation
Image generation model with single-stream diffusion transformer
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
StarVector is a foundation model for SVG generation
Diagram and flowchart generation from text similar to markdown
Simple, powerful and flexible site generation framework
Long-form streaming TTS system for multi-speaker dialogue generation
Official inference repo for FLUX.2 models
NLP Cloud serves high performance pre-trained or custom models for NER
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Foundation model for image generation
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Run AI models locally on your machine with node.js bindings for llama
The most powerful local music generation model
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
A TTS that fits in your CPU (and pocket)
Qwen-Image is a powerful image generation foundation model
A text-to-speech, speech-to-text and speech-to-speech library
An easy 1-click way to create beautiful artwork on your PC using AI
Curated AI engineering notes on LLMs, generative models, and tools
Examples and guides for using the Gemini API
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim