Audiocraft is a library for audio processing and generation
The open-source voice synthesis studio powered by Qwen3-TTS
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Diagram and flowchart generation from text similar to markdown
Autoregressive Model Beats Diffusion
State-of-the-art (SoTA) text-to-video pre-trained model
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Image generation model with single-stream diffusion transformer
StarVector is a foundation model for SVG generation
Simple, powerful and flexible site generation framework
Long-form streaming TTS system for multi-speaker dialogue generation
Official inference repo for FLUX.2 models
Foundation model for image generation
NLP Cloud serves high performance pre-trained or custom models for NER
A TTS that fits in your CPU (and pocket)
ComfyUI wrapper nodes for WanVideo and related models
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
The most powerful local music generation model
Qwen-Image is a powerful image generation foundation model
Run AI models locally on your machine with node.js bindings for llama
Ready-to-use OCR with 80+ supported languages
The free, Open Source alternative to OpenAI, Claude and others
Open-Sora: Democratizing Efficient Video Production for All
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
An easy 1-click way to create beautiful artwork on your PC using AI