A multimodal model for brain response prediction
Tiny vision language model
Production-tested AI infrastructure tools
The most powerful local music generation model
General-purpose image editing model that delivers high-fidelity
Python inference and LoRA trainer package for the LTX-2 audio–video
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Advanced language and coding AI model
An easy 1-click way to create beautiful artwork on your PC using AI
A Family of Open Sourced Music Foundation Models
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Official inference repo for FLUX.2 models
ICLR2024 Spotlight: curation/training code, metadata, distribution
Chat & pretrained large audio language model proposed by Alibaba Cloud
Foundation model for image generation
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
CogView4, CogView3-Plus and CogView3(ECCV 2024)
High-Resolution Image Synthesis with Latent Diffusion Models
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Let us control diffusion models
The official pytorch implementation of our paper
Text-to-image model optimized for artistic quality and safe generation
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
An advanced bilingual image editing with semantic control