AlphaFold 3 inference pipeline
A multimodal model for brain response prediction
Tiny vision language model
Production-tested AI infrastructure tools
The most powerful local music generation model
An easy 1-click way to create beautiful artwork on your PC using AI
Advanced language and coding AI model
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
Python inference and LoRA trainer package for the LTX-2 audio–video
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Official inference repo for FLUX.2 models
A Family of Open-Source Music Foundation Models
Tooling for the Common Objects In 3D dataset
Foundation model for image generation
Official implementation of Watermark Anything with Localized Messages
General-purpose image editing model that delivers high-fidelity
ICLR2024 Spotlight: curation/training code, metadata, distribution
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Uncommon Objects in 3D dataset
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
A SOTA open-source image editing model
Open-source framework for intelligent speech interaction
LLM-based reinforcement learning audio editing model
Chat & pretrained large audio language model proposed by Alibaba Cloud