Qwen-Image is a powerful image generation foundation model
An Efficient Agentic Model for Computer Use
Fast-stable-diffusion + DreamBooth
Multimodal Diffusion with Representation Alignment
Pretrained time-series foundation model developed by Google Research
Easy Docker setup for Stable Diffusion with user-friendly UI
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
A Customizable Image-to-Video Model based on HunyuanVideo
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Tooling for the Common Objects In 3D dataset
An AI-powered security review GitHub Action using Claude
Renderer for the harmony response format to be used with gpt-oss
State-of-the-art (SoTA) text-to-video pre-trained model
Audio foundation model excelling in audio understanding
Phi-3.5 for Mac: Locally run Vision and Language Models
Revolutionizing Database Interactions with Private LLM Technology
Tiny vision language model
26M function-calling model that runs on incredibly small devices
Qwen3-ASR is an open-source series of ASR models
A Pragmatic VLA Foundation Model
Python SDK for Claude Agent
CLIP: predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
Project Lyra: Open Generative 3D World Models
General-purpose image editing model that delivers high-fidelity