Agentic, Reasoning, and Coding (ARC) foundation models
Official Python inference and LoRA trainer package
A Powerful Native Multimodal Model for Image Generation
Pretrained time-series foundation model developed by Google Research
An AI-powered security review GitHub Action using Claude
High-Resolution Image Synthesis with Latent Diffusion Models
Collection of Gemma 3 variants that are trained for performance
Video Object and Interaction Deletion
VMZ: Model Zoo for Video Modeling
Hunyuan Translation Model Version 1.5
Repo of Qwen2-Audio chat & pretrained large audio language model
Large Multimodal Models for Video Understanding and Editing
Pokee Deep Research Model Open Source Repo
Easy Docker setup for Stable Diffusion with user-friendly UI
Inference code for scalable emulation of protein equilibrium ensembles
Qwen-Image is a powerful image generation foundation model
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Open-source industrial-grade ASR models
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Qwen3-omni is a natively end-to-end, omni-modal LLM
Uncommon Objects in 3D dataset
GPT4V-level open-source multi-modal model based on Llama3-8B
Chinese and English multimodal conversational language model