CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
DeepSeek Coder: Let the Code Write Itself
Open-Source Financial Large Language Models
Qwen2.5-VL is the multimodal large language model series
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
RGBD video generation model conditioned on camera input
Large-scale autoregressive pixel model for image generation by OpenAI
One-click local MCP server installation in desktop apps
A multimodal model for brain response prediction
ChatGPT interface with better UI
LTX-Video Support for ComfyUI
Lets make video diffusion practical
GLM-4 series: Open Multilingual Multimodal Chat LMs
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Visual Causal Flow
An experimental version of DeepSeek model
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Accurate × Fast × Comprehensive
A Systematic Framework for Interactive World Modeling
Bidirectional token-classification model for identifiable info
Code for running inference with the SAM 3D Body Model 3DB
Repo for SeedVR2 & SeedVR
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Proxy that exposes Antigravity provided claude / gemini models