ChatGPT interface with better UI
Release for Improved Denoising Diffusion Probabilistic Models
Official inference repo for FLUX.2 models
Inference code for scalable emulation of protein equilibrium ensembles
Diversity-driven optimization and large-model reasoning ability
PyTorch code and models for the DINOv2 self-supervised learning
GLM-4 series: Open Multilingual Multimodal Chat LMs
An experimental version of DeepSeek model
Lets make video diffusion practical
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Clean and efficient FP8 GEMM kernels with fine-grained scaling
LTX-Video Support for ComfyUI
ChatGLM-6B: An Open Bilingual Dialogue Language Model
OCR expert VLM powered by Hunyuan's native multimodal architecture
A Powerful Native Multimodal Model for Image Generation
MiniMax-M2, a model built for Max coding & agentic workflows
Ling is a MoE LLM provided and open-sourced by InclusionAI
CLIP, Predict the most relevant text snippet given an image
4M: Massively Multimodal Masked Modeling
One-click local MCP server installation in desktop apps
Collection of Gemma 3 variants that are trained for performance
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Official implementation of DreamCraft3D
Designed for text embedding and ranking tasks