PyTorch code and models for the DINOv2 self-supervised learning
Release for Improved Denoising Diffusion Probabilistic Models
Code for running inference with the SAM 3D Body Model 3DB
Visual Causal Flow
Official inference repo for FLUX.2 models
Models for object and human mesh reconstruction
gpt-oss-120b and gpt-oss-20b are two open-weight language models
GLM-4 series: Open Multilingual Multimodal Chat LMs
ChatGPT interface with better UI
Inference code for scalable emulation of protein equilibrium ensembles
Lets make video diffusion practical
An experimental version of DeepSeek model
A Powerful Native Multimodal Model for Image Generation
Diversity-driven optimization and large-model reasoning ability
High-Fidelity and Controllable Generation of Textured 3D Assets
CLIP, Predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
4M: Massively Multimodal Masked Modeling
Z80-μLM is a 2-bit quantized language model
LTX-Video Support for ComfyUI
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Designed for text embedding and ranking tasks
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Repo of Qwen2-Audio chat & pretrained large audio language model
Large Multimodal Models for Video Understanding and Editing