A 0.1B Omni model trained from scratch
26M function-calling model that runs on incredibly small devices
Models for object and human mesh reconstruction
Tool for exploring and debugging transformer model behaviors
This repository contains the official implementation of FastVLM
Qwen2.5-VL is a multimodal large language model series
Z80-μLM is a 2-bit quantized language model
Language modeling in a sentence representation space
Diversity-driven optimization and large-model reasoning ability
Official DeiT repository
Code release for "Masked-attention Mask Transformer
Large-scale autoregressive pixel model for image generation by OpenAI
Generate embeddings from large-scale graph-structured data
Model that fuses instruction-following, reasoning, and agentic skills