A 0.1B Omni model trained from scratch
26m function call model that runs on incredibly small devices
Models for object and human mesh reconstruction
Tool for exploring and debugging transformer model behaviors
This repository contains the official implementation of FastVLM
Qwen2.5-VL is the multimodal large language model series
Z80-μLM is a 2-bit quantized language model
Language modeling in a sentence representation space
Diversity-driven optimization and large-model reasoning ability
Official DeiT repository
Detect faces in an image
Python example app from the OpenAI API quickstart tutorial
A repository of trained models
Code release for "Masked-attention Mask Transformer
Large-scale autoregressive pixel model for image generation by OpenAI
Generate embeddings from large-scale graph-structured data
Model that fuses instruct, reasoning and agentic skills
T5-Small: Lightweight text-to-text transformer for NLP tasks
Lightweight 24B agentic coding model with vision and long context
Large language model developed and released by NVIDIA
Compact English sentence embedding model for semantic search tasks
Small 3B-base multimodal model ideal for custom AI on edge hardware
Ultra-efficient 3B multimodal instruct model built for edge deployment
685B model with improved agents and consistency
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices