Model export recipes, Python primitives, and Swift runtime utilities
26m function call model that runs on incredibly small devices
Official repository for LTX-Video
Tiny vision language model
Implementation of "MobileCLIP" CVPR 2024
Python inference and LoRA trainer package for the LTX-2 audio–video
Python SDK for Claude Agent
PyTorch code and models for the DINOv2 self-supervised learning
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Unified Multimodal Understanding and Generation Models
Multimodal embedding and reranking models built on Qwen3-VL
Instructions on how to use the Realtime API on Microcontrollers
Generate Any 3D Scene in Seconds
Foundation Models for Time Series
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
RGBD video generation model conditioned on camera input
Large-language-model & vision-language-model based on Linear Attention
Towards Real-World Vision-Language Understanding
Official DeiT repository
Code release for ConvNeXt V2 model
Code release for "Masked-attention Mask Transformer
Dual LSTM Encoder for Dialog Response Generation
Tiny pre-trained IBM model for multivariate time series forecasting
Small 3B-base multimodal model ideal for custom AI on edge hardware
Compact agentic model for coding, tools, and productivity tasks