Model export recipes, Python primitives, and Swift runtime utilities
26m function call model that runs on incredibly small devices
Official repository for LTX-Video
Port of Facebook's LLaMA model in C/C++
Implementation of "MobileCLIP" CVPR 2024
Python inference and LoRA trainer package for the LTX-2 audio–video
Tiny vision language model
PyTorch code and models for the DINOv2 self-supervised learning
Python SDK for Claude Agent
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Unified Multimodal Understanding and Generation Models
RGBD video generation model conditioned on camera input
Multimodal embedding and reranking models built on Qwen3-VL
Instructions on how to use the Realtime API on Microcontrollers
Generate Any 3D Scene in Seconds
Foundation Models for Time Series
Open-source large language model family from Tencent Hunyuan
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Large-language-model & vision-language-model based on Linear Attention
Clean and efficient FP8 GEMM kernels with fine-grained scaling
FlashMLA: Efficient Multi-head Latent Attention Kernels
Pretrained time-series foundation model developed by Google Research
A Customizable Image-to-Video Model based on HunyuanVideo
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Towards self-verifiable mathematical reasoning