Model export recipes, Python primitives, and Swift runtime utilities
Official repository for LTX-Video
Native and Compact Structured Latents for 3D Generation
Python SDK for Claude Agent
Python inference and LoRA trainer package for the LTX-2 audio–video
Flux 2 image generation model pure C inference
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
26m function call model that runs on incredibly small devices
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Unified Multimodal Understanding and Generation Models
PyTorch code and models for the DINOv2 self-supervised learning
Multimodal embedding and reranking models built on Qwen3-VL
Instructions on how to use the Realtime API on Microcontrollers
Generate Any 3D Scene in Seconds
Foundation Models for Time Series
tiktoken is a fast BPE tokeniser for use with OpenAI's models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Large-language-model & vision-language-model based on Linear Attention
Towards Real-World Vision-Language Understanding
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Software that can generate photos from paintings
A minimal PyTorch re-implementation of the OpenAI GPT
Code release for "Masked-attention Mask Transformer
A mix of GAN implementations including progressive growing
Dual LSTM Encoder for Dialog Response Generation