A 0.1B Omni model trained from scratch
26m function call model that runs on incredibly small devices
Qwen3-ASR is an open-source series of ASR models
A Pragmatic VLA Foundation Model
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Hunyuan Translation Model Version 1.5
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
Z80-μLM is a 2-bit quantized language model
Collection of Gemma 3 variants that are trained for performance
Implementation of "MobileCLIP" CVPR 2024
Official implementation of Watermark Anything with Localized Messages
Video understanding codebase from FAIR for reproducing video models
Instructions on how to use the Realtime API on Microcontrollers
Open-weight, large-scale hybrid-attention reasoning model
Qwen3-omni is a natively end-to-end, omni-modal LLM
Towards self-verifiable mathematical reasoning
Bidirectional token-classification model for identifiable info
Genome modeling and design across all domains of life
Ultra-Efficient LLMs on End Device
Pretrained time-series foundation model developed by Google Research
General-purpose image editing model that delivers high-fidelity
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Continuous Autonomy for the AI SDK
Multimodal model achieving SOTA performance