State-of-the-art (SoTA) text-to-video pre-trained model
RGBD video generation model conditioned on camera input
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Inference code for scalable emulation of protein equilibrium ensembles
The Clay Foundation Model - An open source AI model and interface
An Efficient Agentic Model for Computer Use
Audio foundation model excelling in audio understanding
Official codebase for LeWorldModel: Stable End-to-End Joint-Embedding
Tiny vision language model
The official PyTorch implementation of Google's Gemma models
A 0.1B Omni model trained from scratch
26M function-calling model that runs on incredibly small devices
Open Source Speech Language Model
Long-form streaming TTS system for multi-speaker dialogue generation
Open-source industrial-grade ASR models
Qwen3-ASR is an open-source series of ASR models
Foundation model for image generation
Fast-stable-diffusion + DreamBooth
A Pragmatic VLA Foundation Model
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Hunyuan Translation Model Version 1.5
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
Z80-μLM is a 2-bit quantized language model
Collection of Gemma 3 variants that are trained for performance