Scaling Reinforcement Learning with LLMs
Long-form streaming TTS system for multi-speaker dialogue generation
Open-source industrial-grade ASR models
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Hunyuan Translation Model Version 1.5
Multimodal embedding and reranking models built on Qwen3-VL
Z80-μLM is a 2-bit quantized language model
Extension index for stable-diffusion-webui
Implementation of "MobileCLIP" CVPR 2024
VMZ: Model Zoo for Video Modeling
High-resolution models for human tasks
Video understanding codebase from FAIR for reproducing video models
Instructions on how to use the Realtime API on Microcontrollers
Tool for exploring and debugging transformer model behaviors
Ling is a MoE LLM provided and open-sourced by InclusionAI
A Unified Framework for Text-to-3D and Image-to-3D Generation
Personalize Any Characters with a Scalable Diffusion Transformer
Genome modeling and design across all domains of life
Achieving 3+ generation speedup on reasoning tasks
Ultra-Efficient LLMs on End Device
Pretrained time-series foundation model developed by Google Research
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Open-source deep-learning framework
Continuous Autonomy for the AI SDK
Multimodal model achieving SOTA performance