Open-source image generative foundation model
Open-source industrial-grade ASR models
Ling is a MoE LLM provided and open-sourced by InclusionAI
Netease Youdao's open-source embedding and reranker models
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Collection of Gemma 3 variants that are trained for performance
Open-weight, large-scale hybrid-attention reasoning model
Open-source framework for intelligent speech interaction
The official PyTorch implementation of Google's Gemma models
Pokee Deep Research Model Open Source Repo
Implementation of the Surya Foundation Model for Heliophysics
A 0.1B Omni model trained from scratch
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
High-Fidelity and Controllable Generation of Textured 3D Assets
Multi-modal large language model designed for audio understanding
Large Multimodal Models for Video Understanding and Editing
Long-form streaming TTS system for multi-speaker dialogue generation
Generate Any 3D Scene in Seconds
This repository contains the official implementation of FastVLM
Memory-efficient and performant finetuning of Mistral's models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI