Fast, flexible and easy-to-use probabilistic modelling in Python
MoBA: Mixture of Block Attention for Long-Context LLMs
Qwen3.5 is the large language model series developed by the Qwen team
Wan2.2: Open and Advanced Large-Scale Video Generative Model
A theoretical reconstruction of the Claude Mythos architecture
Qwen3.6 is the large language model series developed by the Qwen team
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Open-source, high-performance AI model with advanced reasoning
A kernel library written in tilelang
Running a big model on a small laptop
A Next-Generation Training Engine Built for Ultra-Large MoE Models
A Powerful Native Multimodal Model for Image Generation
System Level Intelligent Router for Mixture-of-Models at Cloud
From nobody to big model (LLM) hero
Collection of links for free stock photography, video and illustration
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Qwen3-Coder is the code version of Qwen3
Towards self-verifiable mathematical reasoning
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Fully automatic censorship removal for language models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Kimi K2 is the large language model series developed by Moonshot AI
157 models, 30 providers, one command to find what runs on hardware
Open-weight, large-scale hybrid-attention reasoning model