Fast, flexible and easy to use probabilistic modelling in Python
MoBA: Mixture of Block Attention for Long-Context LLMs
Qwen3.5 is the large language model series developed by Qwen team
Wan2.2: Open and Advanced Large-Scale Video Generative Model
A theoretical reconstruction of the Claude Mythos architecture
Qwen3.6 is the large language model series developed by Qwen team
Open-source, high-performance AI model with advanced reasoning
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
A kernel library written in tilelang
Running a big model on a small laptop
A Powerful Native Multimodal Model for Image Generation
A Next-Generation Training Engine Built for Ultra-Large MoE Models
System Level Intelligent Router for Mixture-of-Models at Cloud
From nobody to big model (LLM) hero
Collection of links for free stock photography, video and Illustration
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Qwen3-Coder is the code version of Qwen3
Towards self-verifiable mathematical reasoning
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Fully automatic censorship removal for language models
Kimi K2 is the large language model series developed by Moonshot AI
157 models, 30 providers, one command to find what runs on hardware
Open-weight, large-scale hybrid-attention reasoning model