MOSS‑TTS Family open‑source speech and sound generation model
Repo for SeedVR2 & SeedVR
Qwen3.6 is the large language model series developed by Qwen team
High-Fidelity and Controllable Generation of Textured 3D Assets
4M: Massively Multimodal Masked Modeling
A SOTA open-source image editing model
The official PyTorch implementation of Google's Gemma models
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Claude Code image, a one-stop open source transit service
GLM-5: From Vibe Coding to Agentic Engineering
Repo of Qwen2-Audio chat & pretrained large audio language model
Collection of Gemma 3 variants that are trained for performance
MiniMax-M2, a model built for Max coding & agentic workflows
Access to Anthropic's safety-first language model APIs
FlashMLA: Efficient Multi-head Latent Attention Kernels
MiniMax M2.1, a SOTA model for real-world dev & agents.
Global weather forecasting model using graph neural networks and JAX
A 0.1B Omni model trained from scratch
Long-form streaming TTS system for multi-speaker dialogue generation
Block Diffusion for Ultra-Fast Speculative Decoding
OCR expert VLM powered by Hunyuan's native multimodal architecture
Implementation of the Surya Foundation Model for Heliophysics
Instructions on how to use the Realtime API on Microcontrollers
New set of lightweight state-of-the-art, open foundation models
Inference script for Oasis 500M