The Clay Foundation Model - An open source AI model and interface
Bidirectional token-classification model for identifiable info
DeepSeek Coder: Let the Code Write Itself
High-resolution models for human tasks
Controllable & emotion-expressive zero-shot TTS
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
An Efficient Agentic Model for Computer Use
Project Lyra: Open Generative 3D World Models
Collection of Gemma 3 variants that are trained for performance
Python SDK for Claude Agent
Tool for exploring and debugging transformer model behaviors
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
ChatGPT interface with better UI
Easy Docker setup for Stable Diffusion with user-friendly UI
ICLR2024 Spotlight: curation/training code, metadata, distribution
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Open-source large language model family from Tencent Hunyuan
Stable Diffusion with Core ML on Apple Silicon
Qwen3-omni is a natively end-to-end, omni-modal LLM
A Systematic Framework for Interactive World Modeling
Code for running inference with the SAM 3D Body Model 3DB
An experimental version of DeepSeek model