GLM-4 series: Open Multilingual Multimodal Chat LMs
FAIR Sequence Modeling Toolkit 2
Stable Diffusion with Core ML on Apple Silicon
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
The Clay Foundation Model - An open source AI model and interface
Programmatic access to the AlphaGenome model
A Production-ready Reinforcement Learning AI Agent Library
Open-source large language model family from Tencent Hunyuan
A SOTA open-source image editing model
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Designed for text embedding and ranking tasks
Generating Immersive, Explorable, and Interactive 3D Worlds
GPT4V-level open-source multi-modal model based on Llama3-8B
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A Pragmatic VLA Foundation Model
Collection of Gemma 3 variants that are trained for performance
Open-source deep-learning framework
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Open-source framework for intelligent speech interaction
Phi-3.5 for Mac: Locally-run Vision and Language Models
Diversity-driven optimization and large-model reasoning ability
Pokee Deep Research Model Open Source Repo
Implementation of the Surya Foundation Model for Heliophysics