PyTorch code and models for the DINOv2 self-supervised learning method
Anthropic's educational courses
ChatGLM3 series: Open Bilingual Chat LLMs
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Official implementation of DreamCraft3D
A Customizable Image-to-Video Model based on HunyuanVideo
The Unified Machine Learning Framework
A Multi-Modal World Model for Reconstructing, Generating, and Simulating
Long-term memory OS for AI with structured recall and context awareness
SDK for building interactive UI components over MCP for AI tools
The official repository for ERNIE 4.5 and ERNIEKit
Build your own Cowork, AI Scientist and other SoTA Agents
Fast backend for long-term AI user memory via structured profiles
Official PyTorch Implementation
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Unified Multimodal Understanding and Generation Models
DeepMind model for tracking arbitrary points across videos & robotics
GLM-4-Voice | End-to-End Chinese-English Conversational Model
GPT4V-level open-source multi-modal model based on Llama3-8B
A Powerful Native Multimodal Model for Image Generation
Proofs, cases, concept supplements, and reference explanations
A library for scientific machine learning & physics-informed learning
One-click deployment (including offline integration package)
MARS5 speech model (TTS) from CAMB.AI
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph