Qwen-Image is a powerful image generation foundation model
Revolutionizing Database Interactions with Private LLM Technology
Python SDK for Claude Agent
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
High-Resolution Image Synthesis with Latent Diffusion Models
A Powerful Native Multimodal Model for Image Generation
Phi-3.5 for Mac: Locally-run Vision and Language Models
A 0.1B Omni model trained from scratch
Recovering the Visual Space from Any Views
General-purpose image editing model that delivers high-fidelity
Open-Source Financial Large Language Models
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Code for running inference with the SAM 3D Body Model 3DB
Sharp Monocular Metric Depth in Less Than a Second
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Provides convenient access to the Anthropic REST API from any Python 3
Generating Immersive, Explorable, and Interactive 3D Worlds
CodeGeeX2: A More Powerful Multilingual Code Generation Model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Netease Youdao's open-source embedding and reranker models
Robust Speech Recognition Across Languages, Dialects
MOSS‑TTS Family open‑source speech and sound generation model
Video Object and Interaction Deletion
Foundation model for image generation