Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Revolutionizing Database Interactions with Private LLM Technology
Tooling for the Common Objects In 3D dataset
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
GLM-4 series: Open Multilingual Multimodal Chat LMs
Bidirectional token-classification model for identifiable info
Project Lyra: Open Generative 3D World Models
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Repo for SeedVR2 & SeedVR
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Open-source large language model family from Tencent Hunyuan
RGBD video generation model conditioned on camera input
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Qwen2.5-VL is the multimodal large language model series
Generate Any 3D Scene in Seconds
A series of math-specific large language models of our Qwen2 series
ChatGPT interface with better UI
26m function call model that runs on incredibly small devices
Qwen3-ASR is an open-source series of ASR models
Fast-stable-diffusion + DreamBooth
Collection of Gemma 3 variants that are trained for performance
Tool for exploring and debugging transformer model behaviors
Multimodal-Driven Architecture for Customized Video Generation