Contexts Optical Compression
Recovering the Visual Space from Any Views
Diffusion Transformer with Fine-Grained Chinese Understanding
Sharp Monocular Metric Depth in Less Than a Second
Multimodal model achieving SOTA performance
Official implementation of DreamCraft3D
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Large-language-model & vision-language-model based on Linear Attention
AI-powered tool to quickly remove watermarks from images flawlessly
AI Suite for upscaling, interpolating & restoring images/videos
Detect faces in an image
Small 3B-base multimodal model ideal for custom AI on edge hardware
Omnimodal AI model for agents, coding, and long-context tasks
Compact 8B multimodal instruct model optimized for edge deployment
Efficient 14B multimodal instruct model with edge deployment and FP8