Discover pretrained models for deep learning in MATLAB
Build Vision Agents quickly with any model or video provider
Contexts Optical Compression
Recovering the Visual Space from Any Views
Node.js example app from the OpenAI API quickstart tutorial
Diffusion Transformer with Fine-Grained Chinese Understanding
Official implementation of DreamCraft3D
Multimodal model achieving SOTA performance
Sharp Monocular Metric Depth in Less Than a Second
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Document Image Parsing via Heterogeneous Anchor Prompting”
Large-language-model & vision-language-model based on Linear Attention
Python SDK for the Computer Use model Lux, developed by OpenAGI
Scientific Visualisation Made Easy
Astronomical object/structure detection from 1D and 2D data sets.
ADAMS is a workflow engine for building complex knowledge workflows.
A Customizable Image-to-Video Model based on HunyuanVideo
Detect faces in an image
Common Resource Grep
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
A python module for hyperspectral image processing
PHP SDK for processing phone calls and SMS through the VoiceShot API.
.NET SDK for processing phone calls and SMS through the VoiceShot API.
ASP SDK for processing phone calls and SMS through the VoiceShot API.