Multimodal model achieving SOTA performance
Official implementation of DreamCraft3D
Weaviate is a cloud-native, modular, real-time vector search engine
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Swift community-driven package for OpenAI public API
BoofCV is an open source Java library for real-time computer vision.
An extensive node suite that enables ComfyUI to process 3D inputs
AI-data warehouse to enrich, transform and analyze unstructured data
DNN && GAN && NLP && BIG DATA
Enterprise AI platform for building, deploying, and managing apps
Export and Share your ChatGPT conversation history
Scientific Visualisation Made Easy
Document Image Parsing via Heterogeneous Anchor Prompting
Build cross-modal and multimodal applications on the cloud
Python SDK for the Computer Use model Lux, developed by OpenAGI
Large-language-model & vision-language-model based on Linear Attention
Astronomical object/structure detection from 1D and 2D data sets.
Discover pretrained models for deep learning in MATLAB
Easy-to-use deep learning framework with 3 key features
View, Extract & Remove AI generation metadata with right click
CLI tool for removing watermarks from AI-generated videos using frame-
A Customizable Image-to-Video Model based on HunyuanVideo
Local AI file organization with categorization and rename suggestions
Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs
Detect faces in an image