Modular AI image and video generation web UI with extensible tools
Easily compute clip embeddings and build a clip retrieval system
Recovering the Visual Space from Any Views
Contexts Optical Compression
Node.js example app from the OpenAI API quickstart tutorial
Implementing large models into scenario-based applications
Diffusion Transformer with Fine-Grained Chinese Understanding
An LLM-based presentation generation platform
Multi-user UI tool for managing and running Stable Diffusion workflows
Full stack framework for building cross-platform mobile AI apps
Sharp Monocular Metric Depth in Less Than a Second
HivisionIDPhotos: a lightweight and efficient AI ID photo tool
Spring AI Alibaba examples for building and testing AI apps
GeoAI: Artificial Intelligence for Geospatial Data
Advanced AI Explainability for computer vision
Multimodal model achieving SOTA performance
Official implementation of DreamCraft3D
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
An extensive node suite that enables ComfyUI to process 3D inputs
DNN && GAN && NLP && BIG DATA
Enterprise AI platform for building, deploying, and managing apps
Export and Share your ChatGPT conversation history
Document Image Parsing via Heterogeneous Anchor Prompting
Python SDK for the Computer Use model Lux, developed by OpenAGI
Large-language-model & vision-language-model based on Linear Attention