Connect MATLAB to LLM APIs, including OpenAI® Chat Completions
A Powerful Native Multimodal Model for Image Generation
Run Stable Diffusion on Mac natively
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Grab the color palette from an image using just Javascript
An image processing library written entirely in JavaScript for Node
Content aware image resize library
Interactive video and image annotation tool for computer vision
The most powerful and modular diffusion model GUI, api and backend
Label Studio is a multi-type data labeling and annotation tool
Wan2.1: Open and Advanced Large-Scale Video Generative Model
State-of-the-art diffusion models for image and audio generation
Guiding Instruction-based Image Editing via Multimodal Large Language
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Chat & pretrained large vision language model
Easily turn large sets of image urls to an image dataset
A neural network that transforms a design mock-up into static websites
Stable Diffusion with Core ML on Apple Silicon
Generating Immersive, Explorable, and Interactive 3D Worlds
Run Stable Diffusion on Mac natively
Code for running inference with the SAM 3D Body Model 3DB
Cross platform .Net wrapper to the OpenCV image processing library
Awesome multilingual OCR toolkits based on PaddlePaddle
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Open Source Computer Vision Library