C++ image processing and machine learning library with using of SIMD
OpenAI swift async text to image for SwiftUI app using OpenAI
Open Source Computer Vision Library
Client-side indecent content checking powered by TensorFlow.js
CLIP, Predict the most relevant text snippet given an image
A neural network that transforms a design mock-up into static websites
Stable Diffusion with Core ML on Apple Silicon
ArrayFire, a general purpose GPU library
Awesome multilingual OCR toolkits based on PaddlePaddle
Chinese and English multimodal conversational language model
The smallest, simplest JavaScript pixel-level image comparison library
RGBD video generation model conditioned on camera input
This repo contains the code for 1D tokenizer and generator
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Offline inference engine for art, real-time voice conversations
Fast image augmentation library and an easy-to-use wrapper
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Provides code for running inference with the SegmentAnything Model
A Customizable Image-to-Video Model based on HunyuanVideo
Java interface to OpenCV, FFmpeg, and more
Flutter-based cross-platform app integrating major AI models
Official MiniMax Model Context Protocol (MCP) server
Automates PWA asset generation and image declaration
Structure-from-Motion and Multi-View Stereo