A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Voice dialogue, role-playing, multi-topic discussion, picture creation
NLP Cloud serves high performance pre-trained or custom models
A Powerful Desktop Full-Text Search Engine, Like a Local Google
AI-based tool for removing hardsubs and text-like watermarks
Scientific Visualisation Made Easy
An AI assistant for everyone, powered by the Qwen series models
Award-winning modern data processing SDK in C++20
Detect faces in an image
GFPGAN aims at developing Practical Algorithms
Common Resource Grep
A Unified Toolkit for Deep Learning Based Document Image Analysis
A modern, web-based photo management server
Content aware image cropping
Typeface from Ming Dynasty woodblock printed books
DeepImageTranslator: a deep-learning utility for image translation
Deep learning gateway on Raspberry Pi and other edge devices
Real-time multi-person keypoint detection library for body, face, etc.
Run defined applications by detecting text in a captured screenshot
Learning Convolutional Neural Networks with Interactive Visualization
A collection of computer vision pre-trained models
Analysis Nuclei DAB (AND) Tool
Node.js bindings to OpenCV 3 and OpenCV 4
"Jieba" (Chinese for "to stutter") Chinese word segmentation