MCP for Replicate Flux Model
The ultimate angle brackets parser library parsing HTML5, MathML, SVG
AI-data warehouse to enrich, transform and analyze unstructured data
ITTT is a Free tool designed to Scan and extract Text from Images.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Multimodal model achieving SOTA performance
All Algorithms implemented in Python
Swiss army knife of image processing
Segmentation models with pretrained backbones. PyTorch
Sharp Monocular Metric Depth in Less Than a Second
An awesome list of helpful resources for students learning MATLAB
Embed web technologies in applications
PDFCraft is a free, privacy-focused PDF toolkit
Basic Machine Learning Natural Language Processing Roadmap
Harmonized and Coherent Human Image Animation
Small program rich text component, supports rendering and editing html
BoofCV is an open source Java library for real-time computer vision.
Fast and Practical Image Manipulation Toolbox
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Kubernetes Native Edge Computing Framework (project under CNCF)
A language for fast, portable data-parallel computation
Refine and quantize messy AI pixel art into clean, perfect pixels
Document Image Parsing via Heterogeneous Anchor Prompting”
A collection of packages providing extra functionality for GNU Octave
Large-language-model & vision-language-model based on Linear Attention