Open-Source Python3 tool for recognizing layouts, tables, and math
Library for OCR-related tasks powered by Deep Learning
Open Source Document Management System for Digital Archives
A modular graph-based Retrieval-Augmented Generation (RAG) system
A lightweight approach to removing Google web service dependency
Awesome multilingual OCR toolkits based on PaddlePaddle
Re-editable LaTeX/ typst graphics for Inkscape
Label Studio is a multi-type data labeling and annotation tool
Speech recognition module for Python
Industrial-strength Natural Language Processing (NLP)
Parse files for optimal RAG
⚡ Building applications with LLMs through composability ⚡
Flet enables developers to easily build realtime web and mobile apps
An easy-to-use backup tool for GNU Linux using rsync in the back
A pure-python PDF library capable of splitting, merging, cropping
Open source personal AI Assistant for Linux, Windows and Mac
Agent S: an open agentic framework that uses computers like a human
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Extract one time password (OTP) secrets from QR codes
Crowdsourcing platform for full text transcription and tagging
Web UI for your scripts with execution management
Stable Diffusion built-in to Blender
Open source machine learning framework to automate text conversations
State-of-the-art diffusion models for image and audio generation
Transforming Multimodal Content into Captivating Multilingual Audio