A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Structured data extraction and instruction calling with ML and LLMs
No-code LLM Platform to launch APIs and ETL Pipelines
CLI tool to extract (meta)data from PDF and manipulate PDF files
ExtractThinker is a Document Intelligence library for LLMs
Zero-copy PDF text extraction library written in Zig
Document content and metadata extraction microservice
Open source NLP guide with models, methods, and real use cases
ContextGem: Effortless LLM extraction from documents
Document (PDF, Word, PPTX, ...) extraction and parsing API
Turn any technical book PDF into a Claude Code skill
A high-quality tool for converting PDF to Markdown and JSON
Python Audio Analysis Library: Feature Extraction, Classification
End-to-end pipeline converting generative videos
Python & command-line tool to gather text on the Web
AI-ready web crawler that extracts and structures website content
Web Robotic Process Automation Tool
A cross-platform GUI wrapper for yt-dlp written in PySide6
A Simple and Universal Swarm Intelligence Engine
Claude Code skill for generating production-quality SVG+PNG technical
PDF scientific paper translation with preserved formats
Download videos from almost any website
Asyncio-based Python framework for building fast web crawling spiders
NSFW Windows app to batch download images and videos
OCR software, free and offline