A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Machine learning software for extracting information
ExtractThinker is a Document Intelligence library for LLMs
Structured data extraction and instruction calling with ML, LLM
A library for audio and music analysis, feature extraction
Document (PDF, Word, PPTX ...) extraction and parse API
ContextGem: Effortless LLM extraction from documents
Extract and convert data from any document, images, PDFs, Word doc
A high-quality tool for converting PDF to Markdown and JSON
Crawl a website starting from a URL, find relevant pages
Fast and efficient unstructured data extraction
Open source NLP guide with models, methods, and real use cases
JavaScript OCR and text extraction for images and PDFs
Make websites accessible for AI agents
Document content and metadata extraction microservice
A Simple and Universal Swarm Intelligence Engine
Python Audio Analysis Library: Feature Extraction, Classification
Skill for installing full networking capabilities for Claude Code
Model Context Protocol server that integrates AgentQL's data
Official Vectorize MCP Server
A fast, helpful, and open-source document parser
End-to-end pipeline converting generative videos
Did you say you like data?
Synthetic data curation for post-training and data extraction
A powerful Model Context Protocol (MCP) server