A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A machine learning software for extracting information
ExtractThinker is a Document Intelligence library for LLMs
Structured data extraction and instruction calling with ML, LLM
A library for audio and music analysis, feature extraction
Document (PDF, Word, PPTX ...) extraction and parse API
ContextGem: Effortless LLM extraction from documents
Extract and convert data from any document, images, pdfs, word doc
A high-quality tool for convert PDF to Markdown and JSON
Fast and efficient unstructured data extraction
Crawl a website starting from a URL, find relevant pages
Open source NLP guide with models, methods, and real use cases
No-code LLM Platform to launch APIs and ETL Pipelines
JavaScript OCR and text extraction for images and PDFs
Make websites accessible for AI agents
Document content and metadata extraction microservice
Python Audio Analysis Library: Feature Extraction, Classification
A Simple and Universal Swarm Intelligence Engine
Skill for installing full networking capabilities for Claude Code
Official Vectorize MCP Server
Model Context Protocol server that integrates AgentQL's data
A fast, helpful, and open-source document parser
End-to-end pipeline converting generative videos
Did you say you like data?
Synthetic data curation for post-training and data extraction