Instagram OSINT tool for gathering profile data and public posts
A high-quality tool for converting PDF to Markdown and JSON
PDF Parser for AI-ready data. Automate PDF accessibility
Machine learning software for extracting information
ContextGem: Effortless LLM extraction from documents
Fast and efficient unstructured data extraction
Download pictures (or videos) along with their captions
Python & command-line tool to gather text on the Web
Tool to help you collect, organize, annotate, cite, and share research
A versatile toolkit for PDF manipulation
A tool to simulate Amazon EC2 instance metadata
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
A distributed job server
A library for interacting with the nhentai API
Lightweight Go package to parse, analyze, and extract metadata
A self-hostable bookmark-everything app
Coomer downloader
Open-source OSINT tool for gathering data on emails, phones, and IPs
Copybara: A tool for transforming and moving code between repositories
Cross-platform GUI tool for downloading videos from Bilibili sites
CLI tool to extract (meta)data from PDF and manipulate PDF files
ExtractThinker is a Document Intelligence library for LLMs
Python tool for crawling and extracting structured data from news sites
Document content and metadata extraction microservice
Archive of leaked AI system prompts and internal instruction sets