ContextGem: Effortless LLM extraction from documents
Fast and efficient unstructured data extraction
File Parser optimised for LLM Ingestion with no loss
PDF Parser for AI-ready data. Automate PDF accessibility
A versatile toolkit for PDF manipulation
A library for interacting with the nhentai API
A self-hostable bookmark-everything app
lightweight Go package to parse, analyze and extract metadata
A machine learning software for extracting information
A high-quality tool for convert PDF to Markdown and JSON
Assist in organizing your piles of documents
Document content and metadata extraction microservice
Tool to help you collect, organize, annotate, cite, and share research
Download pictures (or videos) along with their captions
Digital Life Kazik Open Source AI Skills Collection
Python & command-line tool to gather text on the Web
This is a public repository containing scrapers
Python library for scraping and analyzing online news articles easily
Open source OSINT tool for gathering data on emails, phones, and IPs
A distributed job server
Cross platform GUI tool for downloading videos from Bilibili sites
dude uncomplicated data extraction: A simple framework
Convert files and web content into clean, usable Markdown easily
Archive of leaked AI system prompts and internal instruction sets
Use LLMs and LLM Vision (OCR) to handle paperless-ngx