Machine learning software for extracting information
ExtractThinker is a Document Intelligence library for LLMs
ContextGem: Effortless LLM extraction from documents
Enhance any agent's browser use skill
A library for audio and music analysis, feature extraction
Structured data extraction and instruction calling with ML, LLM
Clean network diagrams, one-time setup, zero upkeep
An on-premises, OCR-free unstructured data extraction
Python Audio Analysis Library: Feature Extraction, Classification
Crawl a website starting from a URL, find relevant pages
Did you say you like data?
Open source web scraping system for automated data collection tasks
A Simple and Universal Swarm Intelligence Engine
Open source OSINT tool for gathering data on emails, phones, and IPs
A Python tool to help extract information from structured PDFs
Open-source platform for extracting structured data from documents
Python & command-line tool to gather text on the Web
Python tool for crawling and extracting structured data from news sites
From Paper to Presentation in One Click
AI-ready web crawler that extracts and structures website content
A library for interacting with the nhentai API
Archive of leaked AI system prompts and internal instruction sets
Synthetic data curation for post-training and data extraction
Research and application of technologies such as natural language processing
OSRFramework, the Open Sources Research Framework, is an AGPLv3+ project