Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
Shared Questionnaire System(SQS) is a full-functional Optical Mark Reader(OMR) form processing system implemented in Java-Swing, XSL-FO and AJAX with straightforward GUIs. It is aimed at developing social platform to share knowledge about questionnaire.
Akshara Malayalam OCR is a project for the development of an OCR for printed and handwritten documents in Malayalam language. The inspiration is from similar OCR softwares in other languages etc.
Landlords, multi-family homes, manufactured home communities, single family homes, associations, commercial properties and mixed portfolios.
Rent Manager is award-winning property management software built for residential, commercial, and short-term-stay portfolios of any size. The program’s fully customizable features include a double-entry accounting system, maintenance management/scheduling, marketing integration, mobile applications, more than 450 insightful reports, and an API that integrates with the best PropTech providers on the market.
This project aims to create a single easy to use GUI wrapper for ghostscript and tesseract to allow scanned pdf to plain text or HTML for scanned documents.
It can be used to add or edit EXIF2.2 tags to existing JPEG image files. It is particularly useful in storing the exposure details for a photo scanned out of films. It is now a stand alone application implemented in PHP-GTK. Just unzip the release packag
This project is about writing a Linux SANE backend for the USB color scanners HP3300c, HP3400c, HP4300c, Agfa SnapScan Touch and Trust Office Scan 19200
A simple GUI frontend for scanning documents into PDF format. Utilizes scanimage, ps2pdf, pnmflip, and pnmtops commands. Automatically detects scanners avaliable on system. Developed on Linux but might work on other platforms with some tweaking.
Scarse is a free (distributed under GPL) color calibration software package for
Linux and other Unices. Build and use ICC profiles on your Unix box! Custom
profiles can be generated from variety of calibration targets.
SANE (Scanner Access Now Easy) backend for the UMAX Astra 1220U (USB) scanner. This is a heavily modified adaptation of the original command driver written by Paul Mackerras.
Tifftool is a high-performance tool to clean scanned documents in preparation for onscreen display or for OCR. Features include skew correction, orientation correction, despeckle, page alignment, split pages and batch processing.
SANE backend and stand-alone driver for Canon CanoScan parallel scanners (FB320P, FB620P, FB330P, FB630P, N340P, and N640P). Please note FB310P is NOT currently supported, sorry. For USB model support go to http://canon-fb630u.sourceforge.net/