Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
G-P - Global EOR Solution
Companies searching for an Employer of Record solution to mitigate risk and manage compliance, taxes, benefits, and payroll anywhere in the world
With G-P's industry-leading Employer of Record (EOR) and Contractor solutions, you can hire, onboard and manage teams in 180+ countries — quickly and compliantly — without setting up entities.
Peters Backup is a program for backing up your important data files on
to diskette, zip drive, fixed disk or CD/RW. It uses an extremely
efficient compression algorithm. It keeps track of all versions of your
files in full and incremental backups.
Building a generic Digital Object Management System for libraries based on fedora-commons, including enforced datamodels, Trusted Digital Repository capabilities, and librarian-friendly user interface based on the datamodel, end user access in Summa
This project has moved to SBForge and github. More info on
http://sbforge.org/display/DOMS and http://github.com/Statsbiblioteket
The archive-crawler project is building Heritrix: a flexible, extensible, robust, and scalable web crawler capable of fetching, archiving, and analyzing the full diversity and breadth of internet-accesible content.
A simple document management system (DMS).
Documents and associated datas are stored in a single file.
Keep all your documents in your pocket on your USB stick.
Complete storage and backup system with web administration that integrates many useful tools. Support logical volumes, snapshots, CIFS and NFS shares, tape drives, libraries, VTL's, iSCSI devices, etc. Manage users and roles, and easy restore.
Simple utility to synchronize the content of two file system directories. I've never had problems using it but it's not guaranteed to work perfectly: this program comes with ABSOLUTELY NO WARRANTY, USE AT YOUR OWN RISK!
Notenschrank ist ein kleines Tool, welches einen Bestand an Orchester- oder Chornoten verwaltet und somit schnellen Zugriff, sichere Archivierung und einfache Verwaltung bietet.Benutzen Sie dieses Programm nicht für kopierrechtlich geschützte Noten
A cross-platform file and folder mass renamer. Easy to use and to learn. Features: read new names from text files; insert, replace and remove strings; numbering; case-change; real-time preview; interactive tutorial; order files in different ways.
DAT Freight and Analytics operates DAT One truckload freight marketplace
DAT Freight & Analytics operates DAT One, North America’s largest truckload freight marketplace; DAT iQ, the industry’s leading freight data analytics service; and Trucker Tools, the leader in load visibility. Shippers, transportation brokers, carriers, news organizations, and industry analysts rely on DAT for market trends and data insights, informed by nearly 700,000 daily load posts and a database exceeding $1 trillion in freight market transactions. Founded in 1978, DAT is a business unit of Roper Technologies (Nasdaq: ROP), a constituent of the Nasdaq 100, S&P 500, and Fortune 1000. Headquartered in Beaverton, Ore., DAT continues to set the standard for innovation in the trucking and logistics industry.
jSCSI is a cross-platform Java implementation of an iSCSI initiator. jSCSI thus enables Java to directly access and serve block devices over the Internet by natively speaking the iSCSI protocol. The initiator supports software RAID.
THIS PROJECT HAS MOVED TO GITHUB!!!
SynchroMike is a two-way synchronisation program which allows the user to synchronize two directories. It comes with a handy user interface which displays differences between both storage locations as a tree.
Update: This page is deprecated, find the new program at <a href="http://syncarus.net">syncarus.net</a>
Remote control your home theater PC from across the room
Use the Armchair File Manager to control your Windows home theater PC using its remote control. Perform light-duty computing tasks from across the room without a keyboard or mouse. Armchair works best with a PC connected to a widescreen television.
Java backup tool providing file level data deduplication: If a file is stored, it is never stored a second time unless the file's content changes. Instead, a reference to the stored data is created. This holds true even if the file is moved or renamed.
A Java EE 6 reference application leveraging Apache TIKA, Hibernate Spatial and Hibernate Lucene to index and retrieve arbitrary data including POJO's (via JSON and XML) and all popular file/content types.
This is on ongoing research and development project and will be an attempt to bridge NoSQL/Document Database concepts with some traditional RDMS traits.
Intelligent File Synchronization Program. Synchronisiert zwei Verzeichnisse ohne dass die Richtung angegeben werden muss. Es erkennt Veränderungen automatisch und hält so beide Datenbestände auf einem aktuellen Stand.
Quotero was an open source Document Management System (DMS) developed in java. It provides basic document management features and advanced collaborative features such as version control, comments, workflow, etc.
Quotero becomes Kimios.
Please visit https://sourceforge.net/projects/kimios/
Web Site:
http://www.kimios.com
Issues:
http://issues.kimios.com
Wiki:
http://wiki.kimios.com
The result of this project will be a free (GPL) available file synchronisation tool. It enables the user to sync a folder for making backups or to have files on multiple computers e.g. on a desktop computer and on a laptop.
Purpose is to render allmost all mails (body + attachments) into one or more PDFs. Focus was not set on a "sexy" rendition but on a rendition at all. Mails are read through imap or from a directory, renderer and saved as PDF in an output directory
TreeTank is an easy-to-use framework which allows users to work with tree structures. Modifications are thereby available in a versioned manner which enables not only navigation based in the tree but also on the time-axis.
NOTE: THIS PROJECT HAS MOVED TO GITHUB!