Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
DAT Freight and Analytics - DAT
DAT Freight & Analytics operates DAT One, North America’s largest truckload freight marketplace; DAT iQ, the industry’s leading freight data analytics service; and Trucker Tools, the leader in load visibility. Shippers, transportation brokers, carriers, news organizations, and industry analysts rely on DAT for market trends and data insights, informed by nearly 700,000 daily load posts and a database exceeding $1 trillion in freight market transactions. Founded in 1978, DAT is a business unit of Roper Technologies (Nasdaq: ROP), a constituent of the Nasdaq 100, S&P 500, and Fortune 1000. Headquartered in Beaverton, Ore., DAT continues to set the standard for innovation in the trucking and logistics industry.
Strong Email & Apache Log Analysis with Active Security Features
X-Itools (eXtended Internet Tools) is a suite of tools composed of several collaboration modules. The original project began in 1999 and was first published on SourceForge in 2001.
The X-Itools e-mail management module (log analysis) was started in 2004 using Web 1.0 technologies (private SVN server).
X-Itools development restarted in 2011 around a single module: the e-mail management module (log analysis). It is now based on Web 2.0 technologies (ExtJS 4.1); development restarted because of a particular...
ISPMan is a system to design massive ISPs using LDAP as the backend.
ISPMan provides a web front end and a command-line interface to create virtual domains and manage users, DNS information, email information, and httpd setup data for these vhosts.
AWStats Enterprise Manager is a tool for managing awstats configuration creation and logfile processing, in a multi-server environment. This script is designed to pull all the webserver logs, for every server, and parse them with awstats.
Supply chain managers, executives, and businesses seeking AI-powered solutions to optimize planning, operations, and decision-making across the supply
Logility is a market-leading provider of AI-first supply chain management solutions engineered to help organizations build sustainable digital supply chains that improve people’s lives and the world we live in. The company’s approach is designed to reimagine supply chain planning by shifting away from traditional “what happened” processes to an AI-driven strategy that combines the power of humans and machines to predict and be ready for what’s coming. Logility’s fully integrated, end-to-end platform helps clients know faster, turn uncertainty into opportunity, and transform the supply chain from a cost center to an engine for growth.
The development goal for connFide is to create an affordable broadband connection monitoring appliance using inexpensive hardware and open source firmware/software. This device is called the sensor box for quick reference. // DE: see [Wiki]/Summary
fwblocker is a script that parses syslog files for SSH, pure-ftpd, and iptables entries. It generates statistics, but its main feature is to lock out IP addresses that used a wrong username/password to log into your SSH or FTP server.
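The log-parsing idea behind a tool like fwblocker can be sketched as follows. This is not fwblocker's actual code: the function names, the threshold, and the regular expression are illustrative assumptions, and the pattern assumes the common OpenSSH "Failed password" message shape, which varies between systems. The sketch only counts failures per address and reports candidates; the actual lock-out step (for example, a firewall rule) is left out.

```python
import re
from collections import Counter

# Assumed OpenSSH syslog message shape; real log formats differ by distribution.
FAILED_LOGIN = re.compile(
    r"sshd\[\d+\]: Failed password for (?:invalid user )?(\S+) "
    r"from (\d{1,3}(?:\.\d{1,3}){3})"
)


def count_failed_logins(lines):
    """Return a Counter mapping source IP address to failed SSH login count."""
    counts = Counter()
    for line in lines:
        match = FAILED_LOGIN.search(line)
        if match:
            counts[match.group(2)] += 1
    return counts


def addresses_to_block(counts, threshold=3):
    """Return the sorted IPs with at least `threshold` failures (hypothetical policy)."""
    return sorted(ip for ip, n in counts.items() if n >= threshold)


if __name__ == "__main__":
    import sys

    # Usage: python failed_logins.py < /var/log/auth.log  (log path is an assumption)
    for ip in addresses_to_block(count_failed_logins(sys.stdin)):
        print(ip)
```

Keeping the counting separate from the blocking decision makes it possible to print statistics without changing any firewall state.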
PyIDS is an intrusion detection system that aims to give administrators concise information about some parts of the system, i.e., filesystem checksums, unknown connections to the machine, access control lists of special files, log revision...
FAHWebMon is a web-based log analyzer for Folding@Home Diskless Folding Farms (F@HF). It lets the administrator of such a system see the status of individual work nodes in a given farm.
The Most Powerful Software Platform for EHSQ and ESG Management
Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.
Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.
Vistigator is a website analyzer that allows CEOs, CTOs, and users to view information about their website. Vistigator is integrated with Apache 2 HTTP Server, allowing it to display website stats about specific IP addresses graphically and easily.
A statistical view of the recorded activity on a Honeynet. A mechanism for a honeynet to present some information about its findings over the web. This is done by a statistical analysis on the inbound firewall logs recorded by the honeynet's firewall.
RJStats assists in network and host monitoring by creating many graphs of your servers using rrdtool. These graphs can be viewed using a web browser in any combination you would like to see them.
Scripts written in PHP/MySQL/bash that provide control over the volume of traffic downloaded by users through a squid proxy server. Users are identified by IP address or computer name. Current version of squid-traffic contains an installer that sim
Green Screen: A Linux-based advanced syslog server for Juniper NetScreen firewalls. It can be expanded later to support other products. It can capture syslog messages, parse them, and store them in a MySQL database. A web GUI is also included.
lease-parser is a simple daemon that records the lease state changes of an ISC DHCP server to a database for historical reference. The data can be searched via a web search form that is provided with the tool.
jECTS is a Java project that focuses on some aspects of ECTS (European Credit Transfer System), mainly the translation of local grades into ECTS grades and the generation of a ToR (Transcript of Records).
Web Traffic Analysis Software (or counter) supporting all known SQL databases (or XML). Easy install/upgrade, advanced user recognition techniques, high usability. Tracks users via: a) Server Logs, b) PHP inc., c) Web Beacons (JavaScript)
Rav Antivirus Log Analysis Kit is a collection of scripts that parse the RAV logs and insert the data into a database. Also included is a PHP front end that displays this information.
A web-based system for reporting on web server log files.
Using Postgres DB
Java servlets
Uses a web server and Java runner of your choosing. (Originally Written for Apache / Servletexec)
Allows custom reports.
Timing of reports and
IPChains Logger aims to provide a useful utility for tracking bandwidth usage from workstations behind a firewall.
It works well for masqueraded machines.
ipac is an IP accounting package for Linux. It collects, summarizes, and nicely displays IP accounting data. The output of ipac can be a simple ASCII table or graph images.
chill is a heavily module-based web application with a core supporting many features. You can write your own modules for... everything. Native modules for webmail, firewall/router administration, and server administration are planned.