Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Dun and Bradstreet Risk Analytics - Supplier Intelligence
Use an AI-powered solution for supply and compliance teams who want to mitigate costly supplier risks intelligently.
Risk, procurement, and compliance teams across the globe are under pressure to deal with geopolitical and business risks. Third-party risk exposure is impacted by rapidly scaling complexity in domestic and cross-border businesses, along with complicated and diverse regulations. It is extremely important for companies to proactively manage their third-party relationships. An AI-powered solution to mitigate and monitor counterparty risks on a continuous basis, this cutting-edge platform is powered by D&B’s Data Cloud with 520M+ Global Business Records and 2B+ yearly updates for third-party risk insights. With high-risk procurement alerts and multibillion match points, D&B Risk Analytics leverages best-in-class risk data to help drive informed decisions. Perform quick and comprehensive screening, using intelligent workflows. Receive ongoing alerts of key business indicators and disruptions.
This project has two parts. The first one is a hidden counter (using the 1x1 transparent pixel GIF) which registers into a MySQL database all remote IP addresses (whether IPv4 or IPv6), it tries very hard to use IPv6. The second part shows stats.
This project is devoted to analyze the log lines from the Wikipedia Squid servers by parsing and filtering their information elements according to the directives specified in an XML file. Resulting info is stored in a MySql db for further analysis.
Slogbase is a lightweight database attached to objects in Second Life that provides the same functionality as Apache Web Server logs. It uses ordinary LSL code in Second Life to send sensor data to an external PHP/MySQL application.
Slickster provides tools for viewing and analyzing Log4J logs in a multi-server distributed environment.
Log4J logs are centralized into a database.
Logs are viewed through a browser. Logs can be filtered and monitored in real-time.
Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software.
Banks, lending institutions
Founded in 2004, axefinance is a global market-leading software provider focused on credit risk automation for lenders looking to provide an efficient, competitive, and seamless omnichannel financing journey for all client segments (FI, Retail, Commercial, and Corporate.)
STARCAT (STAtiscic Reporting, Collecting and Analysing Tool) is a toolkit to gather statistics out of several sources (for example, rrd-files, logfiles, DBMS,...) and create nice-looking reports out of them using XML-templates.
The Sparten Log Viewer is designed to allow users to view log files online through a web browser. It is currently based on php and runs on any platform that support standard unix log format, php, and tail.
Use python to parse privoxy log in clf format and refer to wot ratings of the site and then dump the data in a sqlite database for statistics generation later.
This is a (hopefully temporary) fork of the great web-site statistics and analytic package JAWStats (www.jawstats.com) to add summary page and mail statistics to this wonderful peace of software
Cislog is a syslog server that specifically targets Cisco devices and their implementation of the syslog protocol. It consists of a Django web interface and a syslog listener with a database between them.
IPHPLog (IP PHP Log) is a simple software written in PHP, JS and SQL that runs over a MySQL database manager system . IPHPLog can log the information of your visitors and organize that on a data base with GeoIP.
Webstats Solr is an attempt to make Apache Access log easier to Data Mine. By adding a powerful Search Engine (SOLR) as a Backend and using Java Script and HTML and maybe PHP I hope to out date AWStats.
phpMyVisites is a Web traffic analyzer with very detailed reports and advanced graphics. It is not an Apache log analysis tool. phpMyVisites creates its own logs, and it allows access to more complete statistics. phpMyVisites provides information about v
LifeManager will allow you to log and manage your life in such areas as: Physical exercises, Alimentation, Accounting, Agenda, etc.
This is going to be web-based, but in the future may be available as an end user application.
PHP Daemon & Frontend for Squid Web Proxy Server.
Provides both User Based bandwidth Monitoring, and Squid Logfile Analysys. Allows users to generate graphs of traffic usage, based on many different factors.
Gathers data from vpopmail domain and user usage, stores in mysql, and displays a list by user and domain, current quota, quota usage, last time/date of auth, last auth ip, aliases pointing at the address, and aliases pointing to external domains.
A Java based web server log file analyzer. Gives you detailed knowledge of what your visitors did whilst visiting your site. Provides information on human visitors, searches, search engines, site areas, pages, visit paths and more...