Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Dun and Bradstreet Risk Analytics - Supplier Intelligence
Use an AI-powered solution for supply and compliance teams who want to mitigate costly supplier risks intelligently.
Risk, procurement, and compliance teams across the globe are under pressure to deal with geopolitical and business risks. Third-party risk exposure is impacted by rapidly scaling complexity in domestic and cross-border businesses, along with complicated and diverse regulations. It is extremely important for companies to proactively manage their third-party relationships. An AI-powered solution to mitigate and monitor counterparty risks on a continuous basis, this cutting-edge platform is powered by D&B’s Data Cloud with 520M+ Global Business Records and 2B+ yearly updates for third-party risk insights. With high-risk procurement alerts and multibillion match points, D&B Risk Analytics leverages best-in-class risk data to help drive informed decisions. Perform quick and comprehensive screening, using intelligent workflows. Receive ongoing alerts of key business indicators and disruptions.
Pattern recognition system written in Java.
Provide statistical informations about satellite and medical imagery; and also recognize symbols in a gaussian blur.
TM4 is a suite of applications for managing and analyzing microarray data. TM4 provides data storage and tracking, image analysis, normalization, data filtering, clustering and statistical analysis capabilities. Includes MADAM, Spotfinder, MIDAS, and MeV.
MEASJ, the Multiply Extensible Architecture Simulator in Java, is a Framework for creating Simulations and Emulations of von Neumann architecture based systems.
RISO: distributed, heterogeneous Bayesian belief networks. Belief network: a probability model defined on an acyclic directed graph; distributed: nodes can be on different hosts; and heterogeneous: allowing different types of conditional distributions.
For companies looking to automate their consolidation and financial statement function
The software is cloud based and automates complexities around consolidating and reporting for groups with multiple year ends, currencies and ERP systems with a slice and dice approach to reporting. While retaining the structure, control and validation needed in a financial reporting tool, we’ve managed to keep things flexible.
Track This! Hurricane Tracker is Java/Swing based project for weather enthusiasts and the like to help during Hurricane season. I have not seen a solid "free" hurricane tracker yet this is an important piece of software.
MMOpenGraph is a set of JAVA-Classes to represent graphs within java. It can load and save graphs to serialized or text-based files and analyze graphs to find shortest paths.
The Most Powerful Software Platform for EHSQ and ESG Management
Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.
Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.
Niamtool v2 is a graphical drawingtool for creating NIAM-diagrams (Niam is Nijssen Information Analyses Method) As far as we know is the only niam-drawingtool in existance (excluding Niamtool v1). It\'s written in java 1.4.
A Java package to read and manipulate graphs in the GXL (Graph eXchange Language) format, as well as providing a GXL graph editor. Note from admin: This project is discontinued, send me a message if you want to take over.
MODVLO7 provides an environment for exploring harmonic prolongations in any piece of tonal music. It imports Scala files, and saves Scala files for future changes or study, and for rendering MIDI files in just intonation using Scala.
The class libraries here provide infrastructure for creating simulations of low energy nuclear physics experiments, as well as some useful working programs that do simple simulations and analysis of experiments performed with magnetic spectrographs.
ZATreeX is a data manager with two control panels. The difference with conventional data managers is its ability to manage any tree structured data:you just have to implment a java interface. It is also able to extend it functionality (another interface)
Library made with the quantity/formatter pattern. (Currency/CurrencyFormat, Density/DensityFormat, Dimension/DimensionFormat, Distance/DistanceFormat, ElapsedTime/ElapsedTimeFormat, Temperature/TemperatureFormat, Volume/VolumeFormat, Weight/WeightFormat)
Kaph provides a portable (written in Java) method by which to display and guide users through multiple-subject research surveys. Configuration is handled via XML files. Support for a wide range of types of questions is provided.
Annotated Gel Markup Language is a simple markup language that is being proposed to markup data obtained by 2D gel electrophorosis.The goal of AGML is to enable proteomics research move into the browsing mode of searching through existing databases.
A Java modular system for inquiry people via internet or using PDA. The system lets analize the results via statistic methods and print reports and graphics.
Sequence studio main package provides classes and interfaces for various kinds of sequence alignment. Differently from regular expressions it computes similarity with two initially unknown strings. Project page provides code generating applet.
a cross-platform application to decode, search, browse, view, print, and export TLG/PHI BetaCode texts. Project is currently being ported from wxWindows to Java. (For more info, see the project homepage at http://wxtlg.sourceforge.net)
GeNetDB, contraction of Genetic Network Database, is a bioinformatic platform destined to the study of genetic regulatory networks. It contains in one place the data and the way to study them, providing the user an access to all tools needed for his work
A Java-based client-server architecture for processing network intrusion detection data. The server receives XML alerts from Snort sensors buffers them for review by clients. The console provides a real-time view of IDS activity.