Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Total Network Visibility for Network Engineers and IT Managers
Network monitoring and troubleshooting is hard. TotalView makes it easy.
This means every device on your network, and every interface on every device is automatically analyzed for performance, errors, QoS, and configuration.
Open Force QST is a Query and Schema Tool for Salesforce. View and keep historical records of your schema. Compare schema with history to find changes. Query your Salesforce instance using SOQL and display results. Create reports from saved Queries.
A generic SQL driven data audit tool for detecting differences between any JDBC accessible database tables and other data sources. Platform independent. It's a unix like diff for databases. Produces key values with the differing column name and data
The MCAS Project ( Metrics Correlation and Analysis service ) provides integral solution for system operators or VO users to uniformly access, transform and represent disjoint metrics data generated by distributed middle ware or user services.
PanBI is a collection of analytics modules for existing information systems. For each IS, it provides data extraction, transformation and loading logic coupled with an OLAP schema, delivering OLAP functionality to an unprecedented user base.
Enterprises and companies seeking a solution to manage all their procurement operations and processes
eBuyerAssist by Eyvo is a cloud-based procurement solution designed for businesses of all sizes and industries. Fully modular and scalable, it streamlines the entire procurement lifecycle—from requisition to fulfillment. The platform includes powerful tools for strategic sourcing, supplier management, warehouse operations, and contract oversight. Additional modules cover purchase orders, approval workflows, inventory and asset management, customer orders, budget control, cost accounting, invoice matching, vendor credit checks, and risk analysis. eBuyerAssist centralizes all procurement functions into a single, easy-to-use system—improving visibility, control, and efficiency across your organization. Whether you're aiming to reduce costs, enhance compliance, or align procurement with broader business goals, eBuyerAssist helps you get there faster, smarter, and with measurable results.
Advanced Analysis Services is a Business Intelligence (BI) tool to let users analyze OLAP sources like Pentaho, Mondrian, Microsoft Analysis Services (MSAS) or Hyperion, in an intuitive way, based on analysis templates like Paretto, Ranking and BCG
iSURF: An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains Supported by RFID Devices. iSURF (http://www.srdc.com.tr/isurf/) project is funded under ICT-2007-1.3 objective of FP7 of European Commission.
SplitPDF -SplitPDF.jar- is a ‘command-line driven’ Java-program, it splits a PDF-file by bookmarks into separated PDF’s. The bookmark is used as title for the newly created PDF. Extremely usefull and fast in a batch processing environment.
Ajanta is a Java API to solve linear programming problems. Linear programming is a method for determining a way to achieve the best outcome (such as maximum profit or lowest cost) in a given list of constraints.
Dun and Bradstreet Risk Analytics - Supplier Intelligence
Use an AI-powered solution for supply and compliance teams who want to mitigate costly supplier risks intelligently.
Risk, procurement, and compliance teams across the globe are under pressure to deal with geopolitical and business risks. Third-party risk exposure is impacted by rapidly scaling complexity in domestic and cross-border businesses, along with complicated and diverse regulations. It is extremely important for companies to proactively manage their third-party relationships. An AI-powered solution to mitigate and monitor counterparty risks on a continuous basis, this cutting-edge platform is powered by D&B’s Data Cloud with 520M+ Global Business Records and 2B+ yearly updates for third-party risk insights. With high-risk procurement alerts and multibillion match points, D&B Risk Analytics leverages best-in-class risk data to help drive informed decisions. Perform quick and comprehensive screening, using intelligent workflows. Receive ongoing alerts of key business indicators and disruptions.
The Pentaho Personalizer is based on the "dead" project PentahoLooker. The Pentaho Personalizer is sponsored by Lizacom and have a "commercialy" financed core of developers to ensure that the project do not "die" by lack of time or interest.
A group a subprojects for Data Cleaning projects, mainly as a step of a Data Mining Project. Visit www.datacleaningopensource.com to review our current applications or if you want to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
This project aims at providing a centralized system to store, retrieve, and execute BIRT reports in a server environment so that applications using BIRT reports do not have to sore the reports by themselves, and rely on this project for management.
weka outlier is an implementation of outlier detection algorithms for WEKA.
CODB (Class Outliers: Distance-Based) Algorithm is the first algorithm developed using WEKA framework.
SimpleWFS is a 100% Java servlet implementation of the draft OGC standard Web Feature Service (WFS) Simple. This is a Web service interface whose goal is to specify a common, minimal feature set for geospatial-temporal data access.
The project strives to provide an editor for the Business Motivation Model by the OMG. The Business Motivation Model specification provides a scheme or structure for developing, communicating, and managing business plans in an organized manner.
SOLAPLayers is a cartographic component which enables navigation in geospatial (Spatial OLAP or SOLAP) data cubes. It aims to be integrated into existing dashboard frameworks in order to produce interactive geo-analytical dashboards.
A Flex/java based system for presenting statistics over the net. Based on Mondrian backend. Able to handle large amounts of data, administration facilities not worth shouting about.
An extension package to Pentaho Data Integration, providing plug-ins. Steps/job entries can be downloaded independently and each comes with source code in the .zip file. All are licensed as LGPL or GPL.
The Soft N' tic Toolkit API (SNTTk API) is a Java API, designed to simplify access to Business Objects Enterprise Platform,by making an abstraction on BOE SDK. You may implements defined functionalities or obtain an existing implementation.
OO jDREW is an open source deductive reasoning engine for the RuleML web rule language. OO jDREW implements the object oriented extensions to RuleML which include: Order Sorted Types, Slots, and Object identifiers.
The GOLEM (Global Object Learning Enterprise Mediator) is a multi-module system for identity management purposes in an inter- and intra-university context. It supports eLearning applications in a very broad sense, i.e. including wikis and other web tools