Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Run applications fast and securely in a fully managed environment
Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.
Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
SemaRule Navigator is an Integrated Suite of Open-Source and Free-License Software, placing Semantic and Text Analysis Technologies in the toolbox of Researchers, Students, and Enterprises.
The name of this project is DuruBI. It is Enterprise Reporting Tool allows DB(Data Base) and OLAP(Online analytical processing) and DM(Data Mining) to query and reporting from various data sources.
Sql Parallel Executer is an open source software making the user perform Data Warehousing tasks with almost any database thanks to the use of ODBC drivers.
Built keeping in mind the highest flexibility for the final user it offers several features: Scheduling Script execution, Parameter Substitution, Parallel Execution and Script Dependency.
Designed for users without special Data Warehousing tools available, it can anyway suite most advanced users providing tools for exploiting...
OpenEphyra is an open framework for question answering (QA). It retrieves answers to natural language questions from the Web and other sources. Visit http://www.ephyra.info/ for more details and information on joining this open research initiative.
Caller ID Reputation provides the most comprehensive view of your caller ID scores across all carriers
Instantly identify flagged caller IDs and decrease flags by up to 95% your first month.
Keep your agents on the phone with increased connection rates by monitoring your phone number reputation across all major carriers and call blocking apps.
GeoMondrian is a "spatially-enabled" version of Mondrian. GeoMondrian brings to the Mondrian OLAP server what PostGIS brings to the PostgreSQL DBMS, i.e. a consistent and powerful support for geospatial data. It also provides geo-extensions to MDX.
ActiveInsight provides real-time detection and reaction to events and patterns. It is a platform that enables the detection of meaningful events within multiple, high frequency, event streams.
DISMOD Core Open Source Project: DISMOD Core is the core library of DISMOD, an SCO Application developed by Fraunhofer IML. DISMOD has been used over years by our specialists to solve optimization problems inside the transportation domain.
Airlock Digital - Application Control (Allowlisting) Made Simple
Airlock Digital delivers an easy-to-manage and scalable application control solution to protect endpoints with confidence.
For organizations seeking the most effective way to prevent malware and ransomware in their environments. It has been designed to provide scalable, efficient endpoint security for organizations with even the most diverse architectures and rigorous compliance requirements. Built by practitioners for the world’s largest and most secure organizations, Airlock Digital delivers precision Application Control & Allowlisting for the modern enterprise.
XIForge is a team of IT volunteer to explore new free open source technology framework and platform. We focus Pentaho and OpenBravo ERP. Our current hosted project includes Pentaho Data Integration Parse JSON String plugin. Team founder is Reid Lai.
easyDE is an Enterprise Business Intelligence platform that facilitates timely and effective business decision for companies to gain competitive advantage. Allow a wide range of end-users to quickly deploy rich analyses with a single integrated product.
Data mining tool for sequences (e.g. trajectories on a map, visited web pages, etc.) that creates a succinct description of the sequences, given a taxonomy (e.g. regions and sub-regions in the map, categories and sub-categories of pages, etc.).
The aim of ALIVE is to develop new approaches to the engineering of flexible, adaptable distributed service-oriented systems based on the adaptation of social coordination and organisation mechanisms.
Open Force QST is a Query and Schema Tool for Salesforce. View and keep historical records of your schema. Compare schema with history to find changes. Query your Salesforce instance using SOQL and display results. Create reports from saved Queries.
A generic SQL driven data audit tool for detecting differences between any JDBC accessible database tables and other data sources. Platform independent. It's a unix like diff for databases. Produces key values with the differing column name and data
The MCAS Project ( Metrics Correlation and Analysis service ) provides integral solution for system operators or VO users to uniformly access, transform and represent disjoint metrics data generated by distributed middle ware or user services.
PanBI is a collection of analytics modules for existing information systems. For each IS, it provides data extraction, transformation and loading logic coupled with an OLAP schema, delivering OLAP functionality to an unprecedented user base.
Advanced Analysis Services is a Business Intelligence (BI) tool to let users analyze OLAP sources like Pentaho, Mondrian, Microsoft Analysis Services (MSAS) or Hyperion, in an intuitive way, based on analysis templates like Paretto, Ranking and BCG
The SWING Dashboard displays your KPI which are updated daily. User friendly and visually appealing, it allows a proactive monitoring of your organization’s activities and to make appropriate decisions quickly. feel free to give us your opinion
iSURF: An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains Supported by RFID Devices. iSURF (http://www.srdc.com.tr/isurf/) project is funded under ICT-2007-1.3 objective of FP7 of European Commission.
SplitPDF -SplitPDF.jar- is a ‘command-line driven’ Java-program, it splits a PDF-file by bookmarks into separated PDF’s. The bookmark is used as title for the newly created PDF. Extremely usefull and fast in a batch processing environment.
Ajanta is a Java API to solve linear programming problems. Linear programming is a method for determining a way to achieve the best outcome (such as maximum profit or lowest cost) in a given list of constraints.
The Pentaho Personalizer is based on the "dead" project PentahoLooker. The Pentaho Personalizer is sponsored by Lizacom and have a "commercialy" financed core of developers to ensure that the project do not "die" by lack of time or interest.
AMB New Generation Data Empowerment - offers a comprehensive approach to data governance needs with ground breaking features to locate, identify, discover, manage and protect your overall data infrastructure. Repeatable Process/Exposed Repository.