iSURF: An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains Supported by RFID Devices. The iSURF project (http://www.srdc.com.tr/isurf/) is funded under the ICT-2007-1.3 objective of the European Commission's FP7.
SplitPDF (SplitPDF.jar) is a command-line Java program that splits a PDF file by its bookmarks into separate PDFs. Each bookmark is used as the title of the newly created PDF. Extremely useful and fast in a batch processing environment.
Ajanta is a Java API for solving linear programming problems. Linear programming is a method for determining how to achieve the best outcome (such as maximum profit or lowest cost) subject to a given list of constraints.
DOP Software’s mission is to streamline the processes of waste and recycling businesses by providing them with dynamic, comprehensive software and services that increase productivity and quality of performance.
The Pentaho Personalizer is based on the "dead" project PentahoLooker. It is sponsored by Lizacom and has a commercially financed core of developers to ensure that the project does not "die" from lack of time or interest.
AMB New Generation Data Empowerment offers a comprehensive approach to data governance needs, with groundbreaking features to locate, identify, discover, manage and protect your overall data infrastructure. Repeatable Process/Exposed Repository.
A group of subprojects for data cleaning, mainly as a step in a data mining project. Visit www.datacleaningopensource.com to review our current applications or to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
This project aims to provide a centralized system to store, retrieve, and execute BIRT reports in a server environment, so that applications using BIRT reports do not have to store the reports themselves and can rely on this project for management.
weka outlier is an implementation of outlier detection algorithms for WEKA.
The CODB (Class Outliers: Distance-Based) algorithm is the first algorithm developed using the WEKA framework.
SimpleWFS is a 100% Java servlet implementation of the draft OGC standard Web Feature Service (WFS) Simple. This is a Web service interface whose goal is to specify a common, minimal feature set for geospatial-temporal data access.
The project strives to provide an editor for the Business Motivation Model by the OMG. The Business Motivation Model specification provides a scheme or structure for developing, communicating, and managing business plans in an organized manner.
MetadataPortal is a web-based, open source metadata management solution for enterprise integration. For more details, see the blog entry at http://www.metadataportal.com/blog/
SOLAPLayers is a cartographic component which enables navigation in geospatial (Spatial OLAP or SOLAP) data cubes. It aims to be integrated into existing dashboard frameworks in order to produce interactive geo-analytical dashboards.
DB2RDF is a software tool that converts data from the relational data model to a semantic data model (in the form of RDF and RDFS). It provides a SPARQL endpoint for querying the converted data using the SPARQL query language.
A Flex/Java-based system for presenting statistics over the net, built on a Mondrian backend. Able to handle large amounts of data; administration facilities not worth shouting about.
A graphical editor based on the Visual Syntax of Semantics of Business Vocabulary and Rules (SBVR). SBVR VE is based on the Eclipse platform. For documentation, please refer to the OPAALS deliverable http://files.opaals.org/OPAALS/Year_3_Deliverables/WP10/D10.14.pdf
An extension package to Pentaho Data Integration, providing plug-ins. Steps/job entries can be downloaded independently and each comes with source code in the .zip file. All are licensed as LGPL or GPL.
The Soft N' tic Toolkit API (SNTTk API) is a Java API designed to simplify access to the Business Objects Enterprise Platform by providing an abstraction over the BOE SDK. You may implement the defined functionality yourself or obtain an existing implementation.
OO jDREW is an open source deductive reasoning engine for the RuleML web rule language. OO jDREW implements the object oriented extensions to RuleML which include: Order Sorted Types, Slots, and Object identifiers.
GOLEM (Global Object Learning Enterprise Mediator) is a multi-module system for identity management in an inter- and intra-university context. It supports eLearning applications in a very broad sense, including wikis and other web tools.
DASH is a FREE development platform that enables the rapid deployment of BI dashboards. It is bundled with the Apache Jetspeed Portal. See marvelit.com for more info.