A group of subprojects for data cleaning, mainly as a step of a data mining project. Visit www.datacleaningopensource.com to review our current applications or to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
This project aims to provide a centralized system to store, retrieve, and execute BIRT reports in a server environment, so that applications using BIRT reports do not have to store the reports themselves and can rely on this project for management.
weka outlier is an implementation of outlier detection algorithms for WEKA.
The CODB (Class Outliers: Distance-Based) algorithm is the first algorithm developed using the WEKA framework.
SimpleWFS is a 100% Java servlet implementation of the draft OGC standard Web Feature Service (WFS) Simple. This is a Web service interface whose goal is to specify a common, minimal feature set for geospatial-temporal data access.
The project strives to provide an editor for the Business Motivation Model by the OMG. The Business Motivation Model specification provides a scheme or structure for developing, communicating, and managing business plans in an organized manner.
MetadataPortal Metadata Management software is a web-based open source metadata management solution for enterprise integration. For more details, see the blog at http://www.metadataportal.com/blog/
SOLAPLayers is a cartographic component which enables navigation in geospatial (Spatial OLAP or SOLAP) data cubes. It aims to be integrated into existing dashboard frameworks in order to produce interactive geo-analytical dashboards.
DB2RDF is a software tool that converts data from the relational data model to a semantic data model (in the form of RDF and RDFS). The converted data can be queried through a SPARQL endpoint using the SPARQL query language.
A Flex/Java-based system for presenting statistics over the net, built on a Mondrian backend. Able to handle large amounts of data; administration facilities not worth shouting about.
A graphical editor based on the visual syntax of Semantics of Business Vocabulary and Rules (SBVR). SBVR VE is based on the Eclipse platform. For documentation, please refer to OPAALS deliverable http://files.opaals.org/OPAALS/Year_3_Deliverables/WP10/D10.14.pdf
An extension package to Pentaho Data Integration, providing plug-ins. Steps/job entries can be downloaded independently and each comes with source code in the .zip file. All are licensed as LGPL or GPL.
The Soft N' tic Toolkit API (SNTTk API) is a Java API designed to simplify access to the Business Objects Enterprise platform by providing an abstraction over the BOE SDK. You may implement the defined functionalities or obtain an existing implementation.
OO jDREW is an open source deductive reasoning engine for the RuleML web rule language. OO jDREW implements the object oriented extensions to RuleML which include: Order Sorted Types, Slots, and Object identifiers.
The GOLEM (Global Object Learning Enterprise Mediator) is a multi-module system for identity management in an inter- and intra-university context. It supports eLearning applications in a very broad sense, i.e. including wikis and other web tools.
DASH is a FREE development platform that enables the rapid deployment of BI dashboards. It is bundled with the Apache Jetspeed Portal. See marvelit.com for more info
Single-click, real-time searching of both structured and unstructured data and information.
Simultaneous searching of structured data (databases) and unstructured data (documents) from within a web browser, desktop application, or application plugins.
This project is being developed under the supervision of the Sabanci University Industrial Engineering Dept. and aims to provide accurate, free consultancy to corporate managers, helping them make the right decisions to meet business needs.
SONIVIS:Tool aims at analysing social (virtual) information spaces like wikis. These spaces are investigated using different network definitions (collaboration/information networks). Clustering algorithms and statistical analyses are provided.
The Easy BI family contains engines (generators) for reporting, charting, tabular data, and OLAP. All parts are separate, so changing any part or doing custom development is simple and fun.
CI4FREE is an open source Java library and a subset of InformationButler. CI4FREE helps to syndicate business information and analyse it using BSC and SWOT methodology. The results are stored in a database and can be published or integrated into the CRM.