Mex PREMIS Editor is an editor for the MEX Editor Framework. PREMIS stands for Preservation Metadata: Implementation Strategies. The editor will implement PREMIS v1.1 (version 2.0 is under development).
cobol2j reads or writes COBOL or RPG data files imported from mainframe, AS/400, or Baby/36 environments. It decodes packed decimal, zoned, and packed date fields, converts EBCDIC, and supports ETL of ISAM data to any other platform. PC COBOL (ASCII) is also supported.
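As a rough illustration of the two decoding steps mentioned above, the following is a minimal Python sketch of packed-decimal (COMP-3) decoding and EBCDIC conversion. It is not cobol2j's API (cobol2j is a Java library); the function name `unpack_comp3`, the `scale` parameter, and the choice of code page `cp037` are assumptions made for this example.

```python
from decimal import Decimal


def unpack_comp3(raw: bytes, scale: int = 0) -> Decimal:
    """Decode a packed-decimal field (hypothetical helper, not cobol2j's API).

    Each byte holds two 4-bit digits, except the last byte, whose low
    nibble is the sign: 0xB or 0xD means negative, 0xA/0xC/0xE/0xF positive.
    `scale` is the number of implied decimal places from the record layout.
    """
    if not raw:
        raise ValueError("empty field")
    digits = []
    for byte in raw[:-1]:
        digits.append(byte >> 4)
        digits.append(byte & 0x0F)
    digits.append(raw[-1] >> 4)
    sign_nibble = raw[-1] & 0x0F
    if any(d > 9 for d in digits) or sign_nibble < 0x0A:
        raise ValueError("not a valid packed-decimal field")
    value = int("".join(str(d) for d in digits))
    if sign_nibble in (0x0B, 0x0D):
        value = -value
    return Decimal(value).scaleb(-scale)


def ebcdic_to_text(raw: bytes, codepage: str = "cp037") -> str:
    """Convert EBCDIC bytes to text; cp037 (US/Canada) is an assumed code page."""
    return raw.decode(codepage)
```

For example, the three bytes `12 34 5C` with two implied decimal places would decode to 123.45. Real files need the copybook layout to know where each field starts and which code page applies.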
The development and curation of a range of XML-based tools for using Chemical Markup Language (CML), including XSD XML Schemas for validation, datatyping and constraining CML documents and XSLT Stylesheets for transforming, filtering and rendering.
SAIM interlinks knowledge bases in the Semantic Web. It focuses on instance matching of very large knowledge bases available as SPARQL endpoints. SAIM uses machine learning techniques and is compatible with SILK.
Example-based Modeling (EMO) is a tool for creating data models, with examples, through a web interface. You interactively create a web-accessible database of models and samples for those models. A white paper describes the underlying assumptions.
TransOnto is a knowledge transformation and migration system for transferring knowledge between semantic representations. TransOnto includes the tron library and the SemPP POWDER processor.
This project is retired and has moved into the Corn Gate project. Please use https://sourceforge.net/projects/corn-gate.corn.p/ instead.
Simple Remoting is an alternative SOA library that uses JSON instead of XML as its messaging format. Converting popular Java object types such as POJOs, Spring beans, and EJBs to services takes minutes and requires no code changes to existing classes.
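The general idea of exposing an existing object as a JSON-messaging service, without touching the class itself, can be sketched as follows. This is a Python illustration of the pattern only, not Simple Remoting's actual (Java) API; the `Calculator` class, the `handle_request` function, and the `method`/`params`/`result`/`error` message fields are all hypothetical.

```python
import json


class Calculator:
    """Hypothetical existing class; it knows nothing about remoting."""

    def add(self, a, b):
        return a + b


def handle_request(target, request_json: str) -> str:
    """Dispatch a JSON request to a public method of `target`.

    Assumed message shape: {"method": "<name>", "params": [...]}.
    The reply is {"result": ..., "error": ...} encoded as JSON.
    """
    try:
        request = json.loads(request_json)
        name = request["method"]
        if name.startswith("_"):
            # Do not expose private or special attributes.
            raise AttributeError(name)
        method = getattr(target, name)
        result = method(*request.get("params", []))
        return json.dumps({"result": result, "error": None})
    except Exception as exc:
        message = f"{type(exc).__name__}: {exc}"
        return json.dumps({"result": None, "error": message})
```

A caller would send `{"method": "add", "params": [2, 3]}` and receive a JSON reply carrying the result. The wrapped class stays unchanged because the dispatcher looks methods up by name at call time.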
Trial Criteria Online Data Entry (trialCODE): a Java-based user interface that codifies eligibility requirements used to automate the screening of potential subjects for clinical trials. Used for the caMATCH screening engine on the BreastCancerTrials.org site.
netcdf-tools is a set of tools for creating netCDF files. It supports command-line use on both Windows and Unix, as well as direct use as a Java library. Written by CSIRO Australia and funded by the ANDS Australian Research Data Commons Project.
CalDAV sync for Android
----
Stopped working on it. It doesn't make sense to spend more time on it; I recommend e.g. DAVdroid as an open-source, ready-to-use client-side solution.
----
For code look at:
https://github.com/e7o-de/android.caldav
SPIDR (Space Physics Interactive Data Resource) is a distributed database and application server network, built to select, visualize and model historical space weather data. SPIDR is a web-application and a grid of data mining web-services.
A Java API binding of the IETF Mime-Dir and vCard RFCs. This package defines implementation-neutral Mime-Dir and vCard Java interfaces for exposing data objects corresponding to those defined by the RFCs.
A toy XML-aware (but otherwise generic and extensible) content management system demonstrating how to do sophisticated management of versioned hyperdocuments with a focus on issues of import and export of compound documents (e.g., XInclude-based).
The SSAF ("Secure Search And Forwards") is a dirt-simple standalone web app for inexpensive and secure information sharing. Any uploaded record may be forwarded to an intended destination, and may also be stashed in a searchable repository.
The gateway is an open source JavaEE application developed by the Vermont Dept of Taxes. It provides a web services framework for accepting Streamlined Sales Tax registrations and returns. It also includes a web interface for submitting transmissions.
The Wicket Stuff project makes third party components available using the Wicket web component framework. Subprojects of this project contain integrations for Spring, Groovy, Hibernate, Velocity and other popular Java open source projects.
The tool validates metadata in a particular format. Specifically, it checks three things: 1. Big5 character encoding; 2. whether the document is well-formed XML; 3. other specifications for our own purposes.
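The three checks listed above might be sketched in Python as follows. This is an assumption-laden illustration, not the tool's own code: the function name `validate_metadata` is invented, and because the project-specific rules in step 3 are not described, a required `<title>` element stands in as a placeholder.

```python
import xml.etree.ElementTree as ET


def validate_metadata(raw: bytes) -> list[str]:
    """Return a list of problems found; an empty list means all checks passed."""
    # 1. The bytes must decode cleanly as Big5.
    try:
        text = raw.decode("big5")
    except UnicodeDecodeError as exc:
        return [f"not valid Big5: {exc}"]

    # 2. The decoded text must be well-formed XML.
    try:
        root = ET.fromstring(text)
    except ET.ParseError as exc:
        return [f"not well-formed XML: {exc}"]

    # 3. Project-specific rules. The real rules are unknown; requiring a
    #    <title> child is a hypothetical placeholder.
    problems = []
    if root.find("title") is None:
        problems.append("missing <title> element")
    return problems
```

The checks run in order and stop at the first encoding or parsing failure, since the later checks cannot be evaluated on a document that failed an earlier one.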
An XML carver that can carve damaged or non-standard XML out of any file. It rebuilds the XML tree and reports the offsets of all the carved XML data. This tool was developed for the DFRWS 2010 Forensics Challenge.
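The first step of such carving, locating tag-like tokens and their byte offsets inside arbitrary data, could look roughly like the Python sketch below. It does not reproduce the tool's algorithm and does not rebuild a tree; the `carve_tags` name and the simplified tag pattern are assumptions for illustration.

```python
import re

# Simplified pattern for a tag-like token: optional "/", a name,
# optional attribute text, optional "/" before ">". Comments, CDATA,
# and processing instructions are deliberately ignored in this sketch.
TAG = re.compile(rb"<(/?)([A-Za-z_][\w.\-]*)[^<>]*?(/?)>")


def carve_tags(blob: bytes) -> list[tuple[int, str, str]]:
    """Return (byte offset, kind, tag name) for each tag-like token in blob."""
    found = []
    for match in TAG.finditer(blob):
        if match.group(1):
            kind = "close"
        elif match.group(3):
            kind = "empty"
        else:
            kind = "open"
        found.append((match.start(), kind, match.group(2).decode("ascii")))
    return found
```

A tree-rebuilding pass could then pair the open and close tokens with a stack, tolerating unmatched ones, which is where damaged input would need the most care.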