Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
The project Navigator aims at supporting automated gathering of dynamic information from third party web sites, using their web interface to post queries and to gather replies. Navigator is written in OS-independent java language.
This project aims to create a standalone application, java-written Match Viewer for the online football (soccer) management game Hattrick - www.hattrick.org.
The goal of the Openbus Project is to provide the opensource community with a set of high quality, reliable and scalable Enterprise Application Integration (EAI) products. It will include an opensource JBI implementation, an opensource ESB, etc.
Java JNDI 1.1 naming context. JNDI is not just for J2EE, with this package you can use it with any Java application. This context resides completely in memory, is configurable from xml, and capable of storing any Java object. Requires java 1.4 or higher.
Ever wanted to create notes for pages on the web and be able to share them? This framework allows to create annotations to XML and HTML documents. Annotations are independent of presentation and can be traced back even if part of the document changed.
URIQA4J is an implementation of Nokia's URI Query Agent Model for the Java platform. Used as a component for your web application, it exposes your Java Objects encoded as Concise Bounded Descriptions of RDF resources.
LMS Tools is a set of tools and libraries for administrators and users of learning management systems to perform, for example, portal integration, LMS management or usage data analysis.
an xml extension which allows for elegant parameterization of file-based application input and output files.
OS: any with a jvm
lang: Java
libs: xerces
features: File readers, writers, and parameterizers.
Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software.
Banks, lending institutions
Founded in 2004, axefinance is a global market-leading software provider focused on credit risk automation for lenders looking to provide an efficient, competitive, and seamless omnichannel financing journey for all client segments (FI, Retail, Commercial, and Corporate.)
The Jorne project develops software and open standards for linking Lojban text with WWW and Semantic Web metadata (e.g. RDF/N3, RSS, XML). Lojban is an artificial spoken and written language based on predicate logic.
TM4J is a topic map engine implemented entirely in Java. Topic maps are a standard paradigm for the interchange of knowledge structures. This project aims to produce a complete suite of tools for creating, processing and publishing topic map information.
StoreX is the next generation information management system. It acts as a backend repository for applications eg. emails, blogs, twikis etc. It provides an API on a local machine or on a remote machine connected through our WebDAV protocol.
Andalus is a high availability, fully featured Islamic library written entirely in Java. It is a technology suitable for nearly any application that requires to make use of integrable Islamic resources. http://openandalus.wordpress.com/
Web Objects in XML (Framework) is an interoperable and lightweight system for distributed objects. It is currently built in Java and C#. Remote objects are serialized in a standard XML format, and they are accessible trough their own URL.
Web 2.0 environment for collaborative and extensible design of object interconnection diagrams like UML, networks, circuits and other graph based diagrams. Client: Web-based (AJAX / Dojo Toolkit) Server: JEE Application (Struts Framework)
JWSPerf is an open-source utility to automate the performance evaluation using different Java Web Services toolkits. JWSPerf is executed from the command line and it uses Java Ant technology to building all the client code.
What is a domain-specific language intended for constructing web
services on top of ordinary web pages, or otherwise automate
web-related tasks.
Using a powerful pattern matching sublanguage, What strives to be for XML/HTML what Perl is for text analysis
This is a collection of REST specifications, and implementations of those specs, for very low-level information sharing and workflow operations using REST actions over HTTP.
Implementations are in various languages, mainly Java, Python, and Ruby.
A set of Java classes for handling KML. Also an application, based on the LIB of the same name that provides limited KML tasks such as merging, cropping.
Will eventually support some nice interactions with google maps.
This project offers an API to integrate Cobol/CICS program calls inside XML-Centric Entreprise Service Bus, like Synapse.
Implements data transformations from/to xml to/from flat data, based on Cobol CopyBooks definitions. Support StAX, DOM, SAX.
Generic Web Services and framework for developers and project managers. It covers: Configuration and parameter values managed within a database; High volume file storage within a hierachical file system (SAN), XML transformation and writers, etc.