Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Run applications fast and securely in a fully managed environment
Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.
Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
A lightweight EAI tool based on Java technology with a web browser interface for building integration processes from an extensible selection of adapter components. It runs on Apache Tomcat application server and stores its configuration in MySQL database.
Planetary Scale Event Propagation and Router ("PsEPR" pronounced "pepper") is an experimental status and notification service. Pub/sub system for exchange of XML messages creating an event service for PlanetLab (http://www.planet-lab.org/).
xmote is a standard for exchanging data in a compact standardized XML format. In addition to defining the standard, xmote aims to provide a fully compliant and easy to use reference implementation.
It's a modern take on desktop management that can be scaled as per organizational needs.
Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
Relaxed is an HTML validation app. as well as a XHTML 1.0 / HTML 4.01 and WCAG 1.0 schema definition written in Relax NG with embedded Schematron. Those expressive languages allow automated validation of many additional restrictions inexpressible by DTD.
SETTLE (Seer Extract/Transform/Translate/Load Engine) This utility uses XML configuration files to define the flow of information from Data Sources to varioius Data Targets. XML is likewise used to define transformation rules in between. Transformation l
CodeSnippet is a set of Eclipse plug-ins to convert Ant, Java, Properties, CSS, DTD, HTML, and XML source code (complete files or snippets) to HTML with syntax highlighting.
BambooHR is all-in-one HR software made for small and medium businesses and the people who work in them—like you. Our software makes it easy to collect, maintain, and analyze your people data, improve the way you hire talent, onboard new employees, manage compensation, and develop your company culture. It’s designed to set you free to focus on what matters most—your people.
The metabolomics standards initiative (msi) will develop a Core Information for Metabolomics Reporting (CIMR) recommendation and a more formal and semantically defined corresponding ontology (msi-ontology).
BeanReader is a tool that uses annotations on Java classes to extract objects from XML documents. The annotations indicate (using XPaths) where in the document the object's properties are represented.
Xcarecrows4XML completes Eclipse to handle the tasks required by an XML workflow. It offers: a graphical XML, XML Schema and XML stylesheets editor; a graphical XML tree comparator; a built-in checker against XML Schemas; an XSL transformations tool kit.
XUI is a Java and XML RIA platform for building smart app's. Swing, AWT and other widget sets can be used on a range of hardware. XUI's modular framework can help many aspects of application development. NetBeans and Eclipse are available.
htmlCharset is a file conversion tool, useful for replacing HTML entities by the actual characters that they represent, or vice versa. As a spin-off, it can also be used as a general charset converter for arbitrary text files.
jumbles (Java Unified Metadata Basic Library for Extracting and Storing) is a library that enables the extraction and storing of multimedia metadata. Currently wraps "jaudiotagger" (MP3 ID3 tags) and "metadata extractor" (EXIF, et al.).
BITXml is an open xml based protocol for Machine to Machine communications. BITXml is both language and "transport" agnostic and covers all the requirements of a generic M2M telemetry & control application. For more info www.bitxml.org
Simple Mutated API for XML. A prototype of a new java based stream parser based on sax with an improved interface to improve ease of use and performance. See Web page for details.
JAVA tool for creating html image maps. It's possible to load a picture and create rectangles, circles and polygons. Additionally, content for href, alt and title can be set.
JZipLib is a very simple interface for zipping a file. It's as easy as calling a method named createZip (File input, File zipfile). Please see the Wiki for documentation.
XHTML Doclet is a standards-compliant alternative to the Javadoc standard HTML doclet. It revises the document structure to exclude outdated tags and inline styles, creates valid XHTML markup, and provides better hooks for more flexible CSS manipulation.
CHMPane is a Java program to process compiled html (.chm) files. It has a stream based API for accessing CHM file content and provides a GUI to navigate and view the content.