Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Dominate AI Search Results
Generative Al is shaping brand discovery. AthenaHQ ensures your brand leads the conversation.
AthenaHQ is a cutting-edge platform for Generative Engine Optimization (GEO), designed to help brands optimize their visibility and performance across AI-driven search platforms like ChatGPT, Google AI, and more.
PING is a distributed, web-based, personally controlled electronic medical record system that is accessible to the nomadic user and built to public standards. The PING project includes a pluggable web services API and numerous client applications.
basE91 is an advanced method for encoding binary data as ASCII characters. It is similar to UUencode or base64, but is more efficient. The overhead produced by basE91 depends on the input data. It amounts at most to 23% and can range down to 14%.
Add-On functionality for the Apache Torque Project (http://db.apache.org/toque). This project provided user contributed add-ons that are not part of the Torque distribution, such as an XML import/export framework, custom templates, and the like.
Includes tools for creating ebooks in xml-format. xTrans helps in creating an XML-Ebook from plain text like RTF, TXT. XTrans converts xml-ebooks into the final format like PDF, HTML, RTF, PDB (various forms), ...
Dun and Bradstreet Connect simplifies the complex burden of data management
Our self-service data management platform enables your organization to gain a complete and accurate view of your accounts and contacts.
The amount, speed, and types of data created in today’s world can be overwhelming. With D&B Connect, you can instantly benchmark, enrich, and monitor your data against the Dun & Bradstreet Data Cloud to help ensure your systems of record have trusted data to fuel growth.
This is a small command-line program to split a phone bill into several seperate ones to categorise the calls (e.g. for illustration which family member phones how much). It generates HTML files and is able to send them to specified email adresses.
M2MXML is an XML based protocol for Machine-To-Machine communications. The M2MXML project includes a Java API for parsing M2MXML, but is intended to be language agnostic.
SAX for .NET is a C# port of the original Java based SAX API specifications. When compiled into a .NET assembly it becomes available to the other .NET languages as well.
BinML is a specification and a Java library. It takes XML specifications of (binary) file formats as input and while parsing a binary file it builds an Abstract File Tree containing all the information in a more processing friendly format.
JAXX is a powerful XML user interface language for Java. JAXX source files compile into ordinary Java classes, making integration a snap.
JAXX features advanced data binding and event handling, and uses (fully compiled) Java as its scripting language
Full access to Enterprise features. No credit card required.
What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
FormatCheck screens flat files looking for violations in the format of the data. It uses a set of XML files that define the rules for each file format. The Swing front-end allows the user to run the verification, view and print the errors.
Strip out useless tags and other junk from HTML files. Shrink files, enhance readability of HTML source, promote privacy, and clean HTML exported from Microsoft Word (MS-Word). Run HTMLStrip as-is or customize it with your own regular expressions.
The XMI Tools are a library for reading and manipulating XMI files. The main application is called XMI-Linker; it allows us to split UML models into several files.
Monolith (lith) is a tiny, java-based embedded file-system like implementation. It allows you to store and retreive multiple sets of named data in a single file and access it without the speed costs typically associated with file archives.
LMY image codec and tools for small, 16-bit (or higher) displays on devices with limited resources. Image file size and quality rival GIF, PNG, and JPEG, but decompression is much less complex. Used commercially by Lower Mars. C/C++ and Java/j2ME include
DNA is an open, flexible and extensible deep network analyzer software server and software architecture for gathering and analyzing network packets, network sessions and applications protocols, passively off enterprise class networks.
AJAX JDBC ResultSet in Javascript. Uses XMLHTTP to send xml server data to a web browser HTML form with no refresh, reload or submit (like Google Suggest). Data can be SQL, object relational map (like Hibernate), XML, other mapping (like SOAP)
JRower is a tool for loading database tables from flat files. You supply a XML file describing the fields in the file and their mapping to the database table columns. The data file can be a text file in CSV, delimited, fixed length, or Excel format.
The only -complete- XUL/embedded scripting solution for Java. JFCML supports not only AWT and Swing; but any class accessible to Java's ClassLoader. Also, most major scripting languages, including JavaScript, BeanShell, Jython, Jacl (Tcl), JRuby, more.
Framework for storing and editing Forestry-related XML data, with a specialised graphical interface for each object type. Plug-in API for adding functionality. Coded in Java, uses WebStart for distribution, swing GUI, ~100k lines of code.
The JPList project provides a Java library for parsing and building ASCII PList streams to and from Java objects, similar to what JDOM does for XML. PLists provide a more compact and equally rich alternative to XML as a data serialization mechanism.