Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Easy-to-use Business Software for the Waste Management Software Industry
DOP Software’s mission is to streamline waste and recycling business’ processes by providing them with dynamic, comprehensive software and services that increase productivity and quality of performance.
Cairo (Complex Archive Ingest for Repository Objects) is a tool for processing digital archives prior to submitting them to archival storage for long-term preservation; among other features, this includes format identification and metadata extraction.
A package that works with Apache Velocity and allows you to build web pages and other content from reusable Velocity template fragments, controlled by hierarchies of layout descriptions in XML.
proDMX is a library for the Processing programming language that allows to send and receive DMX messages through the serial port. It has been superseded by the dmxP512 library, available here: http://motscousus.com/stuff/2011-01_dmxP512
XIPL/XASM is a basic environment to build high-level Image Processing operations using a XML-based language. The idea is to compile the high-level XIPL language into a low-level one, XASM (XML-based too), using XSLT technology.
Manage what you buy and how you buy it with Order.co, so you have control over your time and money spent.
Simplify every aspect of buying for your business in Order.co. From sourcing products to scaling purchasing across locations to automating your AP and approvals workstreams, Order.co is the platform of choice for growing businesses.
The DITA Open Platform is a free, open-source project which goal is to provide an enterprise platform for the edition, management and processing of DITA documents.
JFileReader allows users to view the contents of any file in a bytewise manner using a simple GUI interface. Functions include byte view changes to different primitive datatypes, endianess changes, and file content exports to new files.
Convert plain text format RFC docs into open format, such as HTML, PDF.
Features: Index page link, Document reference link, Figure/Table reference link;
customizable CSS.
EDI-Knight is an integration and translation software focussing EDI (Electronic Data Interchange), CSV (Character Separated Values), and XML. There's a command line tool for batch processing as well as a Java/J2EE conformant API.
Webiyo (pronounced "webby-O") is a small Java 1.5 library containing classes for generating web pages, processing forms, and unit-testing web sites. Since no template files are used, it allows you to take full advantage of your IDE's refactoring tools.
The Most Powerful Software Platform for EHSQ and ESG Management
Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.
Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.
Runes is a plugin-based framework that reverses the way of thinking about data processing: specify data accesses, and Runes will automatically choose an efficient representation for the data and execute the plugins to integrate it from different sources.
twexter formats twin twext translations to help us learn language .. demo: http://test.twext.com .. javascript code is open at http://github.com/tudisco/twexter
JLoom is a JSP like template language for text generation - e.g. source code, HTML, XML. JLoom templates are modular encapsulated. Parameters can be any Java type, even Generics or Varargs. There is a plugin for Eclipse and a command line tool.
Use Xilize to create XHTML pages or entire websites with just a plain-text editor. The markup is similar to Textile and extensible via BeanShell. Run as a jEdit plugin, from the command line, or embed in a Java program. Small, fast, easy-to-use.
"distribution" is a message and data processing tool. It allows to process information through a graph of processors. It may be used to build mailing lists, fax gateways, email filters, PDF mailing combinators, report systems and many other processes
Robust personal accounting with function of double entry booking, e-billing, tax processing, budgeting, and foreign currency ex rate revaluation. Multi-language is supported. Pl enjoy encrypted p2p invoice transfer and info-on-finger style user interface
This UBmatrix Processing Engine is an SDK for building XBRL-enabled applications. The engine is the essential component to read, write, and validate XBRL. This project was released under UBmatrix Inc. Learn more about EDGAR Online XBRL Processing Eng
LiMa means Lightweight Markup Language. It is a parser for an easy to use ASCII/Text-based markup - comparable to Markdown or the Wikipedia-Markup language with special configurable extensions in defining Links and image-resources.
PiNKY is a NetKernel module containing services and a language runtime to execute 'Yahoo Pipes' processes on the NetKernel resource-oriented processing platform.
CPLed is an OpenSIPS tool for editing CPL scripts in a friendly and easy graphical way. It can be used as a standalone application or embedded in a web page as applet. It also provide CPL script transport functionalities via SIP and HTTP protocols.
Domify is a Java library which adapts an arbitrary graph of Java objects to a W3C DOM interface. The DOM nodes are lazy-loaded to minimize processing overhead.