Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Cloud data warehouse to power your data-driven innovation
BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.
BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
Xtree is a Document Object Model XML extension library for PHP (written in C) that is extremely fast, simple, and efficient. With this extension, loading, saving, and manipulating XML documents couldnt be easier. An XPath Interpreter is also included.
From an xml-defined data-schema, a db schema and a php object model is generated with getters/setters, db persistence, html form templates and html post/data reception. OBSOLETE! Have a look at http://propel.phpdb.org/ instead.
FormMagick is a toolkit for easily building multi-page CGI forms. It uses an XML form description to generate the forms, and supports internationalisation.
DOP Software’s mission is to streamline waste and recycling business’ processes by providing them with dynamic, comprehensive software and services that increase productivity and quality of performance.
Loxotron is XML based web application server, written in C. It makes possible to create dynamic web pages using XML/XSLT framework and compiled shared libraries.
Cfour is a collection of reusable classes to simply common programming tasks and lower development time. The collection includes classes for file i/o, class persistence, character menus, cgi, cookies, and translation between xml tags and data pairs.
disKatalog is a media database.
It allow to index media content such as cd, hard-drive and other and maintain it in xml file.
It is possible to performe search for content using regular expression or plain text.
Adding new new language resources is eas
POST (Python Obviously Simple Text) provides support for
simple, flexible dynamic document generation in multiple output
formats. Supports inputs in text or XML, outputs
in HTML, PDF, RTF, LaTeX source, nroff source, postscript,
and plain text.
Teradata VantageCloud Enterprise is a data analytics platform for performing advanced analytics on AWS, Azure, and Google Cloud.
Power faster innovation with Teradata VantageCloud
VantageCloud is the complete cloud analytics and data platform, delivering harmonized data and Trusted AI for all. Built for performance, flexibility, and openness, VantageCloud enables organizations to unify diverse data sources, run complex analytics, and deploy AI models—all within a single, scalable platform.
This is not a Content Management System written in PHP! The aim is to provide freedom! The web Developer is free to design webpages/data using any medium that can be converted to XHTML, XML, WAP, etc. Extend a 'Renderer' to add the functionality!
phpxmlclasses is a project grouping a set of classes for XML processing using PHP, classes will be added frequently to cover new features or to provide abstraction layers to existing features. Developers wanting to contribute classes/code are welcome.
The Dumas PHP/MySQL/XML Framework is a set of PHP code which allows rapid development of data-based web sites using PHP, MySQL, and XML.
Note: This framework is old. There are other, newer, probably better frameworks available. I use this for my own web site which I have not touched for years. I'm sure there are a thousands of things I would change today.
pXp is a framework to develop dynamic web sites using XML and reusable PHP components. XSLT is used to render several presentations for the content. It is a publishing system based on XML and PHP.
phpXMLDOM (phpXD) is an XML DOM-Implementation for PHP4 written in PHP. It offers methods for accessing the nodes of an XML document using the W3C Document Object Model (DOM) Level 2 Core. phpXMLDOM does not require the PHP DOM XML extension.
Having a hard time getting your php generated HTML to conform to HTML/XML standards? Now it's easy with phpTidyHt. phpTidyHt is a set of php functions which allow you to filter all HTML output through HTML Tidy (from www.w3c.org/People/Raggett/tidy)
The PoolMan library and JDBC2.0 Driver and DataSource provide a JMX-based, XML-configurable means of pooling and caching Java objects, as well as extensions for caching SQL queries and results across multiple databases.
ezAlbum is an easy to install and use photo album written in PHP 4, featuring description of albums in XML format, automatic generation of the photo's thumbnails and templates support.
XSpell is a spell checker for HTML forms. It is composed of a XML-RPC
based browser client written in Javascript and an XML-RPC service implemented in PHP 4+.
Complete real-time log reporting solution that takes advantage of XML. Logs are translated to XML and cataloged, allowing a report to be run and viewed in real-time. Assembled in a modular format it is capable of advanced customization.
XVCL is an Open Source component library that provides component based Document/View Architecture support for developers using Borland Delphi and C++ Builder. Includes a set of ready to use components for building complex XML/HTML applications.