Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Desktop and Mobile Device Management Software
It's a modern take on desktop management that can be scaled as per organizational needs.
Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
GTR [gi'tar] General/GTR Tag Replacer
Project with the aim of producing js library for automated tag replacing with a stripped set of cross-platform functions and an automated check for reusing installed libs such as Prototype and jQuery.
VNC for use with the BrowserMob Selenium JavaScript Validator. This tool is made available for users of BrowserMob FREE Website Monitoring and Load Testing. The BrowserMob Local Validation Service can be downloaded from https://browsermob.com/tools.
SAADA transforms a set of heterogeneous astronomical data FITS files in a powerfull database deployed on the Web and including VO features. The database is located on your host and keeps under your full control. This job doesn't require to write cod
A full-featured lightweight wrapper library of the native html canvas element written in Javascript, aimed to make visualization and animation using canvas simpler. Features animation support, layers, event capture, multitouch and many examples.
XDXF is a project to unite all existing open dictionaries and provide both users and developers with universal XML-based format, convertible from and to other popular formats like Mova, PtkDic, StarDict, ABBYY Lingvo etc.
* New drafts of format can be found here: https://github.com/soshial/xdxf_makedict/tree/master/format_standard
* The development of dictionary converter moved to Github: https://github.com/soshial/xdxf_makedict
PHPingMon (prev. Ping Monitoring) is a web based tool to get live data from web server about ping information: min, avg, max and packet loss in a table and graphic, individually scaling of ping graph.
XUProxy is an extensible multi-protocol proxy based on the Twisted framework. It supports multiple protocol plugins (currently only HTTP), and multiple "filter" plugins for things like logging, caching, and Proxomitron-compatible ad filtering.
This application uses XML files as input to apply a XSLT transformation and export user-readable documents, such as HTML or LaTeX.
Il was originally intended to create Yearbooks based on the already existing files with details about an organisation.
As a healthcare provider, you should be paid promptly for the services you provide to patients. Slow, inefficient, and error-prone manual coding keeps you from the financial peace you deserve. XpertDox’s autonomous coding solution accelerates the revenue cycle so you can focus on providing great healthcare.
Website Content Management System which uses Perl & MySQL to create an XHTML 1.0-compliant CSS-layout website. News, comments, discussion forum, content pages, side blocks, admin panel, anything can be added as a module or side block.
Paradice is a Theme for the popular web-based database administration tool phpMyAdmin. You will find the development version here. All official releases will be made by the phpMyAdmin project.
HTTP functional and non-functional (load and performance) toolkit based on jython/grinder (http://grinder.sf.net) ...includes capabilities to support: SOA services, REST, json/xml encoding, AES and WS security ... and a stub to collect requests
PHP web API designed to simplify object handling(loading, saving, querying, displaying, and editing), abstract the data from its display structure, and layout and allow the target data to be delivered to any supported format without special logic.
SYMPLiK RANGEHOOD is a Javadoc-like tool for Oracle database. This pure-Java program "sucks up" data dictionary and object source code from database and generate document for Tables, Views, Triggers, Packages, Procedures, Functions, and others.
The aim of this project is to agglomerate together many existing software and develop missing parts to create a full featured website creation software.
now here: https://github.com/plastex/plastex
plasTeX is a Python-based LaTeX document processing framework. It gives DOM-like access to a LaTeX document, as well as the ability to generate mulitple output formats (e.g. HTML, DocBook, tBook, etc.).
A lightweight, easy to use PHP content management system (CMS). Written to be fast and to use as little memory as possible. Placeto CMS offers browser and server caching, provides gzip compression and to cut down on bandwidth and CPU time.
This is a (very) simple chat-room created using php. Instead of using MySQL or any other database software this chat-room uses a flat-file database (created using XML). The main aim of this chat-room script is to be minimal (small codebase).
A very small PHP website application which stores anonymously uploaded files and retrieves them by SHA1 hash (a fingerprint of the file which is provided after uploading). Developed for anonysource.org, a kanux project.
Questo script consente di evidenziare, estrarre e condividere contenuti da una pagina web tramite la semplice selezione col mouse. This script allows you to highlight, extract and share content from a web page simply by mouse selecting.