Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
Picks up text from a web page using a html template.
A java html picker - text extractor
Picks up text from a web page using a html template. Useful if you have regularly data to extract from the same site. You may use the same url or you may build urls having parameters. These parameters are fetch from a text file.
jQuery4PHP ( jQuery for PHP ) is a PHP 5 library. Makes easy writing javascript code (jQuery syntax) using PHP objects. Develops Rich Internet Applications in an easy way without having to know javascript language with the help and power of jQuery.
Multi-connection command line tool to download Internet sites. Similar to wget and cURL, but it manages up to 50 parallel links. Main features are: recursive fetching, Metalink retrieving, segmented download and image filtering by width and height.
Developed by the Web Technology Group at Appalachian State University, phpWebSite provides a complete web site content management system ( CMS ). All client output is XHTML 1.0 and meets the W3C's Web Accessibility Initiative requirements.
The goal of this package of classes is to generate XHTML code perfectly indented and without errors. It also allows to manipulate HTML templates. There's one PHP class by HTML tag and some classes contain facility methods to set the content.
HTML parser which can be used for screen-scraping applications
htmldom parses the HTML file and provides methods for iterating and searching the parse tree in a similar way as Jquery. To report bugs please mail me at bhimsen.pes@gmail.com
It's a modern take on desktop management that can be scaled as per organizational needs.
Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
jQuery based cross-domain feed subscriber and renderer into HTML
A simple to use jQuery-based scriptlet that allows you to render HTML within a P tag (can be changed to DIV, SPAN etc with minor modifications) from an RSS/XML feed. It is capable of making cross-domain subscriptions without the need of any server-side support. (overcome single origin policy)
Features:
1. Low footprint: compact script - 2.25KB (requires jquery-minimal 1.5+)
2. Ease of use: link the script to your html and add an empty <p id='feedPanel' rssurl='<feed url>'...
Dojo Faces is a set of JavaServerFaces components based on the widgets of the Dojo Javascript toolkit. It offers dynamic, AJAX ready user interface components for JSF based web applications.
Develop: domain secure, and performing secure Domain Name System (DNS) dynamic updates. Domain Name System Security Extensions
Servlet API Package
The javax.servlet.http package contains a number of classes and interfaces that describe and define the contracts between a servlet class running under the HTTP protocol and the runtime environment provided for an instance of such a class by a conforming servlet container.
Html Assembler is a static site generator. It automatically integrates page content such as text and photos in a modifiable page template creating a complete set of html files ready for upload to your site.
Xiss web is a Java tool to generate photo slide show on the web. Generation of photo description in XML and HTML pages based on layout and skin. Exif display. Google Maps display. Animated photo transition. Uses W3C standards: XML, XSLT, XPATH, CSS.
Arbalo adds navigation and outline to presentations written in XHTML. Neither Javascript nor server logic required, no frames. Same XHTML source is used for slides and an over-all document. Sources can be compiled to HTML or interpreted in the browse
VNC for use with the BrowserMob Selenium JavaScript Validator. This tool is made available for users of BrowserMob FREE Website Monitoring and Load Testing. The BrowserMob Local Validation Service can be downloaded from https://browsermob.com/tools.
SAADA transforms a set of heterogeneous astronomical data FITS files in a powerfull database deployed on the Web and including VO features. The database is located on your host and keeps under your full control. This job doesn't require to write cod
The aw script is written so that you can browse web sites
through the command line by specifying where to look at in a
concise manner. It can also be used to make an excerpt of
web sites.
A full-featured lightweight wrapper library of the native html canvas element written in Javascript, aimed to make visualization and animation using canvas simpler. Features animation support, layers, event capture, multitouch and many examples.
PHPingMon (prev. Ping Monitoring) is a web based tool to get live data from web server about ping information: min, avg, max and packet loss in a table and graphic, individually scaling of ping graph.
Website Content Management System which uses Perl & MySQL to create an XHTML 1.0-compliant CSS-layout website. News, comments, discussion forum, content pages, side blocks, admin panel, anything can be added as a module or side block.
Paradice is a Theme for the popular web-based database administration tool phpMyAdmin. You will find the development version here. All official releases will be made by the phpMyAdmin project.