Ready to implement AI with confidence (without sacrificing security)?
Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.
Start building today
Automate contact and company data extraction
Build lead generation pipelines that pull emails, phone numbers, and company details from directories, maps, social platforms. Full API access.
Generate leads at scale without building or maintaining scrapers. Use 10,000+ ready-made tools that handle authentication, pagination, and anti-bot protection. Pull data from business directories, social profiles, and public sources, then export to your CRM or database via API. Schedule recurring extractions, enrich existing datasets, and integrate with your workflows.
ASP Slashdot Headline Parser is a simple Active Server Page (ASP) Script which fetches the latest slashdot.xml file, parses it and displays the headlines in an HTML Table format.
Java API to process or parse HTML documents.
If your Java application needs or would like to be able to process some text in HTML format, you'd probably find this API interesting.
jxml2sql is a Java application for converting database structures in XML to other formats useful for database administration (ie. SQL for table creation, HTML for reference docs). jxml2sql uses a minimalistic, non-validating, Java XML parser (NanoXML).
This is a parser which reads plain-text input files and generates HTML output files.
It combines the presentation features of HTML with the simplicity of plain-text notes.
Generates HTML index files and hyperlinks for the words you choose to index.
El-Kabong is a high-speed, forgiving, sax-style HTMLparser. Its aim is to
provide consumers with a very fast, clean, lightweight library which parses HTML
quickly, while forgiving syntactically incorrect tags.
Arachnid is a Java-based web spider framework. It includes a simple HTMLparser object that parses an input stream containing HTML content. Simple Web spiders can be created by sub-classing Arachnid and adding a few lines of code called after each page
A web development framework; includes an application server which provides a persistent object cache and transaction support, an intelligent HTMLparser, multi-threaded scripting, multiple scripting language support within a single OO framework.
LogAnal is a quick hack to parse Apache Log Files and produce graphical and textual web server statistics.
Works in incremental mode only. Supports Templates for the output HTML, as well as localization (defaults to English).
Secure and customizable compute service that lets you create and run virtual machines.
Computing infrastructure in predefined or custom machine sizes to accelerate your cloud transformation. General purpose (E2, N1, N2, N2D) machines provide a good balance of price and performance. Compute optimized (C2) machines offer high-end vCPU performance for compute-intensive workloads. Memory optimized (M2) machines offer the highest memory and are great for in-memory databases. Accelerator optimized (A2) machines are based on the A100 GPU, for very demanding applications.
PM2HTML takes PageMaker files and makes a cohesive newspaper website. It comprises a PMScript that exports all stories to a directory of tagged txts, and a python program to act as a converter to turn those tagged text files into HTML, a parser to guess
HotSAX is a fast, small footprint, non-validating SAX2 parser for HTML/XML/XHTML. It can be used in simple web agents, page scrapers and spiders. The goal is to embed this in cell phone "midlets."
A performance benchmarking package for Java XML parsers. This tool tests parsers supporting the SAX1, SAX2, JAXP, and XML Pull Parser interfaces. It produces output in XML and HTML.
A lib of Python scripts to extract exif info from digital camera-generated jpegs and provide them in a human-readable format suitable for use in some kind of html photo album generator, or somesuch.
The aim is to develop a framework to translate UIML (User Interface Markup Language) description of UI into a number of plaforms (wxPython, HTML, etc.).
Dynamic and "static" (into a program) rendering implied.
XPP stands for 'XPP Parses Perl' or 'XPML Page Parser', and is a fast/efficient HTMLparser that parses embedded perl, as well as HTML like tags, from dynamic html pages called XPML pages.
The Skêd-Schedule-Parser offers a convenient way to convert a HTML-university-schedule created by Skêd to an iCalender-compatible file (*.ics) which can be imported in many calendar-applications, e.g. Thunderbird Lightning, MS Outlook.