Ready to implement AI with confidence (without sacrificing security)?
Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.
Start building today
Automate contact and company data extraction
Build lead generation pipelines that pull emails, phone numbers, and company details from directories, maps, social platforms. Full API access.
Generate leads at scale without building or maintaining scrapers. Use 10,000+ ready-made tools that handle authentication, pagination, and anti-bot protection. Pull data from business directories, social profiles, and public sources, then export to your CRM or database via API. Schedule recurring extractions, enrich existing datasets, and integrate with your workflows.
This is a small command-line program to split a phone bill into several seperate ones to categorise the calls (e.g. for illustration which family member phones how much). It generates HTML files and is able to send them to specified email adresses.
A parser generator derived from ANTLR with JavaCC like (but not the same) syntax and some new features. It also comes with sample grammars for CSharp, Java, GnuC, Html, XHtml, Xml, ECMA262... etc.
HtmlClient provides an SGML/HTML/XHTML parser and connection client making web-spidering as easy for developers as actually surfing the web with a premade browser. Based on Apache's HttpClient.
Luci is an accessibility tool (htmlparser written in Perl) designed to provide users with a text-only experience of a web site. Inspired by the Betsie project, Luci offers features such as ssl, cookies, frames, 401 authorization (ie. htaccess), etc...
Multistat is designed to be a flexible log file parser for multiple game servers. It is designed with an extensible plugin interface, thus allowing for any combination of input or output (i.e. Quake 3 input with HTML output or Halflife with HTML outp
Atropine is a library for assisting with screen-scraping tasks, particularly making that of making exhaustive assertions about the structure of HTML documents. It is built on top of the fantastic BeautifulSoup HTMLparser.
Ideal for lending professionals who are looking for a feature rich loan management system
Bryt Software is ideal for lending professionals who are looking for a feature rich loan management system that is intuitive and easy to use. We are 100% cloud-based, software as a service. We believe in providing our customers with fair and honest pricing. Our monthly fees are based on your number of users and we have a minimal implementation charge.
PyBookmark manipulates bookmark files. It can sync files (no server required), merge, sort, remove duplicates, and check links. Its library pybookmarklib provides access to these operations, data structures, and parser for further extensibility.
A web page parser for sql written in java. It breaks html into table rows. It can datamine dynamic tables and alert a user to trends. It interfaces with authenticated & search sites. It is very flexible, uses selfmoding code, and can do http push
A JAVA suite for parsing arbitrary text data. Not just HTML or XML or Java, but all of them.
Use it when the JDK tokenizers are too limited, JavaCC, JTB etc. are too complicated, or You need dynamic parser configuration
Command line HTMLParser to be used in scripts to extract data from HTML/webpage according to supplied path and options. Usefull for systematic periodic parsing pages with known structures where information keeps changing - like looking for item on ebay
Cowzilla is Atom 0.3 parser written as a PHP class. It has the functionality to parse and convert Atom to RSS 2.0 (atom2rss), Atom to a HTML document, and check your GMail.
Pix 6.3 Parser is a PERL-based utility for parsing the particular logging style of PIX OS 6.3 when you have object-groups tied to ACLs via access-groups. It outputs a detailed HTML page, an RSS feed and helps you compile and track network abuse reports.
Web documents that look similar often use different HTML tags to achieve their layout effect. These tags often make it difficult for a machine to find text or images of interest. Our goal is to implement a parser to overcome this.
With this PHP template parser you can load in one huge array of information and use it in your template (html document). It's a bit like the XML / XSL methods but these don't work very good in PHP so I wrote this replacement.
Object oriented PHP based HTMLparser. The HtmlParser class allows you to interate through HTML nodes and get their attributes, names and values. It also comes with an example class for converting HTML to formatted ASCII text.
BowML Parser is a miniature PHP-framework that lets you parse any type of markup language (XML,HTML,WML etc). Where each element, string node and attribute is its own object. Also good for creating or rectreating webpages by using objects instead of HTML.
XML_HTMLSax is a native PHP SAX based parser for badly formed XML, such as HTML. It provides an API similar to the native PHP XML extension making it easy to switch between the two. Also distributed via http://pear.php.net/XML_HTMLSax
A perl script that parses given JavaScript source files for documentation in JavaDoc style, and generates HTML description page about functions, its parameters, etc.