Showing 171 open source projects for "html parser"

  • 1
    ASP Slashdot Headline Parser is a simple Active Server Page (ASP) Script which fetches the latest slashdot.xml file, parses it and displays the headlines in an HTML Table format.
    Downloads: 1 This Week
  • 2
    Java API to process or parse HTML documents. If your Java application needs to process text in HTML format, you may find this API useful.
    Downloads: 0 This Week
  • 3
    jxml2sql is a Java application for converting database structures in XML to other formats useful for database administration (e.g., SQL for table creation, HTML for reference docs). jxml2sql uses a minimalistic, non-validating Java XML parser (NanoXML).
    Downloads: 0 This Week
  • 4
    This is a parser which reads plain-text input files and generates HTML output files. It combines the presentation features of HTML with the simplicity of plain-text notes. Generates HTML index files and hyperlinks for the words you choose to index.
    Downloads: 0 This Week
  • 5
    El-Kabong is a high-speed, forgiving, SAX-style HTML parser. Its aim is to provide consumers with a fast, clean, lightweight library that parses HTML quickly while tolerating syntactically incorrect tags.
    Downloads: 0 This Week
  • 6
    HTML parser lets you parse HTML from PHP scripts. It is written in PHP.
    Downloads: 0 This Week
  • 7
    Arachnid is a Java-based web spider framework. It includes a simple HTML parser object that parses an input stream containing HTML content. Simple Web spiders can be created by sub-classing Arachnid and adding a few lines of code called after each page
    Downloads: 1 This Week
  • 8
    A web development framework; includes an application server which provides a persistent object cache and transaction support, an intelligent HTML parser, multi-threaded scripting, multiple scripting language support within a single OO framework.
    Downloads: 0 This Week
  • 9
    LogAnal is a quick hack to parse Apache log files and produce graphical and textual web server statistics. Works in incremental mode only. Supports templates for the output HTML, as well as localization (defaults to English).
    Downloads: 0 This Week
  • 10
    PM2HTML takes PageMaker files and makes a cohesive newspaper website. It comprises a PMScript that exports all stories to a directory of tagged txts, and a python program to act as a converter to turn those tagged text files into HTML, a parser to guess
    Downloads: 0 This Week
  • 11
    GSP is a "JSP/ASP-like" page parser. It generates CGI program source code from an HTML page with embedded code hidden in <% %> tags.
    Downloads: 0 This Week
  • 12
    HotSAX is a fast, small footprint, non-validating SAX2 parser for HTML/XML/XHTML. It can be used in simple web agents, page scrapers and spiders. The goal is to embed this in cell phone "midlets."
    Downloads: 0 This Week
  • 13
    A performance benchmarking package for Java XML parsers. This tool tests parsers supporting the SAX1, SAX2, JAXP, and XML Pull Parser interfaces. It produces output in XML and HTML.
    Downloads: 0 This Week
  • 14
    A library of Python scripts to extract EXIF info from digital camera-generated JPEGs and provide it in a human-readable format suitable for use in an HTML photo album generator or similar tool.
    Downloads: 0 This Week
  • 15
    Uses PCRE to get title, description, keywords, redirect, no index or no follow, hrefs, base tag, frames, img alt text, area hrefs, and plaintext.
    Downloads: 0 This Week
  • 16
    A quick and easy CGI Perl firewall log parser that gives an up-to-date look at logs created by ipchains. It displays the firewall logs as an HTML page.
    Downloads: 0 This Week
  • 17
    The aim is to develop a framework to translate a UIML (User Interface Markup Language) description of a UI into a number of platforms (wxPython, HTML, etc.). Dynamic and "static" (into a program) rendering implied.
    Downloads: 0 This Week
  • 18
    XPP stands for 'XPP Parses Perl' or 'XPML Page Parser', and is a fast, efficient HTML parser that parses embedded Perl, as well as HTML-like tags, from dynamic HTML pages called XPML pages.
    Downloads: 0 This Week
  • 19
    Web-based environment supporting session management, user management, theming, language-independent object embedding, an extendable HTML parser, distributed computing, speed, and an easy-to-use API.
    Downloads: 0 This Week
  • 20
    The Skêd-Schedule-Parser offers a convenient way to convert an HTML university schedule created by Skêd to an iCalendar-compatible file (*.ics) which can be imported into many calendar applications, e.g. Thunderbird Lightning, MS Outlook.
    Downloads: 0 This Week
  • 21
    A .NET (C#) program to convert grammar-based text (like code) to colorful, CSS-based HTML. Based on the GOLD parser.
    Downloads: 0 This Week