Showing 69 open source projects for "html parser"

  • 1
    go_spider

    An awesome Go concurrent Crawler(spider) framework

    ...It can easily be extended into a customized crawler, or you can use only the default crawl components. The Spider gets a Request from the Scheduler that contains the URL to be crawled. The Downloader then downloads the result (HTML, JSON, JSONP, or text) of the Request. The result is saved in a Page for parsing in the PageProcesser. HTML parsing is based on the goquery package. JSON parsing is based on a simple JSON package. JSONP is converted to JSON. The text form represents plain-text content and is not parsed. The PageProcesser module only parses results: it produces results (key-value pairs) and the URLs to be crawled in the next step. ...
    Downloads: 0 This Week
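    The request flow described for go_spider (Scheduler, Downloader, Page, PageProcesser) can be illustrated with a minimal sketch. This is not go_spider's code or API: it is written in Python with only the standard library, and the names `crawl`, `process_page`, `LinkAndTitleParser`, and the in-memory `FAKE_SITE` downloader are all hypothetical stand-ins chosen for illustration.

```python
from collections import deque
from html.parser import HTMLParser


class LinkAndTitleParser(HTMLParser):
    """Collects href values and the <title> text from one HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.title = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def process_page(url, body):
    """Page-processor step: only parses, returning (key-value results, next URLs)."""
    parser = LinkAndTitleParser()
    parser.feed(body)
    parser.close()
    return {"url": url, "title": parser.title.strip()}, parser.links


def crawl(start_url, download):
    """Scheduler loop: pop a URL, download it, parse it, queue unseen URLs."""
    queue, seen, results = deque([start_url]), {start_url}, []
    while queue:
        url = queue.popleft()
        body = download(url)  # downloader step; returns None when nothing is found
        if body is None:
            continue
        items, next_urls = process_page(url, body)
        results.append(items)
        for nxt in next_urls:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return results


# Hypothetical in-memory "site" standing in for a real HTTP downloader.
FAKE_SITE = {
    "/a": "<title>Page A</title><a href='/b'>b</a><a href='/a'>self</a>",
    "/b": "<title>Page B</title><a href='/missing'>gone</a>",
}

if __name__ == "__main__":
    for item in crawl("/a", FAKE_SITE.get):
        print(item)
```

    Keeping the parsing step free of I/O, as the description suggests, means the scheduler and downloader can be swapped without touching the page-processing logic.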
  • 2
    FigDice

    PHP Templating System

    FigDice is a template rendering system, primarily designed for web applications and the dynamic generation of HTML documents. FigDice puts the view screens (templates) at the centre of the rendering process. FigDice defines a simple notation based on XML tags and attributes, allowing the engine, using source files very close to the target HTML, to produce the rendering, dynamically enhanced by the Web application's services and presentation layers. FigDice is independent of the...
    Downloads: 0 This Week
  • 3
    CppWeb - C++ Web developement framework

    Cross-platform C++ library for developing CGI Web applications

    CppWeb is a cross-platform C++ library for developing web applications with server-push support. The library decodes CGI variables and cookies, supports file uploads, performs automatic cookie detection, provides URL and HTML entity encode/decode functions, supports server push (long polling via Ajax), and has a built-in HTML parser, an SQLite database wrapper, etc. CppWeb compiles on Windows, Linux, and Mac OS X (tested with GNU C++, MinGW, MS Visual C++, and Borland C++ compilers) and can run with almost any web server (Apache, IIS, Boa, etc.). It can be used in embedded systems (tested with FriendlyARM Mini2440 and Raspberry Pi).
    Downloads: 0 This Week
  • 4

    jStyleParser

    Java CSS parser and DOM style assignment library

    jStyleParser is a CSS parser written in Java. It has its own application interface, designed to allow efficient CSS processing in Java and mapping of the values to Java data types. It can also apply the parsed style sheets to a DOM that represents an HTML or XML document and compute the resulting style of the individual document elements.
    Downloads: 0 This Week
  • 5

    CPoll based C++ server pages

    Server side scripting language similar to ASP and PHP, but using C++.

    CPPSP (C++ Server Pages) is an open source web application framework similar to ASP.NET. It features a template parser that parses, compiles, and loads CPPSP pages automatically at runtime. CPPSP pages have a very similar syntax to ASP and ASP.NET, where all code is considered HTML by default, and server-side active code can be embedded using "<% ... %>". CPPSP is built upon the CPoll asynchronous I/O and utility library, which offers simple I/O abstraction, network abstraction, memory management, and container classes. ...
    Downloads: 4 This Week
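    The CPPSP convention that everything is HTML by default and active code sits inside "<% ... %>" can be sketched as a simple template splitter. This is not CPPSP's parser (which also compiles and loads pages); it is a Python illustration under simplifying assumptions: the hypothetical `split_template` helper does not handle "%>" appearing inside string literals, and it ignores any other directive syntax CPPSP may support.

```python
import re

# Non-greedy match of one embedded code block; DOTALL lets blocks span lines.
TAG = re.compile(r"<%(.*?)%>", re.DOTALL)


def split_template(source):
    """Split a template into ("html", text) and ("code", text) segments, in order."""
    segments = []
    pos = 0
    for match in TAG.finditer(source):
        if match.start() > pos:
            # Everything between code blocks is treated as literal HTML.
            segments.append(("html", source[pos:match.start()]))
        segments.append(("code", match.group(1).strip()))
        pos = match.end()
    if pos < len(source):
        segments.append(("html", source[pos:]))
    return segments


if __name__ == "__main__":
    for kind, text in split_template("<p><% int x = 1; %>done</p>"):
        print(kind, repr(text))
```

    A template compiler would typically turn each "html" segment into an output statement and copy each "code" segment through verbatim before compiling the result.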
  • 6
    JLevel

    Kogonuso JLevel is a simple Java library that changes the way HTML can be written inside Java(tm), in both applications and Servlets. It is more than an ordinary HTML parser application. It automates all HTML tagging operations inside Java, which means you no longer need to learn how to format HTML strings when developing an application that requires HTML in Java, or even when using Servlet technology to develop your websites. ...
    Downloads: 0 This Week
  • 7
    A simple text-processing library that aims to assist in parsing all kinds of text, including plain text, XML, and HTML, which means it can be used as a simple XML parser or an HTML parser.
    Downloads: 0 This Week
  • 8

    HTML DOM Parser

    HTML parser which can be used for screen-scraping applications

    htmldom parses an HTML file and provides methods for iterating over and searching the parse tree in a way similar to jQuery. To report bugs, please mail me at bhimsen.pes@gmail.com
    Downloads: 0 This Week
  • 9
    SourceDoc is a powerful system for automatic creation, extraction, and verification of embedded documentation. Designed for C code, it features both a C parser and a preprocessor. The default output format is HTML, but other formats are possible to plug in using a public Java interface.
    Downloads: 0 This Week
  • 10

    HXPath

    XPath HTML parser

    HXPath is a command-line tool for extracting data from HTML documents. HXPath can select subtrees, like the standard xpath tool, but can also read contents and attributes and output them in a Bash-friendly format. HTML Tidy and HTTP/HTTPS GET are built in too.
    Downloads: 0 This Week
  • 11
    HTML Parser
    HTML Parser is a Java library used to parse HTML in either a linear or nested fashion. Primarily used for transformation or extraction, it features filters, visitors, custom tags, and easy-to-use JavaBeans. It is a fast, robust, and well-tested package.
    Downloads: 2 This Week
  • 12
    TagParser is a Java parser based on CSS formulas (like jQuery) that can parse any tag-based document, such as XML or HTML. Furthermore, it doesn't require documents to be well formed and can parse complex documents with embedded scripts or CSS parts.
    Downloads: 0 This Week
  • 13

    DocML Viewer

    Documentation browser for DocML files

    The DocML Viewer lets you browse a DocML file with a PHP/HTML interface similar to Javadoc. This allows development teams to easily access documentation across a network or from a website. Styles can be applied from a single CSS file. Visit http://www.docml.org for more details.
    Downloads: 0 This Week
  • 14
    C# .NET library implementing the POP3 message retrieval protocol
    Downloads: 0 This Week
  • 15
    Lightweight XML processor; XML-POJO mapping via Java5 annotations or DTD; Preprocessing of XML documents using expression language; Binary XML; RMI friendly XML; JSON format support; XML marshall/unmarshall; HTML as XML parser; Swing XML Viewer
    Downloads: 0 This Week
  • 16
    NBBC is a high-speed, extensible, sophisticated, well-documented, easy-to-use validating BBCode parser written in PHP.

    NOTE: THIS COPY OF NBBC HAS NOT BEEN UPDATED IN A LONG TIME. There are currently two active maintainers of NBBC:
    - Theyak (https://github.com/theyak/nbbc) has a copy of NBBC that still works with PHP 4, and is still more-or-less the 1.x source tree, but it has had some of the outstanding bugs fixed.
    - Vanilla...
    Downloads: 0 This Week
  • 17
    Lightweight, fault-tolerant, high-speed, single-pass HTML parser. Builds an HTML DOM that is accessed much like the browser's DOM from JavaScript. Compatible with PHP 4 and higher. Send in your feature requests.
    Downloads: 0 This Week
  • 18
    ShaniXmlParser is a small and fast XML/HTML DOM/SAX non-validating parser written in Java. It can parse invalid XML files. It uses the org.w3c.dom interfaces and the JAXP interfaces. It also works on Mono/.NET thanks to IKVM.
    Downloads: 0 This Week
  • 19
    Lobo is an extensible all-Java web browser and RIA platform. It supports HTML 4, JavaScript (AJAX), and CSS 2, plus direct JavaFX and Java (Swing/AWT) rendering. Cobra is the web browser's renderer API and also a JavaScript-aware HTML parser.
    Downloads: 21 This Week
  • 20
    QuickDoc is a Java document parser that reads documents from plain text files written in a simple language and exports them to other formats such as PDF, HTML, Java Help, and XML.
    Downloads: 0 This Week
  • 21
    NeatCleaner is a Velocity template file parser written in Java. It transforms dirty HTML/VTL/JS into a well-formed node tree and renders the node tree as prettified source output.
    Downloads: 0 This Week
  • 22
    Automatic generation of documentation for Delphi projects from source code. Distinctive features are exact parsing that gathers a lot of information, and a separation between the parser and the configurable generators (HTML, Win- & HTML-Help, PDF, LaTeX, XMI export).
    Downloads: 0 This Week
  • 23
    A simple PHP parser class that scans through poorly formed HTML documents as if they were valid XML and generates calls to customizable callback functions.
    Downloads: 0 This Week
  • 24
    `parsehtml.php` is an HTML parser written in PHP. It should be pretty fast and can even cope with invalid HTML input. Additionally, a function is provided to indent HTML code properly.
    Downloads: 0 This Week
  • 25
    PTM is a Perl/HTML hybrid, similar to PHP in tag syntax. It allows for the ease of PHP-style inline code in your dynamic HTML documents while supplying the power of Perl that many find to be sorely lacking in PHP.
    Downloads: 0 This Week