Showing 77 open source projects for "html parser"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • 1
    htmlparser2

    htmlparser2

    The fast & forgiving HTML and XML parser

    The fast & forgiving HTML and XML parser. htmlparser2 is the fastest HTML parser, and takes some shortcuts to get there. If you need strict HTML spec compliance, have a look at parse5. htmlparser2 itself provides a callback interface that allows the consumption of documents with minimal allocations. While the Parser interface closely resembles Node.js streams, it’s not a 100% match.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    League CommonMark

    League CommonMark

    Highly-extensible PHP Markdown parser

    CommonMark is a PHP library that implements the CommonMark Markdown specification, allowing developers to convert Markdown into HTML. It provides a robust and extensible parser with support for additional syntax and extensions. The library is designed to be efficient and standards-compliant, making it ideal for applications that require consistent and reliable Markdown processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Jupyter Notebook Tools for Sphinx

    Jupyter Notebook Tools for Sphinx

    Sphinx source parser for Jupyter notebooks

    nbsphinx is a Sphinx extension that provides a source parser for *.ipynb files. Custom Sphinx directives are used to show Jupyter Notebook code cells (and of course their results) in both HTML and LaTeX output. Un-evaluated notebooks – i.e. notebooks without stored output cells – will be automatically executed during the Sphinx build process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    goquery

    goquery

    A little like that j-thing, only in Go

    goquery brings a syntax and a set of features similar to jQuery to the Go language. It is based on Go's net/HTML package and the CSS Selector library Cascadia. Since the net/html parser returns nodes, and not a full-featured DOM tree, jQuery's stateful manipulation functions (like height(), css(), and detach()) have been left off. Also, because the net/HTML parser requires UTF-8 encoding, so does goquery: it is the caller's responsibility to ensure that the source document provides UTF-8 encoded HTML. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Financial reporting cloud-based software. Icon
    Financial reporting cloud-based software.

    For companies looking to automate their consolidation and financial statement function

    The software is cloud based and automates complexities around consolidating and reporting for groups with multiple year ends, currencies and ERP systems with a slice and dice approach to reporting. While retaining the structure, control and validation needed in a financial reporting tool, we’ve managed to keep things flexible.
    Learn More
  • 5
    jsoup

    jsoup

    Java library for working with real-world HTML

    ...The parser will make every attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    Redcarpet

    Redcarpet

    The safe Markdown parser, reloaded

    ...Redcarpet comes with two built-in renderers, Redcarpet::Render::HTML and Redcarpet::Render::XHTML, which output HTML and XHTML, respectively.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Markdig

    Markdig

    A fast, powerful, CommonMark compliant, extensible Markdown processor

    A fast, powerful, CommonMark compliant, extensible Markdown processor for .NET. Very fast parser and HTML renderer (no-regexp), very lightweight in terms of GC pressure. Abstract Syntax Tree with precise source code location for syntax tree, useful when building a Markdown editor. Check out MarkdownEditor for Visual Studio powered by Markdig! Even the core Markdown/CommonMark parsing is pluggable, so it allows to disable built-in Markdown/Commonmark parsing (e.g Disable HTML parsing) or change behavior. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    sakura

    sakura

    A minimal CSS framework/theme

    ...Don't want to develop using sakura, but instead want to use it on websites with outdated 90's design (i.e. no CSS)? Quick prototyping, especially when working on backend sites and can't yet be bothered to fidget with CSS/HTML. Building a quick (but pretty) site/blog for your best friend or aunt! No need to remember tons of different class names for every other CSS framework. Works amazingly with markdown generated HTML pages (eliminates the need of hacks like including .img img-responsive in markdown-parser generated <img></img> tags).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Highlightr

    Highlightr

    iOS & OSX Syntax Highlighter

    ...Yes, Highlightr relies on iOS & macOS JavaScriptCore to parse the code using highlight.js. This is actually quite fast! It will never be as fast as a native solution, but it's fast enough to be used on a real-time editor. It comes with a custom-made HTML parser for creating NSAttributtedStrings, is pre-processing the themes, and is preloading the JS libraries. As result it's taking around of 50 ms on my iPhone 6s for processing 500 lines of code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • G-P - Global EOR Solution Icon
    G-P - Global EOR Solution

    Companies searching for an Employer of Record solution to mitigate risk and manage compliance, taxes, benefits, and payroll anywhere in the world

    With G-P's industry-leading Employer of Record (EOR) and Contractor solutions, you can hire, onboard and manage teams in 180+ countries — quickly and compliantly — without setting up entities.
    Learn More
  • 10
    JSDoc

    JSDoc

    An API documentation generator for JavaScript

    JSDoc 3 is an API documentation generator for JavaScript, similar to Javadoc or phpDocumentor. You add documentation comments directly to your source code, right alongside the code itself. The JSDoc tool will scan your source code and generate an HTML documentation website for you. JSDoc's purpose is to document the API of your JavaScript application or library. It is assumed that you will want to document things like modules, namespaces, classes, methods, method parameters, and so on. JSDoc comments should generally be placed immediately before the code being documented. Each comment must start with a /** sequence in order to be recognized by the JSDoc parser.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    LaRecipe

    LaRecipe

    Write gorgeous documentation for your products using Markdown

    ...LaRecipe provides a bunch of amazing-looking UI Vue-based components powered by TailwindCSS. LaRecipe allows you add your own custom assets though custom packages in order to match your needs. LaRecipe automatically leverages Markdown to HTML parser out of the box including typography, images, links and others. LaRecipe provides a bunch of amazing looking UI Vue based components due to the fact it compiles the markdown documentation in the back-end to HTML. If you have a very large documentation it's very handy to have search function available so that your users can find their needs quickly.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    ngx-markdown

    ngx-markdown

    Angular markdown component/directive/pipe/service to parse static

    Angular markdown component/directive/pipe/service to parse static, dynamic or remote content to HTML with syntax highlight and more. As the library is using Marked parser you will need to add node_modules/marked/marked.min.js to your application. To use the line numbers plugin that shows line numbers in code blocks, in addition to Prism.js configuration files, you will need to include the following files from prismjs/plugins/line-numbers directory to your application. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Cheerio

    Cheerio

    Implementation of core jQuery designed for the server

    ...Cheerio removes all the DOM inconsistencies and browser cruft from the jQuery library, revealing its truly gorgeous API. Cheerio works with a very simple, consistent DOM model. As a result parsing, manipulating, and rendering are incredibly efficient. Cheerio wraps around parse5 parser and can optionally use @FB55's forgiving htmlparser2. Cheerio can parse nearly any HTML or XML document. Cheerio parses markup and provides an API for traversing/manipulating the resulting data structure. It does not interpret the result as a web browser does. Specifically, it does not produce a visual rendering, apply CSS, load external resources, or execute JavaScript. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    AngleSharp

    AngleSharp

    The ultimate angle brackets parser library parsing HTML5, MathML, SVG

    AngleSharp follows the W3C specifications and gives you the same results as state of the art browsers. Besides the official API AngleSharp adds some useful extension methods on top. This makes working with the DOM convenient. AngleSharp integrates everything you need to explore and mutate the DOM tree. Node retrieval is straight forward by using powerful CSS query selectors. The CSS queries in AngleSharp are super fast and very simple to use. AngleSharp respects the relationship of HTML with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    mdBook

    mdBook

    Create books from markdown files

    mdBook is a command line tool and Rust crate to create books with Markdown. The output resembles tools like Gitbook, and is ideal for creating product or API documentation, tutorials, course materials or anything that requires a clean, easily navigable and customizable presentation. mdBook is written in Rust; its performance and simplicity made it ideal for use as a tool to publish directly to hosted websites such as GitHub Pages via automation. This guide, in fact, serves as both the mdBook...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Opal

    Opal

    Opal is a Ruby to JavaScript source-to-source compiler

    Opal is a Ruby to JavaScript source-to-source compiler. It comes packed with the Ruby corelib you know and love. It is both fast as a runtime and small in its footprint. The lib directory holds the Opal parser/compiler used to compile Ruby into JavaScript. It is also built ready for the browser into opal-parser.js to allow compilation in any JavaScript environment. This directory holds the Opal runtime and corelib implemented in Ruby and JavaScript. opal-parser allows you to eval Ruby code directly from your HTML (and from Opal) files without needing any other building process. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    hyperx

    hyperx

    Tagged template string virtual dom builder

    tagged template string virtual dom builder. This module is similar to JSX, but provided as a standards-compliant ES6 tagged template string function. hyperx works with virtual-dom, react, hyperscript, or any DOM builder with a hyperscript-style API: h(tagName, attrs, children). You might also want to check out the hyperxify browserify transform to statically compile hyperx into javascript expressions to save sending the hyperx parser down the wire. Template strings are available in: node 4+,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Rome formatter

    Rome formatter

    Unified developer tools for JavaScript, TypeScript, and the web

    ...Read more about our project philosophy. Rome is written in Rust. Rome has first-class IDE support, with a sophisticated parser that represents the source text in full fidelity and top-notch error recovery. Rome is MIT licensed and moderated under the Contributor Covenant Code of Conduct.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    PHP Simple HTML DOM Parser

    A php based DOM parser.

    A simple PHP HTML DOM parser written in PHP5+, supports invalid HTML, and provides a very easy way to find, extract and modify the HTML elements of the dom. jquery like syntax allow sophisticated finding methods for locating the elements you care about.
    Leader badge
    Downloads: 1,664 This Week
    Last Update:
    See Project
  • 20
    JDynamiTe, Dynamic Template in Java

    JDynamiTe, Dynamic Template in Java

    Dynamically generate documents from templates

    JDynamiTe is a tool which allows you to dynamically create documents in any format from "template" documents. And very few lines of code (or no line at all!) are needed to do that. Some typical usage domains of JDynamiTe are: - dynamic Web pages creation, - text document generation, - source code generation... In fact, it can be useful in any case where pre-defined documents (templates) have to be dynamically populated with data. The main benefit of JDynamiTe is to allow a true...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    HTML parser in Delphi

    A Delphi class with functions to read and dissect a HTML file

    THTMLdom is a (Delphi) class with functions to read a HTML source file and dissect it into a tree of THTMLelement. The attributes of the HTML tags are stored in the elements. Functions are provided to select elements on the basis of the attribute values or tag names. The structure of the tree can be shown and it can be rendered as plain text. The source is plain Delphi pascal, requiring a version that supports Tdictionary. There is no dependency on 3rd party units. The file to be parsed...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Jodd

    Jodd

    Jodd! Lightweight, Java, zero dependencies, use what you like

    Jodd is a set of micro-frameworks and developer-friendly tools and utilities. Code-things-done. Great performances. Tiny dependencies. Use what you like. Welcome to the Jodd, a unique, alternative world of Java. You can find here some micro-frameworks and some handy tools. Nothing more, nothing less. The decision which component is promoted is based on the Maven Central Repo statistics. Version v5 contains remaining libraries and frameworks. They are in maintenance mode.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Parsedown

    Parsedown

    Better markdown parser in PHP

    Parsedown is capable of escaping user-input within the HTML that it generates. Additionally Parsedown will apply sanitisation to additional scripting vectors (such as scripting link destinations) that are introduced by the markdown syntax itself. If you wish to allow HTML within untrusted user-input, but still want output to be free from XSS it is recommended that you make use of a HTML sanitiser that allows HTML tags to be whitelisted, like HTML Purifier. In both cases you should strongly...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    A common markup language and a parser to generate documentation in any target format (Html, Latex, Trac, Mediawiki...). The core command relies on a Tcl library: it is easy to create new target formats. Doc files are parameterizable via a header.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Aglio

    Aglio

    An API Blueprint renderer with theme support that outputs static HTML

    Aglio is a renderer for API Blueprint documents (Markdown‑based API description format), producing static HTML documentation with theme support. It functions as both a command‑line executable and a Node.js library, allowing integration into pipelines or live-preview workflows. An API Blueprint renderer that supports multiple themes and outputs static HTML that can be served by any web host. API Blueprint is a Markdown-based document format that lets you write API descriptions and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next