Showing 252 open source projects for "xml-parser"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI Studio. Switch between models without switching platforms.
    Start Free
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end. Migrate from on-prem or other clouds with free migration tools.
    Try Free
  • 1
    htmlparser2

    htmlparser2

    The fast & forgiving HTML and XML parser

    The fast & forgiving HTML and XML parser. htmlparser2 is the fastest HTML parser, and takes some shortcuts to get there. If you need strict HTML spec compliance, have a look at parse5. htmlparser2 itself provides a callback interface that allows the consumption of documents with minimal allocations. While the Parser interface closely resembles Node.js streams, it’s not a 100% match.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Nokogiri

    Nokogiri

    Tool to work with XML and HTML from Ruby

    Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. It provides a sensible, easy-to-understand API for reading, writing, modifying, and querying documents. It is fast and standards-compliant by relying on native parsers like libxml2 (C) and xerces (Java). Be secure-by-default by treating all documents as untrusted by default. Be a thin-as-reasonable layer on top of the underlying parsers, and don't attempt to fix behavioral differences between the parsers. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    quick-xml

    quick-xml

    Rust high performance xml reader and writer

    High-performance xml pull reader/writer. The reader is almost zero-copy (use of Cow whenever possible) It is easy on memory allocation (the API provides a way to reuse buffers) It supports various encoding (with encoding feature), namespaces resolution, and special characters. Syntax is inspired by xml-rs. When using the serialize feature, quick-xml can be used with serde's Serialize/Deserialize traits.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    PostHTML

    PostHTML

    PostHTML is a tool to transform HTML/XML with JS plugins

    PostHTML is a tool for transforming HTML/XML with JS plugins. PostHTML itself is very small. It includes only an HTML parser, an HTML node tree API and a node tree stringified. All HTML transformations are made by plugins. And these plugins are just small plain JS functions, which receive an HTML node tree, transform it, and return a modified tree. PostHTML is a tool for transforming HTML/XML with JS plugins.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    html-react-parser

    html-react-parser

    HTML to React parser

    HTML to React parser that works on both the server (Node.js) and the client (browser). The parser converts an HTML string to one or more React elements. Available as part of the Tidelift Subscription. For TypeScript projects, you may need to check that domNode is an instance of domhandler's Element. Make sure to render parsed adjacent elements under a parent element.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    LOL HTML

    LOL HTML

    Low output latency streaming HTML parser/rewriter with CSS API

    ...The parser switches back to the tag scanner as soon as input leaves the scope of all selector matches. The tag scanner may also sometimes switch the parser to the Lexer - if it requires additional tag information for the parsing feedback simulation. Having two different parser implementations for the same grammar will increase development costs and is error-prone due to implementation inconsistencies.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    parse5

    parse5

    HTML parsing/serialization toolset for Node.js.

    HTML parsing/serialization toolset for Node.js. WHATWG HTML Living Standard (aka HTML5)-compliant. parse5 provides nearly everything you may need when dealing with HTML. It's the fastest spec-compliant HTML parser for Node to date. It parses HTML the way the latest version of your browser does. It has proven itself reliable in such projects as jsdom, Angular, Lit, Cheerio, rehype and many more.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Floki

    Floki

    Floki is a simple HTML parser that enables search for nodes using CSS

    Floki is a simple HTML parser that enables search for nodes using CSS selectors. Floki needs the :leex module in order to compile. Normally this module is installed with Erlang in a complete installation. By default, Floki uses a patched version of mochiweb_html for parsing fragments due to its ease of installation (it's written in Erlang and has no outside dependencies). fast_html is generally faster, according to the benchmarks conducted by its developers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Parsedown

    Parsedown

    Better markdown parser in PHP

    Parsedown is capable of escaping user-input within the HTML that it generates. Additionally Parsedown will apply sanitisation to additional scripting vectors (such as scripting link destinations) that are introduced by the markdown syntax itself. If you wish to allow HTML within untrusted user-input, but still want output to be free from XSS it is recommended that you make use of a HTML sanitiser that allows HTML tags to be whitelisted, like HTML Purifier. In both cases you should strongly...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    html-loader

    html-loader

    HTML Loader

    ...Allows to setup which tags and attributes to process and how, as well as the ability to filter some of them. Filter can also be used to extend the supported elements and attributes. By default, the parser in html-loader interprets content inside noscript tags as #text, so processing of content inside this tag will be ignored. A very common scenario is exporting the HTML into their own .html file, to serve them directly instead of injecting with javascript.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    jsoup

    jsoup

    Java library for working with real-world HTML

    ...It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do. jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree. The parser will make every attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    sakura

    sakura

    A minimal CSS framework/theme

    ...No need to remember tons of different class names for every other CSS framework. Works amazingly with markdown generated HTML pages (eliminates the need of hacks like including .img img-responsive in markdown-parser generated <img></img> tags).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Sanitize

    Sanitize

    Ruby HTML and CSS sanitizer

    ...You can also allow specific CSS properties, @ rules, and URL protocols in elements or attributes containing CSS. Any HTML or CSS that you don't explicitly allow will be removed. Sanitize is based on the Nokogiri HTML5 parser, which parses HTML the same way modern browsers do, and Crass, which parses CSS the same way modern browsers do. As long as your allowlist config only allows safe markup and CSS, even the most malformed or malicious input will be transformed into safe output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    HtmlSanitizer

    HtmlSanitizer

    Cleans HTML to avoid XSS attacks

    HtmlSanitizer is a .NET library for cleaning HTML fragments and documents from constructs that can lead to XSS attacks. It uses AngleSharp to parse, manipulate, and render HTML and CSS. Because HtmlSanitizer is based on a robust HTML parser it can also shield you from deliberate or accidental "tag poisoning" where invalid HTML in one fragment can corrupt the whole document leading to broken layout or style. In order to facilitate different use cases, HtmlSanitizer can be customized at several levels. Configure allowed HTML tags through the property AllowedTags. All other tags will be stripped. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Atributika

    Atributika

    Convert text with HTML tags, links, hashtags, mentions, etc.

    Atributika is an easy and painless way to build NSAttributedString. It is able to detect HTML-like tags, links, phone numbers, hashtags, any regex or even standard ios data detectors and style them with various attributes like font, color, etc. Atributika comes with drop-in label replacement AttributedLabel which is able to make any detection clickable. NSAttributedString is really powerful but still a low-level API that requires a lot of work to set up things. It is especially painful if...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 17
    docconv

    docconv

    Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

    A Go wrapper library to convert PDF, DOC, DOCX, XML, HTML, RTF, ODT, Pages documents and images (see optional dependencies below) to plain text. See go help install for details on the installation location of the installed docd executable. Make sure that the full path to the executable is in your PATH environment variable. To add image support to the docconv library you first need to install and build gosseract.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    adx - addressbook.xml

    adx - addressbook.xml

    Minimalistic address book in web browser. No server or plugin needed.

    ...FEATURES Contact Management, portable, small (~350KB), lightweight, contact tagging, geo mapping, web accounts, trigger phone/Skype calls, etc. EXPORT FUNCTIONALITY vCard (as file or QR code via offline generator) HOW IT WORKS Your address-book (XML file) is transformed in your web browser (via XSLT) to a full-featured web application (HTML). REQUIREMENTS Web browser for viewing adx; online hosting or local storage is possible (for local storage most modern browsers need to be configured to allow access to local files); NO server, plugin or anything else needed. Any text editor can be used for contact editing (addressbook.xml).
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    hyperx

    hyperx

    Tagged template string virtual dom builder

    ...This module is similar to JSX, but provided as a standards-compliant ES6 tagged template string function. hyperx works with virtual-dom, react, hyperscript, or any DOM builder with a hyperscript-style API: h(tagName, attrs, children). You might also want to check out the hyperxify browserify transform to statically compile hyperx into javascript expressions to save sending the hyperx parser down the wire. Template strings are available in: node 4+, chrome 41, firefox 34, edge, opera 28, safari 9.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    FSharp.Data

    FSharp.Data

    F# Data: Library for Data Access

    The FSharp.Data package implements core functionality to access common data formats in your F# applications and scripts. It contains F# type providers for working with structured file formats (CSV, HTML, JSON and XML) and helpers for parsing CSV, HTML and JSON files and for sending HTTP requests. This library focuses on providing simple access to structured documents and other data sources. The FSharp.Data type providers for CSV, HTML, JSON and XML infer types from the structure of a sample document (or a document containing multiple samples). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    HtmlCleaner is HTML parser written in Java. It transforms dirty HTML to well-formed XML following the same rules that the most web-browsers use.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 22
    DiDOM

    DiDOM

    Simple and fast HTML and XML parser

    Simple and fast HTML and XML parser. DiDom allows loading HTML in several ways.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Plot

    Plot

    A DSL for writing type-safe HTML, XML and RSS in Swift

    Welcome to Plot, a domain-specific language (DSL) for writing type-safe HTML, XML and RSS in Swift. It can be used to build websites, documents and feeds, as a templating tool, or as a renderer for higher-level components and tools. It’s primary focus is on static site generation and Swift-based web development. Plot enables you to write HTML using native, fully compiled Swift code, by modeling the HTML5 standard’s various elements as Swift APIs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    XForms to XHTML+Javascript (AJAX) conversion based on a unique XSL transformation. Suitable server-side (PHP) or client-side (Internet Explorer, Mozilla FireFox, Opera, Safari) browser treatement where an XSLT 1.0 engine is available
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25
    Tidy

    Tidy

    The granddaddy of HTML tools, with support for modern standards

    Tidy is a console application for macOS, Linux, Windows, UNIX, and more. It corrects and cleans up HTML and XML documents by fixing markup errors and upgrading legacy code to modern standards. libtidy is a C static and dynamic library that developers can integrate into their applications in order to bring all of Tidy’s power to your favorite tools. libtidy is used today in desktop applications, web servers, and more. Tidy tidies HTML and XML.
    Downloads: 47 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB