Showing 372 open source projects for "java html parser"

  • 1
    LOL HTML

    Low output latency streaming HTML parser/rewriter with CSS API

    Low Output Latency streaming HTML rewriter/parser with CSS-selector based API. It is designed to modify HTML on the fly with minimal buffering. It can quickly handle very large documents, and operate in environments with limited memory resources. The crate serves as a back-end for the HTML rewriting functionality of Cloudflare Workers, but can be used as a standalone library with a convenient API for a wide variety of HTML rewriting/analysis tasks. ...
    Downloads: 6 This Week
  • 2
    html-react-parser

    HTML to React parser

    HTML to React parser that works on both the server (Node.js) and the client (browser). The parser converts an HTML string to one or more React elements. Available as part of the Tidelift Subscription. For TypeScript projects, you may need to check that domNode is an instance of domhandler's Element. Make sure to render parsed adjacent elements under a parent element.
    Downloads: 0 This Week
  • 3
    html-loader

    HTML Loader

    ...Filter can also be used to extend the supported elements and attributes. By default, the parser in html-loader interprets content inside noscript tags as #text, so processing of content inside this tag will be ignored. A very common scenario is exporting the HTML into its own .html file, to serve it directly instead of injecting it with JavaScript.
    Downloads: 4 This Week
  • 4
    htmlparser2

    The fast & forgiving HTML and XML parser

    The fast & forgiving HTML and XML parser. htmlparser2 is the fastest HTML parser, and takes some shortcuts to get there. If you need strict HTML spec compliance, have a look at parse5. htmlparser2 itself provides a callback interface that allows the consumption of documents with minimal allocations. While the Parser interface closely resembles Node.js streams, it’s not a 100% match.
    Downloads: 3 This Week
  • 5
    parse5

    HTML parsing/serialization toolset for Node.js.

    HTML parsing/serialization toolset for Node.js. WHATWG HTML Living Standard (aka HTML5)-compliant. parse5 provides nearly everything you may need when dealing with HTML. It's the fastest spec-compliant HTML parser for Node to date. It parses HTML the way the latest version of your browser does. It has proven itself reliable in such projects as jsdom, Angular, Lit, Cheerio, rehype and many more.
    Downloads: 0 This Week
  • 6
    PostHTML

    PostHTML is a tool to transform HTML/XML with JS plugins

    ...PostHTML itself is very small. It includes only an HTML parser, an HTML node tree API, and a node tree stringifier.
    Downloads: 0 This Week
  • 7
    Nokogiri

    Tool to work with XML and HTML from Ruby

    Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. It provides a sensible, easy-to-understand API for reading, writing, modifying, and querying documents. It is fast and standards-compliant by relying on native parsers like libxml2 (C) and xerces (Java). Be secure-by-default by treating all documents as untrusted by default. Be a thin-as-reasonable layer on top of the underlying parsers, and don't attempt to fix behavioral differences between the parsers. ...
    Downloads: 1 This Week
  • 8
    fast-xml-parser

    Validate XML, Parse XML and Build XML rapidly

    Validate XML, Parse XML to JS Object, or Build XML from JS Object without C/C++ based libraries and no callback.
    Downloads: 0 This Week
  • 9
    jsoup

    Java library for working with real-world HTML

    ...The parser will make every attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 1 This Week
  • 10
    Floki

    Floki is a simple HTML parser that enables search for nodes using CSS

    Floki is a simple HTML parser that enables search for nodes using CSS selectors. Floki needs the :leex module in order to compile. Normally this module is installed with Erlang in a complete installation. By default, Floki uses a patched version of mochiweb_html for parsing fragments due to its ease of installation (it's written in Erlang and has no outside dependencies). fast_html is generally faster, according to the benchmarks conducted by its developers.
    Downloads: 0 This Week
  • 11
    Sanitize

    Ruby HTML and CSS sanitizer

    ...You can also allow specific CSS properties, @ rules, and URL protocols in elements or attributes containing CSS. Any HTML or CSS that you don't explicitly allow will be removed. Sanitize is based on the Nokogiri HTML5 parser, which parses HTML the same way modern browsers do, and Crass, which parses CSS the same way modern browsers do. As long as your allowlist config only allows safe markup and CSS, even the most malformed or malicious input will be transformed into safe output.
    Downloads: 0 This Week
  • 12
    HtmlSanitizer

    Cleans HTML to avoid XSS attacks

    HtmlSanitizer is a .NET library for cleaning HTML fragments and documents from constructs that can lead to XSS attacks. It uses AngleSharp to parse, manipulate, and render HTML and CSS. Because HtmlSanitizer is based on a robust HTML parser it can also shield you from deliberate or accidental "tag poisoning" where invalid HTML in one fragment can corrupt the whole document leading to broken layout or style.
    Downloads: 0 This Week
  • 13
    sakura

    A minimal CSS framework/theme

    ...Don't want to develop using sakura, but instead want to use it on websites with outdated 90's design (i.e. no CSS)? Quick prototyping, especially when working on backend sites and can't yet be bothered to fidget with CSS/HTML. Building a quick (but pretty) site/blog for your best friend or aunt! No need to remember tons of different class names for every other CSS framework. Works amazingly with markdown generated HTML pages (eliminates the need of hacks like including .img img-responsive in markdown-parser generated <img></img> tags).
    Downloads: 1 This Week
  • 14
    Atributika

    Convert text with HTML tags, links, hashtags, mentions, etc.

    Atributika is an easy and painless way to build NSAttributedString. It is able to detect HTML-like tags, links, phone numbers, hashtags, any regex or even standard ios data detectors and style them with various attributes like font, color, etc. Atributika comes with drop-in label replacement AttributedLabel which is able to make any detection clickable. NSAttributedString is really powerful but still a low-level API that requires a lot of work to set up things. It is especially painful if...
    Downloads: 0 This Week
  • 15
    Froala Editor

    The next generation Javascript WYSIWYG HTML Editor

    Froala Editor is a lightweight WYSIWYG HTML editor written in JavaScript that enables rich text editing capabilities for your applications. Froala WYSIWYG HTML Editor is one of the most powerful JavaScript rich text editors ever. Froala Rich Text Editor has a vast range of both simple and complex features for all kinds of use cases. Lots of features don't have to overwhelm the user with hundreds of buttons. The Froala WYSIWYG editor's smart toolbar can accommodate over 100 features in this...
    Downloads: 30 This Week
  • 16
    JSON Schema Validator

    A fast Java JSON schema validator that supports draft V4, V6, V7, V201

    This is a Java implementation of the JSON Schema Core Draft v4, v6, v7, v2019-09 and v2020-12 (partial) specification for JSON schema validation. In addition, it also works for OpenAPI 3.0 request/response validation with some configuration flags. For users who want to collect information from a JSON node based on the schema, the walkers can help.
    Downloads: 2 This Week
  • 17
    jackson-core

    Core part of Jackson that defines Streaming API

    This project contains core low-level incremental ("streaming") parser and generator abstractions used by Jackson Data Processor. It also includes the default implementation of handler types (parser, generator) that handle JSON format. The core abstractions are not JSON specific, although naming does contain 'JSON' in many places, due to historical reasons. Only packages that specifically contain the word 'json' are JSON-specific. This package is the base on which Jackson's data-binding...
    Downloads: 17 This Week
  • 18
    Jawn

    Jawn is for parsing jay-sawn (JSON)

    ...Also, there isn't a better way to describe objects encoded in JSON than "things". Finally, we get a catchy slogan. Jawn was designed to parse JSON into an AST as quickly as possible. Currently, Jawn is competitive with the fastest Java JSON libraries (GSON and Jackson) and in the author's benchmarks, it often wins. It seems to be faster than any other Scala parser that exists (as of July 2014).
    Downloads: 0 This Week
  • 19
    Kryo

    Java binary serialization and cloning, fast, efficient, automatic

    Kryo is a fast and efficient binary object graph serialization framework for Java. The goals of the project are high speed, low size, and an easy-to-use API. The project is useful any time objects need to be persisted, whether to a file, database or over the network. Kryo can also perform automatic deep and shallow copying/cloning. This is direct copying from object to object, not object to bytes to object. Kryo has three sets of methods for reading and writing objects. If the concrete class...
    Downloads: 1 This Week
  • 20
    Infer

    A static analyzer for Java, C, C++, and Objective-C

    Infer is a static analysis tool - if you give Infer some Java or C/C++/Objective-C code it produces a list of potential bugs. Anyone can use Infer to intercept critical bugs before they have shipped to users, and help prevent crashes or poor performance. Infer checks for null pointer exceptions, resource leaks, annotation reachability, missing lock guards, and concurrency race conditions in Android and Java code. Infer checks for null pointer dereferences, memory leaks, coding conventions...
    Downloads: 1 This Week
  • 21
    hyperx

    Tagged template string virtual dom builder

    Tagged template string virtual DOM builder. This module is similar to JSX, but provided as a standards-compliant ES6 tagged template string function. hyperx works with virtual-dom, react, hyperscript, or any DOM builder with a hyperscript-style API: h(tagName, attrs, children). You might also want to check out the hyperxify browserify transform to statically compile hyperx into JavaScript expressions to save sending the hyperx parser down the wire. Template strings are available in: node 4+,...
    Downloads: 0 This Week
  • 22
    Pedestal

    The Pedestal Server-side Libraries

    Pedestal is a set of libraries that we use to build services and applications. It runs in the back end and can serve up whole HTML pages or handle API requests. There are a lot of tools in that space, so why did we build Pedestal? We had two main reasons. Pedestal is designed for APIs first. Most web app frameworks still focus on the "page model" and server-side rendering. Pedestal lets you start simple and add that if you need it. Pedestal makes it easy to create "live" applications....
    Downloads: 0 This Week
  • 23
    JSON implementation for Ruby

    This is an implementation of the JSON specification according to RFC 4627. You can think of it as a low-fat alternative to XML if you want to store data on disk or transmit it over a network rather than use a verbose markup language. Both variants of the JSON generator generate UTF-8 character sequences by default. If an :ascii_only option with a true value is given, they escape all non-ASCII and control characters with \uXXXX escape sequences, and support UTF-16 surrogate pairs in order to...
    Downloads: 0 This Week
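    The ASCII-only escaping described in this entry can be illustrated with a rough Java sketch. It is not the Ruby library's code: the class name AsciiOnlyEscape and the method escape are hypothetical, and the sketch covers only the escaping of control and non-ASCII characters, not full JSON string encoding (quotes and backslashes are left alone). Because Java strings are sequences of UTF-16 code units, a character outside the Basic Multilingual Plane comes out as two escapes, one per surrogate.

```java
public final class AsciiOnlyEscape {
    private AsciiOnlyEscape() {}

    // Hypothetical helper: replaces every control or non-ASCII UTF-16
    // code unit with a backslash-u escape of four hex digits.
    // Printable ASCII (0x20 to 0x7E) is copied through unchanged.
    public static String escape(String input) {
        StringBuilder out = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i); // one UTF-16 code unit
            if (c < 0x20 || c > 0x7E) {
                // Surrogate halves are escaped one by one, which yields
                // a surrogate pair for supplementary characters.
                out.append(String.format("\\u%04x", (int) c));
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // An accented letter followed by an emoji (a surrogate pair).
        System.out.println(escape("caf\u00e9 \uD83D\uDE00"));
    }
}
```

    A real JSON generator would also escape double quotes and backslashes; this sketch leaves them out to keep the focus on the ASCII-only behavior.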
  • 24
    bluemonday

    Fast golang HTML sanitizer (inspired by the OWASP Java HTML Sanitizer)

    bluemonday is an HTML sanitizer implemented in Go. It is fast and highly configurable. bluemonday takes untrusted user-generated content as an input, and will return HTML that has been sanitized against an allowlist of approved HTML elements and attributes so that you can safely include the content in your web page. If you accept user-generated content, and your server uses Go, you need bluemonday. It protects sites from XSS attacks. There are many vectors for an XSS attack and the best way...
    Downloads: 0 This Week
  • 25
    A Simple CSV Parser for Java under a commercial-friendly Apache 2.0 license
    Downloads: 140 This Week