Showing 2095 open source projects for "html parser java"

  • 1
    html-react-parser

    HTML to React parser

    HTML to React parser that works on both the server (Node.js) and the client (browser). The parser converts an HTML string to one or more React elements. Available as part of the Tidelift Subscription. For TypeScript projects, you may need to check that domNode is an instance of domhandler's Element. Make sure to render parsed adjacent elements under a parent element.
    Downloads: 0 This Week
  • 2
    Java Tablesaw

    Java dataframe and visualization library

    Tablesaw is a dataframe and visualization library that supports loading, cleaning, transforming, filtering, and summarizing data. If you work with data in Java, it may save you time and effort. Tablesaw also supports descriptive statistics and can be used to prepare data for working with machine learning libraries like Smile, Tribuo, H2O.ai, and DL4J. Import data from RDBMS, Excel, CSV, TSV, JSON, HTML, or fixed-width text files, whether they are local or remote (HTTP, S3, etc.). Tablesaw supports...
    Downloads: 2 This Week
  • 3
    html-loader

    HTML Loader

    ... and attributes. By default, the parser in html-loader interprets content inside noscript tags as #text, so processing of content inside this tag will be ignored. A very common scenario is exporting the HTML into their own .html file, to serve them directly instead of injecting with javascript.
    Downloads: 0 This Week
  • 4
    LOL HTML

    Low output latency streaming HTML parser/rewriter with CSS API

    Low Output Latency streaming HTML rewriter/parser with CSS-selector based API. It is designed to modify HTML on the fly with minimal buffering. It can quickly handle very large documents, and operate in environments with limited memory resources. The crate serves as a back-end for the HTML rewriting functionality of Cloudflare Workers, but can be used as a standalone library with a convenient API for a wide variety of HTML rewriting/analysis tasks. The parser switches back to the tag scanner...
    Downloads: 0 This Week
  • 5
    fast-xml-parser

    Validate XML, Parse XML and Build XML rapidly

    Validate XML, Parse XML to JS Object, or Build XML from JS Object without C/C++ based libraries and no callback.
    Downloads: 0 This Week
  • 6
    Amazon Kinesis Video Streams Parser Lib

    Easy to work with the output of video streams

    The Amazon Kinesis Video Streams parser library is meant for developers to include in their applications; it makes it easy to work with the output of video streams, such as retrieving frame-level objects, metadata for fragments, and more. The Amazon Kinesis Video Streams Parser Library for Java enables Java developers to parse the streams returned by GetMedia calls to Amazon Kinesis Video. It includes a streaming MKV parser called StreamingMkvReader that provides an iterative interface to read the MkvElements...
    Downloads: 0 This Week
  • 7
    commonmark-java

    Java library for parsing and rendering CommonMark (Markdown)

    Java library for parsing and rendering Markdown text according to the CommonMark specification (and some extensions). Provides classes for parsing input to an abstract syntax tree of nodes (AST), visiting and manipulating nodes, and rendering to HTML. It started out as a port of commonmark.js, but has since evolved into a full library with a nice API.
    Downloads: 0 This Week
  • 8
    ANTLR

    Parser generator to read, process, or translate structured text

    ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files. From a grammar, ANTLR generates a parser that can build and walk parse trees. It’s widely used in academia and industry to build all sorts of languages, tools, and frameworks. Twitter search uses ANTLR for query parsing, with over 2 billion queries a day. The languages for Hive...
    Downloads: 20 This Week
  • 9
    jsoup

    Java library for working with real-world HTML

    jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do. jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree. The parser will make every...
    Downloads: 3 This Week
  • 10
    FastQC

    A quality control analysis tool for high throughput sequencing data

    FastQC is a quality control analysis tool designed to spot potential problems in high throughput sequencing datasets. Its goal is to provide a simple way by which to check the quality of raw sequence data coming from high throughput sequencing pipelines. It does this by running a modular set of analyses on one or more raw sequence files in fastq or bam format. It then produces a report summarizing the results, and highlighting any areas where the library may appear unusual. This should then...
    Downloads: 72 This Week
  • 11
    Froala Editor

    The next-generation JavaScript WYSIWYG HTML editor

    Froala Editor is a lightweight WYSIWYG HTML editor written in JavaScript that enables rich text editing capabilities for your applications. Froala WYSIWYG HTML Editor is one of the most powerful JavaScript rich text editors ever. Froala Rich Text Editor has a vast range of both simple and complex features for all kinds of use cases. Lots of features don't have to overwhelm the user with hundreds of buttons. The Froala WYSIWYG editor's smart toolbar can accommodate over 100 features...
    Downloads: 28 This Week
  • 12
    pdf-extractor

    Node.js module for rendering pdf pages to images, svgs and HTML files

    Pdf-extractor is a wrapper around pdf.js to generate images, svgs, html files, text files and json files from a pdf on node.js. A DOM Canvas is used to render and export the graphical layer of the pdf. Canvas exports *.png as a default but can be extended to export to other file types like .jpg. Pdf objects are converted to svg using the SVGGraphics parser of pdf.js. Pdf text is converted to HTML. This can be used as a (transparent) layer over the image to enable text selection. Pdf text...
    Downloads: 15 This Week
  • 13
    PHP Simple HTML DOM Parser

    A PHP-based DOM parser.

    A simple HTML DOM parser written in PHP5+ that supports invalid HTML and provides a very easy way to find, extract, and modify the HTML elements of the DOM. A jQuery-like syntax allows sophisticated finding methods for locating the elements you care about.
    Downloads: 977 This Week
  • 14
    Karate

    Test automation made simple

    Karate is the only open-source tool to combine API test-automation, mocks, performance-testing and even UI automation into a single, unified framework. The BDD syntax popularized by Cucumber is language-neutral, and easy for even non-programmers. Assertions and HTML reports are built-in, and you can run tests in parallel for speed. There’s also a cross-platform stand-alone executable for teams not comfortable with Java. You don’t have to compile code. Just write tests in a simple, readable...
    Downloads: 21 This Week
  • 15
    DiDOM

    Simple and fast HTML and XML parser

    Simple and fast HTML and XML parser. DiDom allows loading HTML in several ways.
    Downloads: 3 This Week
  • 16
    Swagger Codegen

    Template-driven engine to generate documentation

    Swagger-Codegen contains a template-driven engine to generate documentation, API clients and server stubs in different languages by parsing your OpenAPI / Swagger definition. Simplify API development for users, teams, and enterprises with the Swagger open source and professional toolset. Find out how Swagger can help you design and document your APIs at scale. The power of Swagger tools starts with the OpenAPI Specification, the industry standard for RESTful API design. Individual tools to...
    Downloads: 14 This Week
  • 17
    htmlparser2

    The fast & forgiving HTML and XML parser

    The fast & forgiving HTML and XML parser. htmlparser2 is the fastest HTML parser, and takes some shortcuts to get there. If you need strict HTML spec compliance, have a look at parse5. htmlparser2 itself provides a callback interface that allows the consumption of documents with minimal allocations. While the Parser interface closely resembles Node.js streams, it’s not a 100% match. Use the WritableStream interface to process a streaming input.
    Downloads: 1 This Week
  • 18
    crawley

    The unix-way web crawler

    Crawls web pages and prints any link it can find. Fast HTML SAX parser (powered by golang.org/x/net/html). Small (below 1500 SLOC), idiomatic, 100% test-covered codebase. Grabs most useful resource URLs (pics, videos, audios, forms, etc.). Found URLs are streamed to stdout and guaranteed to be unique (with fragments omitted). Scan depth (limited by starting host and path; 0 by default) can be configured. Can crawl rules and sitemaps from robots.txt. Brute mode - scan HTML comments for URLs...
    Downloads: 3 This Week
  • 19
    Intellij rainbow brackets

    Rainbow Brackets for IntelliJ based IDEs/Android Studio/HUAWEI DevEco

    Compatible with IntelliJ IDEA, PhpStorm, WebStorm, PyCharm, RubyMine, AppCode, CLion, Gogland, DataGrip, Rider, MPS, Android Studio, HUAWEI DevEco Studio, DataSpell, Code With Me. Supports Java, Scala, Clojure, Kotlin, Python, Haskell, Agda, Rust, JavaScript, TypeScript, Erlang, Go, Groovy, Ruby, Elixir, ObjectiveC, PHP, HTML, XML, SQL, Apex language, C#, Dart, Pug/Jade, Bash, Vue.js, C# Razor Pages, GLSL(the OpenGL Shading Language), Go Template, C++, C, etc. Eliminate context switching...
    Downloads: 8 This Week
  • 20
    Portfolio Performance

    Track and evaluate the performance of your investment portfolio

    Portfolio Performance: Track and evaluate the performance of your investment portfolio across stocks, cryptocurrencies, and other assets. Performance indicators such as the true time-weighted rate of return and the internal rate of return are calculated. Historical prices are loaded automatically from Yahoo Finance or can be extracted from any HTML page. Thanks to the open file format, all data is available as XML or can be exported as CSV. Support...
    Downloads: 6 This Week
  • 21
    Joern

    Open-source code analysis platform for C/C++/Java/Binary/Javascript

    Joern is a platform for analyzing source code, bytecode, and binary executables. It generates code property graphs (CPGs), a graph representation of code for cross-language code analysis. Code property graphs are stored in a custom graph database. This allows code to be mined using search queries formulated in a Scala-based domain-specific query language. Joern is developed with the goal of providing a useful tool for vulnerability discovery and research in static program analysis.
    Downloads: 3 This Week
  • 22
    JSON Schema Validator

    A fast Java JSON schema validator that supports draft V4, V6, V7, V201

    This is a Java implementation of the JSON Schema Core Draft v4, v6, v7, v2019-09, and v2020-12 (partial) specifications for JSON schema validation. In addition, it also works for OpenAPI 3.0 request/response validation with some configuration flags. For users who want to collect information from a JSON node based on the schema, the walkers can help. The default JSON parser is Jackson, which is the most popular one. As it is a key component in our light-4j microservices framework to validate...
    Downloads: 2 This Week
  • 23
    springdoc-openapi

    Library for OpenAPI 3 with spring-boot

    Extended support for the springdoc-openapi v1 project is now available for organizations that need support beyond 2023. The springdoc-openapi Java library helps automate the generation of API documentation using Spring Boot projects. springdoc-openapi works by examining an application at runtime to infer API semantics based on Spring configurations, class structure, and various annotations. The library automatically generates documentation in JSON/YAML and HTML formatted pages. The generated...
    Downloads: 3 This Week
  • 24
    HtmlSanitizer

    Cleans HTML to avoid XSS attacks

    HtmlSanitizer is a .NET library for cleaning HTML fragments and documents from constructs that can lead to XSS attacks. It uses AngleSharp to parse, manipulate, and render HTML and CSS. Because HtmlSanitizer is based on a robust HTML parser it can also shield you from deliberate or accidental "tag poisoning" where invalid HTML in one fragment can corrupt the whole document leading to broken layout or style. In order to facilitate different use cases, HtmlSanitizer can be customized at several...
    Downloads: 1 This Week
  • 25
    WebMagic

    A scalable web crawler framework for Java

    WebMagic is a simple but scalable crawler framework. It covers the whole crawler lifecycle: downloading, URL management, content extraction, and persistence. It can simplify the development of a specific crawler. WebMagic has a simple core with high flexibility and a simple API for HTML extraction. It also provides POJO annotations to customize a crawler, with no configuration needed. Some other features...
    Downloads: 2 This Week