Showing 325 open source projects for "java html parser"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Cloud-based help desk software with ServoDesk Icon
    Cloud-based help desk software with ServoDesk

    Full access to Enterprise features. No credit card required.

    What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
    Try ServoDesk for free
  • 1
    undom

    undom

    1kb minimally viable DOM Document implementation

    Minimally viable DOM Document implementation. A bare-bones HTML DOM in a box. If you want the DOM but not a parser, this might be for you. JSFiddle Demo: Rendering preact components into an undom Document. Undom aims to find a sweet spot between size/performance and utility. The goal is to provide the simplest possible implementation of a DOM Document, such that libraries relying on the DOM can run in places where there isn't one available.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    HTML Creator

    HTML Creator

    An easy-to-use tool to create html pages.

    This Java application allow users to easyly create web pages using a block structure to rapresent the HTML components.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Cool Reader

    Cool Reader

    A cross-platform XML/CSS based eBook reader

    CoolReader is fast and small cross-platform XML/CSS based eBook reader for desktops and handheld devices. Supported formats: FB2, TXT, RTF, DOC, TCR, HTML, EPUB, CHM, PDB, MOBI. Platforms: Win32, Linux, Android. Ported on some eInk based devices.
    Leader badge
    Downloads: 560 This Week
    Last Update:
    See Project
  • 4
    Himalaya

    Himalaya

    JavaScript HTML to JSON Parser

    Parse HTML into JSON. Himalaya bundles well with Browersify and Webpack. Himalaya transforms HTML into JSON, that's it. Himalaya is synchronous and does not require any complicated callbacks. Himalaya does not cut corners and returns an accurate representation of the HTML supplied. To remove whitespace, post-process the JSON; check out an example script. Himalaya can include the start and end positions of nodes in the parse output. To enable this, you can pass parse the parseDefaults...
    Downloads: 0 This Week
    Last Update:
    See Project
  • eProcurement Software Icon
    eProcurement Software

    Enterprises and companies seeking a solution to manage all their procurement operations and processes

    eBuyerAssist by Eyvo is a cloud-based procurement solution designed for businesses of all sizes and industries. Fully modular and scalable, it streamlines the entire procurement lifecycle—from requisition to fulfillment. The platform includes powerful tools for strategic sourcing, supplier management, warehouse operations, and contract oversight. Additional modules cover purchase orders, approval workflows, inventory and asset management, customer orders, budget control, cost accounting, invoice matching, vendor credit checks, and risk analysis. eBuyerAssist centralizes all procurement functions into a single, easy-to-use system—improving visibility, control, and efficiency across your organization. Whether you're aiming to reduce costs, enhance compliance, or align procurement with broader business goals, eBuyerAssist helps you get there faster, smarter, and with measurable results.
    Learn More
  • 5
    React HTML Parser

    React HTML Parser

    Converts HTML strings directly into React components

    Utility for converting HTML strings into React components. Converts standard HTML elements, attributes and inline styles into their React equivalents and provides a simple way to modify and replace the content. It is important to understand that this library should not be used as a direct replacement for using properly sanitized HTML and that it only provides the same level of protection that React does which does not provide 100% protection. All HTML should be properly sanitized using a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    crawler4j

    crawler4j

    Open source web crawler for Java

    crawler4j is an open source web crawler for Java which provides a simple interface for crawling the Web. Using it, you can setup a multi-threaded web crawler in few minutes. You need to create a crawler class that extends WebCrawler. This class decides which URLs should be crawled and handles the downloaded page. shouldVisit function decides whether the given URL should be crawled or not.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Snowflake HTML

    Snowflake HTML

    SVG in Swift

    SVG in Swift. Use XML parser from Reindeer. The Shape object maps to SVG elements. The Style object encapsulates style information. The cool thing about CALayer is that most of its properties are animatable. Snowflake is available through CocoaPods. Snowflake is also available through Carthage. Snowflake can also be installed manually. Just download and drop Sources folders in your project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Web Widget Toolkit (WTK): Server-side components for easily creating web-based user interfaces with complex navigation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    JuniCoder is a Java project that uses unicode as a base for decoding and encoding formats that invented workarounds to express characters not covered by ASCII. Decoders translate those inventions to unicode. Encoders encode to these inventions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Leverage AI to Automate Medical Coding Icon
    Leverage AI to Automate Medical Coding

    Medical Coding Solution

    As a healthcare provider, you should be paid promptly for the services you provide to patients. Slow, inefficient, and error-prone manual coding keeps you from the financial peace you deserve. XpertDox’s autonomous coding solution accelerates the revenue cycle so you can focus on providing great healthcare.
    Learn More
  • 10

    StoryParser

    A set of tools and libraries to help with writing eBooks

    A set of tools and libraries (available for C# and Java) that help with writing fiction and non-fiction drafts and then produce ePUB and Kindle eBooks. With these tools/libraries, drafts, written in HTML, can be analyzed to help with writing. such as generating outlines and associating scenes with keywords. When done writing, the tools/libraries can be used to make publishable eBook, automatically producing additional material, such as Table of Contents and Title Pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Cross-platform visual XSLT generator
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Xml2Json Converter

    Xml2Json Converter

    Simple tool for converting large XML-files to JSON or JSON to XML

    Simple converter tool with GUI (written on JavaFX) for converting large XML-files to JSON and JSON to XML with indicating progress and uses small amount of memory for converting. Starting from 1.2.0 application supports batch converting files from directory by pattern. Uses Java 1.8+ (http://www.oracle.com/technetwork/java/javase/downloads/jre8-downloads-2133155.html). Distributions for Mac, Linux and Windows already have embedded JRE, so just download appropriate distribution and start application.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13

    JHOVE

    File validation and characterization

    JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. JHOVE should not be confused with JHOVE2, a product with similar aims but a completely separate code base.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    VTD-XML is the next generation XML parser/indexer/editor/slicer/assembler/xpath-engine that goes beyond DOM, SAX and PULL in performance, memory usage, and ease of use.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 15
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Grotag
    Grotag views Amigaguide documents or converts them to HTML and DocBook XML. Additionally it can validate and pretty print such documents.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18

    JSONjuicer

    JSON parser and encoder

    A Java open-source library which makes encoding and decoding Java data-structures to and from JSON text easy and intuitive.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    XmlDoclet

    A JavaDoc doclet that outputs source code structure in XML format.

    XmlDoclet is a JavaDoc doclet that outputs the source code structure of the packages, classes etc. in XML format. Later, the XML data may easily be processed by standard tools such as XSLT to produce HTML, PDF, dot graphs etc. Technically, this is done by wrapping the class and interfaces of the com.sun.javadoc packages into JAXB annotated classes, which allows for an easy serialization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    thymeleaf
    Thymeleaf is a java web template engine designed for XML/XHTML/HTML5.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    CosmoFile

    CosmoFile

    Convert your files,Edit pdf Files,Edit Images,Download files

    Looking for free software to convert your files ?CosmoFile is created for you ,a great software absolutely free for users to convert your files to many different formats.CosmoFile is very Simple and very fast and support many formats PDF,HTML,JPG,PNG,JPG,ICO,SVG,XLSX,PPTX... Edit Pdf Files with CosmoFile Looking for free software to modify PDF documents? Sometimes you need to make minor changes to a PDF file. For instance, you may want to hide your personal phone number from a PDF...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    ISO SAX

    ISO SAX

    Callback-based ISO media file parser

    ISO SAX is a callback-based parser for ISO container files (ISO/IEC 14496-12), e.g. MPEG-4. The libraries that are out there either won't run on Android, have many megabytes of dependent JARs, or will fail to parse your favorite media file due to a "technicality" it thinks it is mal-formed. For example, a perfectly good M4B file gets declared "invalid" because it had a video track in it (the album art), along with the sound track. Really!? Don't let these libraries "judge" the format...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Chunk, an HTML Template Engine for Java

    Chunk, an HTML Template Engine for Java

    Clean, powerful templates for Java

    A powerful Java Template Engine, great for building HTML or XML docs. Chunk can handle many other needs and situations as well. In-tag filters & default values, multiple snippets per file, layered themes, macros, conditional includes, localization & more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Ava: Testdata Xsl

    Ava: Testdata Xsl

    generates Testdata on base of excel: creates xml,excel,csv,html,sql,+

    this tool for test-data-generation receives an 'excel-sheet' as primary input. second important paramter is the 'number of test-records to produce'. The excel-data will be reused as long data is needed. This tool is hightly paramatrisazable by the use of 'xsl scripts'. data can be created, updated, modified and finally exported in a format of your choice Main Fuctions: (1) Generates Testdata (excel, xsl, xml) (2) Exports generated testdata in multiple formats (csv, excel, html,...
    Downloads: 0 This Week
    Last Update:
    See Project