Showing 372 open source projects for "java html parser"

  • 1
    XRichText

    An Android rich text class library that supports graphic & text mixing

    An Android rich text class library that supports graphic and text mixing, editing and previewing, and inserting and deleting pictures. It uses a ScrollView as the outermost layout containing a LinearLayout, filled with TextView and ImageView. When deleting, the TextView and ImageView are deleted according to the position of the cursor, and the text is automatically merged. The generated data is a list collection, and the data format can be customized. Version V1.4 opens the image...
    Downloads: 0 This Week
  • 2
    HTMLMinifier

    JavaScript-based HTML compressor/minifier (with Node.js support)

    HTMLMinifier is a highly configurable, well-tested, JavaScript-based HTML minifier. Minifier options like sortAttributes and sortClassName won't impact the plain-text size of the output. However, they form long repetitive chains of characters that should improve compression ratio of gzip used in HTTP compression. SVG tags are automatically recognized, and when they are minified, both case-sensitivity and closing slashes are preserved, regardless of the minification settings used for the rest...
    Downloads: 2 This Week
  • 3
    A common markup language and a parser to generate documentation in any target format (HTML, LaTeX, Trac, MediaWiki...). The core command relies on a Tcl library, so it is easy to create new target formats. Doc files are parameterizable via a header.
    Downloads: 0 This Week
  • 4

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is a tool for concatenating PDF files. It can concatenate, extract, encrypt, decrypt, and configure PDF files, and convert image files to PDF. Both GUI and CUI versions are available. iText.NET is a port of iText to the .NET Framework using J#. This library allows you to generate PDF, (X)HTML, XML, and RTF files on the Microsoft .NET Framework, including ASP.NET.
    Downloads: 51 This Week
  • 5
    MyHTML

    Fast C/C++ HTML 5 Parser

    MyHTML is a fast, multithreaded HTML5 parser implemented as a pure C99 library with no outside dependencies. Features: asynchronous parsing, tree building, and indexation. Fully conformant to the HTML5 specification. Two APIs: high-level and low-level. Manipulation of elements: add, change, delete, and others. Manipulation of element attributes: add, change, delete, and others.
    Downloads: 1 This Week
  • 6

    libdropbox

    Small ANSI C lib for Dropbox/Windows Azure communication

    Small ANSI C lib for Dropbox and Windows Azure communication. Built for small platforms. Uses PolarSSL for HTTPS communication. Features a small self-contained HTTPS module and a modified version of the JSMN JSON parser. Originally based on the dropbox_uploader script. Able to do most Dropbox actions, e.g. upload file, download file, list, file info, account info, share link. Also contains a small CLI program that interfaces with the lib. Also capable of Windows Azure service bus...
    Downloads: 0 This Week
  • 7
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. ...
    Downloads: 273 This Week
  • 8
    undom

    1kb minimally viable DOM Document implementation

    Minimally viable DOM Document implementation. A bare-bones HTML DOM in a box. If you want the DOM but not a parser, this might be for you. JSFiddle Demo: Rendering preact components into an undom Document. Undom aims to find a sweet spot between size/performance and utility. The goal is to provide the simplest possible implementation of a DOM Document, such that libraries relying on the DOM can run in places where there isn't one available.
    Downloads: 0 This Week
  • 9
    HTML Creator

    An easy-to-use tool to create HTML pages.

    This Java application allows users to easily create web pages using a block structure to represent the HTML components.
    Downloads: 0 This Week
  • 10
    Cool Reader

    A cross-platform XML/CSS based eBook reader

    CoolReader is a fast and small cross-platform XML/CSS-based eBook reader for desktops and handheld devices. Supported formats: FB2, TXT, RTF, DOC, TCR, HTML, EPUB, CHM, PDB, MOBI. Platforms: Win32, Linux, Android. Ported to some eInk-based devices.
    Downloads: 560 This Week
  • 11
    Himalaya

    JavaScript HTML to JSON Parser

    Parse HTML into JSON. Himalaya bundles well with Browserify and Webpack. Himalaya transforms HTML into JSON, that's it. Himalaya is synchronous and does not require any complicated callbacks. Himalaya does not cut corners and returns an accurate representation of the HTML supplied. To remove whitespace, post-process the JSON; check out an example script. Himalaya can include the start and end positions of nodes in the parse output. To enable this, you can pass parse the parseDefaults...
    Downloads: 0 This Week
  • 12
    React HTML Parser

    Converts HTML strings directly into React components

    Utility for converting HTML strings into React components. Converts standard HTML elements, attributes and inline styles into their React equivalents and provides a simple way to modify and replace the content. It is important to understand that this library should not be used as a direct replacement for using properly sanitized HTML and that it only provides the same level of protection that React does, which does not provide 100% protection. All HTML should be properly sanitized using a...
    Downloads: 0 This Week
  • 13
    crawler4j

    Open source web crawler for Java

    crawler4j is an open source web crawler for Java which provides a simple interface for crawling the Web. Using it, you can set up a multi-threaded web crawler in a few minutes. You need to create a crawler class that extends WebCrawler. This class decides which URLs should be crawled and handles the downloaded page. The shouldVisit function decides whether the given URL should be crawled or not.
    Downloads: 1 This Week
  • 14
    Web Widget Toolkit (WTK): Server-side components for easily creating web-based user interfaces with complex navigation.
    Downloads: 0 This Week
  • 15
    Snowflake HTML

    SVG in Swift

    SVG in Swift. Uses the XML parser from Reindeer. The Shape object maps to SVG elements. The Style object encapsulates style information. The cool thing about CALayer is that most of its properties are animatable. Snowflake is available through CocoaPods. Snowflake is also available through Carthage. Snowflake can also be installed manually. Just download and drop the Sources folders into your project.
    Downloads: 0 This Week
  • 16
    JuniCoder is a Java project that uses Unicode as a base for decoding and encoding formats that invented workarounds to express characters not covered by ASCII. Decoders translate those inventions to Unicode. Encoders encode to these inventions.
    Downloads: 0 This Week
  • 17

    StoryParser

    A set of tools and libraries to help with writing eBooks

    A set of tools and libraries (available for C# and Java) that help with writing fiction and non-fiction drafts and then produce ePUB and Kindle eBooks. With these tools/libraries, drafts written in HTML can be analyzed to help with writing, such as generating outlines and associating scenes with keywords. When done writing, the tools/libraries can be used to make a publishable eBook, automatically producing additional material, such as a Table of Contents and Title Pages.
    Downloads: 0 This Week
  • 18
    Cross-platform visual XSLT generator
    Downloads: 0 This Week
  • 19
    Xml2Json Converter

    Simple tool for converting large XML files to JSON or JSON to XML

    Simple converter tool with a GUI (written in JavaFX) for converting large XML files to JSON and JSON to XML. It shows conversion progress and uses a small amount of memory. Starting from 1.2.0, the application supports batch conversion of files from a directory by pattern. Uses Java 1.8+ (http://www.oracle.com/technetwork/java/javase/downloads/jre8-downloads-2133155.html). Distributions for Mac, Linux and Windows already have an embedded JRE, so just download the appropriate distribution and start the application.
    Downloads: 7 This Week
  • 20

    JHOVE

    File validation and characterization

    JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. JHOVE should not be confused with JHOVE2, a product with similar aims but a completely separate code base.
    Downloads: 0 This Week
  • 21
    VTD-XML is the next generation XML parser/indexer/editor/slicer/assembler/xpath-engine that goes beyond DOM, SAX and PULL in performance, memory usage, and ease of use.
    Downloads: 12 This Week
  • 22
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
  • 23
    Grotag
    Grotag views AmigaGuide documents or converts them to HTML and DocBook XML. Additionally, it can validate and pretty-print such documents.
    Downloads: 1 This Week
  • 24

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogeneous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 2 This Week
  • 25

    JSONjuicer

    JSON parser and encoder

    A Java open-source library which makes encoding and decoding Java data-structures to and from JSON text easy and intuitive.
    Downloads: 0 This Week