Showing 373 open source projects for "java html parser"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Test your software product anywhere in the world Icon
    Test your software product anywhere in the world

    Get feedback from real people across 190+ countries with the devices, environments, and payment instruments you need for your perfect test.

    Global App Testing is a managed pool of freelancers used by Google, Meta, Microsoft, and other world-beating software companies.
    Try us today.
  • 1
    Libre-3

    Libre-3

    Job Board

    Libre-3 (Libre free) Opérations :Grillons, Crickets, Sauterelles #Tanach Web-APP core architecture #JAVA EE7 #EJB3 #Eclipselink #MVC #JSF2.2 #Glassfish #MOJARRA Presentation #Facelet,#jquery #bootstrap Security #JAAS three #REALMs
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    HTMLMinifier

    HTMLMinifier

    Javascript-based HTML compressor/minifier (with Node.js support)

    HTMLMinifier is a highly configurable, well-tested, JavaScript-based HTML minifier. Minifier options like sortAttributes and sortClassName won't impact the plain-text size of the output. However, they form long repetitive chains of characters that should improve compression ratio of gzip used in HTTP compression. SVG tags are automatically recognized, and when they are minified, both case-sensitivity and closing slashes are preserved, regardless of the minification settings used for the rest...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    A common markup language and a parser to generate documentation in any target format (Html, Latex, Trac, Mediawiki...). The core command relies on a Tcl library: it is easy to create new target formats. Doc files are parameterizable via a header.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is the tool to concatenate PDF files. It can concatenate, extract, encrypt, decrypt, configure PDF files, convert image files to PDF. GUI version and CUI version are both available. iText.NET is iText porting on .NET Framework by J#. This library allows you to generate PDF, (X)HTML, XML, RTF files on Microsoft.NET Framework including ASP.NET.
    Leader badge
    Downloads: 61 This Week
    Last Update:
    See Project
  • Secure remote access solution to your private network, in the cloud or on-prem. Icon
    Secure remote access solution to your private network, in the cloud or on-prem.

    Deliver secure remote access with OpenVPN.

    OpenVPN is here to bring simple, flexible, and cost-effective secure remote access to companies of all sizes, regardless of where their resources are located.
    Get started — no credit card required.
  • 5
    MyHTML

    MyHTML

    Fast C/C++ HTML 5 Parser

    Fast C/C++ HTML 5 Parser. Using threads. MyHTML is a fast HTML Parser using Threads implemented as a pure C99 library with no outside dependencies. Asynchronous Parsing, Build Tree, and Indexation. Fully conformant to the HTML5 specification. Two APIs - high and low-level. Manipulation of elements: add, change, delete, and others. Manipulation of elements attributes: add, change, delete, and other. Support 39-character encoding. Support detecting character encodings. Support Single Mode parsing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    undom

    undom

    1kb minimally viable DOM Document implementation

    Minimally viable DOM Document implementation. A bare-bones HTML DOM in a box. If you want the DOM but not a parser, this might be for you. JSFiddle Demo: Rendering preact components into an undom Document. Undom aims to find a sweet spot between size/performance and utility. The goal is to provide the simplest possible implementation of a DOM Document, such that libraries relying on the DOM can run in places where there isn't one available. The intent to keep things as simple as possible means...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions...
    Leader badge
    Downloads: 269 This Week
    Last Update:
    See Project
  • 8
    React HTML Parser

    React HTML Parser

    Converts HTML strings directly into React components

    Utility for converting HTML strings into React components. Converts standard HTML elements, attributes and inline styles into their React equivalents and provides a simple way to modify and replace the content. It is important to understand that this library should not be used as a direct replacement for using properly sanitized HTML and that it only provides the same level of protection that React does which does not provide 100% protection. All HTML should be properly sanitized using...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    libdropbox

    Small ANSI C lib for dropbox/windows azure communication

    Small ANSI C lib for dropbox and windows azure communication. Built for small platforms. Using PolarSSL for https communication. Features a small self contained https module and a modified version of the JSMN json parser. Originally based on the dropbox_uploader script. Able to do most dropbox actions. Eg. Upload file, download file, list, file info, account info, share link. Also contains a small CLI programs that interfaces with the lib. Also capable of windows azure service bus...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Himalaya

    Himalaya

    JavaScript HTML to JSON Parser

    Parse HTML into JSON. Himalaya bundles well with Browersify and Webpack. Himalaya transforms HTML into JSON, that's it. Himalaya is synchronous and does not require any complicated callbacks. Himalaya does not cut corners and returns an accurate representation of the HTML supplied. To remove whitespace, post-process the JSON; check out an example script. Himalaya can include the start and end positions of nodes in the parse output. To enable this, you can pass parse the parseDefaults extended...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    crawler4j

    crawler4j

    Open source web crawler for Java

    crawler4j is an open source web crawler for Java which provides a simple interface for crawling the Web. Using it, you can setup a multi-threaded web crawler in few minutes. You need to create a crawler class that extends WebCrawler. This class decides which URLs should be crawled and handles the downloaded page. shouldVisit function decides whether the given URL should be crawled or not. In the above example, this example is not allowing .css, .js and media files and only allows pages within...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    HTML Creator

    HTML Creator

    An easy-to-use tool to create html pages.

    This Java application allow users to easyly create web pages using a block structure to rapresent the HTML components.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Snowflake HTML

    Snowflake HTML

    SVG in Swift

    SVG in Swift. Use XML parser from Reindeer. The Shape object maps to SVG elements. The Style object encapsulates style information. The cool thing about CALayer is that most of its properties are animatable. Snowflake is available through CocoaPods. Snowflake is also available through Carthage. Snowflake can also be installed manually. Just download and drop Sources folders in your project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Cool Reader

    Cool Reader

    A cross-platform XML/CSS based eBook reader

    CoolReader is fast and small cross-platform XML/CSS based eBook reader for desktops and handheld devices. Supported formats: FB2, TXT, RTF, DOC, TCR, HTML, EPUB, CHM, PDB, MOBI. Platforms: Win32, Linux, Android. Ported on some eInk based devices.
    Leader badge
    Downloads: 481 This Week
    Last Update:
    See Project
  • 15
    Web Widget Toolkit (WTK): Server-side components for easily creating web-based user interfaces with complex navigation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    JuniCoder is a Java project that uses unicode as a base for decoding and encoding formats that invented workarounds to express characters not covered by ASCII. Decoders translate those inventions to unicode. Encoders encode to these inventions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    StoryParser

    A set of tools and libraries to help with writing eBooks

    A set of tools and libraries (available for C# and Java) that help with writing fiction and non-fiction drafts and then produce ePUB and Kindle eBooks. With these tools/libraries, drafts, written in HTML, can be analyzed to help with writing. such as generating outlines and associating scenes with keywords. When done writing, the tools/libraries can be used to make publishable eBook, automatically producing additional material, such as Table of Contents and Title Pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Xml2Json Converter

    Xml2Json Converter

    Simple tool for converting large XML-files to JSON or JSON to XML

    Simple converter tool with GUI (written on JavaFX) for converting large XML-files to JSON and JSON to XML with indicating progress and uses small amount of memory for converting. Starting from 1.2.0 application supports batch converting files from directory by pattern. Uses Java 1.8+ (http://www.oracle.com/technetwork/java/javase/downloads/jre8-downloads-2133155.html). Distributions for Mac, Linux and Windows already have embedded JRE, so just download appropriate distribution and start...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    Cross-platform visual XSLT generator
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    JHOVE

    File validation and characterization

    JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. JHOVE should not be confused with JHOVE2, a product with similar aims but a completely separate code base.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    VTD-XML is the next generation XML parser/indexer/editor/slicer/assembler/xpath-engine that goes beyond DOM, SAX and PULL in performance, memory usage, and ease of use.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 22
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Grotag
    Grotag views Amigaguide documents or converts them to HTML and DocBook XML. Additionally it can validate and pretty print such documents.
    Downloads: 0 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.