Showing 306 open source projects for "java html parser"

View related business solutions
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • Cloud-based help desk software with ServoDesk Icon
    Cloud-based help desk software with ServoDesk

    Full access to Enterprise features. No credit card required.

    What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
    Try ServoDesk for free
  • 1
    jsoup

    jsoup

    Java library for working with real-world HTML

    ...The parser will make every attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    JSON Schema Validator

    JSON Schema Validator

    A fast Java JSON schema validator that supports draft V4, V6, V7, V201

    This is a Java implementation of the JSON Schema Core Draft v4, v6, v7, v2019-09 and v2020-12(partial) specification for JSON schema validation. In addition, it also works for OpenAPI 3.0 request/response validation with some configuration flags. For users who want to collect information from a JSON node based on the schema, the walkers can help.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    jackson-core

    jackson-core

    Core part of Jackson that defines Streaming API

    This project contains core low-level incremental ("streaming") parser and generator abstractions used by Jackson Data Processor. It also includes the default implementation of handler types (parser, generator) that handle JSON format. The core abstractions are not JSON specific, although naming does contain 'JSON' in many places, due to historical reasons. Only packages that specifically contain the word 'json' are JSON-specific. This package is the base on which Jackson's data-binding...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 4
    Kryo

    Kryo

    Java binary serialization and cloning, fast, efficient, automatic

    Kryo is a fast and efficient binary object graph serialization framework for Java. The goals of the project are high speed, low size, and an easy-to-use API. The project is useful any time objects need to be persisted, whether to a file, database or over the network. Kryo can also perform automatic deep and shallow copying/cloning. This is direct copying from object to object, not object to bytes to object. Kryo has three sets of methods for reading and writing objects. If the concrete class...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Run applications fast and securely in a fully managed environment Icon
    Run applications fast and securely in a fully managed environment

    Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.

    Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
    Try for free
  • 5
    A Simple CSV Parser for Java under a commercial-friendly Apache 2.0 license
    Leader badge
    Downloads: 140 This Week
    Last Update:
    See Project
  • 6
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 7

    RecordEditor

    Editor for Fixed Width, Csv and Existing Xml files.

    The RecordEditor is a Data File editor for Flat Files (delimited and fixed field position). It supports Unix / PC / Legacy (e.g. Mainframe) file formats, both Text and binary files. The Editor uses a Record-Layout description to format the files. This is ideal for Fixed width (Text or Binary) files, Cobol Data Files, Mainframe files and complicated Csv files. Cobol Copybooks can be used to format Cobol Data files. As well as an editor, The following utilities are supplied * Formatted...
    Leader badge
    Downloads: 62 This Week
    Last Update:
    See Project
  • 8
    jPicEdt

    jPicEdt

    Another drawing editor for LaTeX with PSTricks & TikZ

    jPicEdt is an extensible internationalized vector-based drawing editor for LaTeX and related packages (TikZ, PsTricks,...), written in Java. It is also a library of reusable high-level graphic primitives.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 9
    ant4docbook

    ant4docbook

    ANT4DOCBOOK is an ANT task for DOCBOOK

    ANT4DOCBOOK is an ANT task for DOCBOOK, a semantic markup language for technical documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Free and Open Source HR Software Icon
    Free and Open Source HR Software

    OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.

    Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
    Learn More
  • 10

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 11
    HtmlCleaner is HTML parser written in Java. It transforms dirty HTML to well-formed XML following the same rules that the most web-browsers use.
    Downloads: 30 This Week
    Last Update:
    See Project
  • 12
    Writer2LaTeX and Writer2xhtml is a collection of converters from OpenDocument Format (ODF) to LaTeX/BibTeX, HTML+MathML and EPUB. It is delivered as a standalone java library, as a command line application and as extensions for LibreOffice.
    Leader badge
    Downloads: 36 This Week
    Last Update:
    See Project
  • 13
    FluentLenium

    FluentLenium

    FluentLenium is a web & mobile automation framework

    FluentLenium is a React-ready website automation framework that extends Selenium to write readable, reusable, reliable and resilient UI functional tests. It’s written and maintained by people who are automating browser-based tests on a daily basis. FluentLenium provides a Java-fluent interface to Selenium, and brings some magic to avoid common issues faced by Selenium users. FluentLenium is shipped with adapters for JUnit4, JUnit5, TestNG, Spock, Spring TestNG, Cucumber and Kotest, but it...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    JBake

    JBake

    Java based open source static site/blog generator for developers

    JBake is a Java-based, open source, static site/blog generator for developers & designers. The project uses Gradle 4.9+ as the build system. We configured the gradle check style Plugin to run with the check task. It does not break the build if convention violations are found. But prints a warning and generates a report. Source available on GitHub, licensed under MIT License.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    RTextDoc

    RTextDoc

    An editor for structured documents

    RTextDoc is an editor for structured text documents such as LaTeX, AsciiDoc, DocBook. RTextDoc has proofreading capabilities: on-the-fly spelling, instant grammar checking and built-in free dictionaries. RTextDoc has syntax highlighting, bracket matching, folding, document structure browser for sections and labels, bookmarks, manager for LaTeX symbols, an editor for mathematical equations,integrated BibTeX database manager and several tools to convert LaTeX to HTML and back. AsciiDoc...
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Java Marine API

    Java Marine API

    NMEA 0183 library for Java

    Java Marine API is an NMEA 0183 library for Java. For furher info and latest source code, see project's Github pages. http://github.com/ktuukkan/marine-api/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Swagger2Markup

    Swagger2Markup

    Swagger to AsciiDoc or Markdown converter

    The primary goal of this project is to simplify the generation of up-to-date RESTful API documentation by combining documentation that’s been hand-written with auto-generated API documentation produced by Swagger. The result is intended to be an up-to-date, easy-to-read, on- and offline user guide, comparable to GitHub’s API documentation. The output of Swagger2Markup can be used as an alternative to swagger-UI and can be served as static content. Swagger2Markup converts a Swagger JSON or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Problem

    Problem

    A Java library that implements application/problem+json

    Problem is a library that implements application/problem+json. It comes with an extensible set of interfaces/implementations as well as convenient functions for everyday use. It's decoupled from any JSON library but contains a separate module for Jackson.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    TeXtidote

    TeXtidote

    Spelling, grammar and style checking on LaTeX documents

    If so, you probably know that the process is far from simple. Since LaTeX documents contain special commands and keywords (the so-called "markup") that are not part of the "real" text, you cannot run a grammar checker directly on these files: it cannot tell the difference between markup and text. The other option is to remove all this markup, leaving only the "clear" text; however, when a grammar tool points to a problem at a specific line in this clear text, it becomes hard to retrace that...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    HTML Article Generator

    HTML Article Generator

    Quickly create custom webpages from your content

    HTML Article Generator is a tool for quickly generating webpages based on content you enter, including both text and images. These webpages can be customised to give a unique appearance, with a selection of 5 different themes. Other features include the ability to save the current values you have entered and restore these values after future changes have been made. Images can have caption text added to them and given alt text to improve accessibility. Each webpage can also be given a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    CSSBox

    CSSBox

    Pure Java HTML / CSS rendering engine

    CSSBox is an (X)HTML/CSS rendering engine written in pure Java. Its primary purpose is to provide a complete information about the rendered page suitable for further processing. However, it also allows displaying the rendered document.
    Leader badge
    Downloads: 18 This Week
    Last Update:
    See Project
  • 22
    XML Editor/Validator/Designer with CAMV

    XML Editor/Validator/Designer with CAMV

    CAM XML Editor for XML+JSON+Hibernate+SQL Open-XDX sponsored by Oracle

    Java/Eclipse +Saxon/XSL
    Downloads: 12 This Week
    Last Update:
    See Project
  • 23
    The Accelerator Markup Language (AML) / Universal Accelerator Parser (UAP) project will develop an XML based format for describing high energy particle accelerators along with associated software to convert lattice files to a standard internal struct
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Libre-3

    Libre-3

    Job Board

    Libre-3 (Libre free) Opérations :Grillons, Crickets, Sauterelles #Tanach Web-APP core architecture #JAVA EE7 #EJB3 #Eclipselink #MVC #JSF2.2 #Glassfish #MOJARRA Presentation #Facelet,#jquery #bootstrap Security #JAAS three #REALMs
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    XRichText

    XRichText

    An Android rich text class library that supports graphic & text mixing

    An Android-rich text class library that supports graphic and text mixing, supports editing and previewing and supports inserting and deleting pictures. Use ScrollView as the outermost layout containing LineaLayout, filled with TextView and ImageView. When deleting, delete the TextView and ImageView according to the position of the cursor, and the text will be automatically merged. The generated data is a list collection, and the data format can be customized. Version V1.4 opens the image...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next