Showing 24 open source projects for "html parser"

  • 1
    jsoup

    Java library for working with real-world HTML

    ...The parser will make every attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 2 This Week
  • 2
    JDynamiTe, Dynamic Template in Java

    Dynamically generate documents from templates

    JDynamiTe is a tool for dynamically creating documents in any format from "template" documents, using very few lines of code (or none at all!). Typical uses include dynamic web page creation, text document generation, and source code generation. In fact, it can be useful wherever pre-defined documents (templates) have to be dynamically populated with data. The main benefit of JDynamiTe is to allow a true...
    Downloads: 0 This Week
  • 3
    Jodd

    Jodd! Lightweight, Java, zero dependencies, use what you like

    Jodd is a set of micro-frameworks and developer-friendly tools and utilities. Code-things-done. Great performance. Tiny dependencies. Use what you like. Welcome to Jodd, a unique, alternative world of Java. Here you can find some micro-frameworks and some handy tools. Nothing more, nothing less. The decision about which component is promoted is based on Maven Central repository statistics. Version v5 contains the remaining libraries and frameworks, which are in maintenance mode.
    Downloads: 2 This Week
  • 4
    Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 8 This Week
  • 5
    NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
    Downloads: 2 This Week
  • 6

    jStyleParser

    Java CSS parser and DOM style assignment library

    jStyleParser is a CSS parser written in Java. It has its own application interface, designed to allow efficient CSS processing in Java and to map CSS values to Java data types. It can also apply the parsed style sheets to a DOM that represents an HTML or XML document and compute the resulting style of the individual document elements.
    Downloads: 0 This Week
  • 7
    JLevel

    Kogonuso JLevel is a simple Java library that changes the way HTML can be written inside Java(tm), in both applications and Servlets. It is more than an ordinary HTML parser. It automates all HTML tagging operations inside Java, so you no longer need to learn how to format HTML strings when developing an application that requires HTML in Java, or when using Servlet technology to develop your websites. ...
    Downloads: 0 This Week
  • 8
    SourceDoc is a system for automatic creation, extraction, and verification of embedded documentation. Designed for C code, it features both a C parser and a preprocessor. The default output format is HTML, but other formats can be plugged in using a public Java interface.
    Downloads: 0 This Week
  • 9
    HTML Parser
    HTML Parser is a Java library used to parse HTML in either a linear or nested fashion. Primarily used for transformation or extraction, it features filters, visitors, custom tags, and easy-to-use JavaBeans. It is a fast, robust, and well-tested package.
    Downloads: 6 This Week
  • 10
    TagParser is a Java parser based on CSS selectors (like jQuery) that can parse any tag-based document, such as XML or HTML. It does not require documents to be well formed and can parse complex documents with embedded scripts or CSS parts.
    Downloads: 0 This Week
  • 11
    Lightweight XML processor; XML-POJO mapping via Java5 annotations or DTD; Preprocessing of XML documents using expression language; Binary XML; RMI friendly XML; JSON format support; XML marshall/unmarshall; HTML as XML parser; Swing XML Viewer
    Downloads: 0 This Week
  • 12
    ShaniXmlParser is a small and fast XML/HTML DOM/SAX non-validating parser written in Java. It can parse invalid XML files. It uses the org.w3c.dom interfaces and the JAXP interfaces. It also works on Mono/.NET thanks to IKVM.
    Downloads: 0 This Week
  • 13
    Lobo is an extensible all-Java web browser and RIA platform. It supports HTML 4, JavaScript (AJAX) and CSS 2, plus direct JavaFX and Java (Swing/AWT) rendering. Cobra is the web browser's renderer API; it is also a JavaScript-aware HTML parser.
    Downloads: 2 This Week
  • 14
    QuickDoc is a Java document parser that reads documents from plain text files written in a simple language and exports them to other formats such as PDF, HTML, Java Help and XML.
    Downloads: 0 This Week
  • 15
    NeatCleaner is a Velocity template file parser written in Java. It transforms dirty HTML/VTL/JS into a well-formed node tree and renders that tree as pretty-printed source.
    Downloads: 0 This Week
  • 16
    A parser generator derived from ANTLR with JavaCC-like (but not identical) syntax and some new features. It also comes with sample grammars for CSharp, Java, GnuC, Html, XHtml, Xml, ECMA262... etc.
    Downloads: 1 This Week
  • 17
    A Java suite for parsing arbitrary text data. Not just HTML or XML or Java, but all of them. Use it when the JDK tokenizers are too limited, when JavaCC, JTB etc. are too complicated, or when you need dynamic parser configuration.
    Downloads: 0 This Week
  • 18
    Web documents that look similar often use different HTML tags to achieve their layout effect. These tags often make it difficult for a machine to find text or images of interest. Our goal is to implement a parser to overcome this.
    Downloads: 0 This Week
  • 19
    Free lightweight HTML/XML parser.
    Downloads: 0 This Week
  • 20
    Java API to process or parse HTML documents. If your Java application needs or would like to be able to process some text in HTML format, you'd probably find this API interesting.
    Downloads: 0 This Week
  • 21
    jxml2sql is a Java application for converting database structures in XML to other formats useful for database administration (e.g. SQL for table creation, HTML for reference docs). jxml2sql uses a minimalistic, non-validating Java XML parser (NanoXML).
    Downloads: 0 This Week
  • 22
    Arachnid is a Java-based web spider framework. It includes a simple HTML parser object that parses an input stream containing HTML content. Simple Web spiders can be created by sub-classing Arachnid and adding a few lines of code called after each page
    Downloads: 0 This Week
  • 23
    HotSAX is a fast, small footprint, non-validating SAX2 parser for HTML/XML/XHTML. It can be used in simple web agents, page scrapers and spiders. The goal is to embed this in cell phone "midlets."
    Downloads: 0 This Week
  • 24
    A performance benchmarking package for Java XML parsers. This tool tests parsers supporting the SAX1, SAX2, JAXP, and XML Pull Parser interfaces. It produces output in XML and HTML.
    Downloads: 0 This Week