Showing 41 open source projects for "html parser"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Context for your AI agents Icon
    Context for your AI agents

    Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

    Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
    Try for free
  • 1
    jsoup

    jsoup

    Java library for working with real-world HTML

    ...The parser will make every attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Lobo Evolution - Java Web Browser

    Lobo Evolution - Java Web Browser

    Lobo Evolution is an extensible all-Java web browser and RIA platform

    Lobo Evolution is a fork of Lobo Browser. The project continuing the work of Lobo Browser(lobochief). Lobo Evolution is an extensible all-Java web browser and RIA platform. It supports HTML 4, HTML5 Javascript, CSS 3 and Java (Swing) rendering. CobraEvolution is the web browser's renderer API; also a Javascript-aware HTML parser. Lobo Evolution 5.0 relesed CHANGELOG: https://github.com/LoboEvolution/LoboEvolution/releases Read wiki: https://loboevolution.github.io/LoboEvolution/project-info.html Javadoc site: https://oswetto.github.io/LoboEvolution Now you can fork the project and help me with code. ...
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    HtmlCleaner is HTML parser written in Java. It transforms dirty HTML to well-formed XML following the same rules that the most web-browsers use.
    Leader badge
    Downloads: 25 This Week
    Last Update:
    See Project
  • 4
    JDynamiTe, Dynamic Template in Java

    JDynamiTe, Dynamic Template in Java

    Dynamically generate documents from templates

    JDynamiTe is a tool which allows you to dynamically create documents in any format from "template" documents. And very few lines of code (or no line at all!) are needed to do that. Some typical usage domains of JDynamiTe are: - dynamic Web pages creation, - text document generation, - source code generation... In fact, it can be useful in any case where pre-defined documents (templates) have to be dynamically populated with data. The main benefit of JDynamiTe is to allow a true...
    Downloads: 0 This Week
    Last Update:
    See Project
  • DAT Freight and Analytics - DAT Icon
    DAT Freight and Analytics - DAT

    DAT Freight and Analytics operates DAT One truckload freight marketplace

    DAT Freight & Analytics operates DAT One, North America’s largest truckload freight marketplace; DAT iQ, the industry’s leading freight data analytics service; and Trucker Tools, the leader in load visibility. Shippers, transportation brokers, carriers, news organizations, and industry analysts rely on DAT for market trends and data insights, informed by nearly 700,000 daily load posts and a database exceeding $1 trillion in freight market transactions. Founded in 1978, DAT is a business unit of Roper Technologies (Nasdaq: ROP), a constituent of the Nasdaq 100, S&P 500, and Fortune 1000. Headquartered in Beaverton, Ore., DAT continues to set the standard for innovation in the trucking and logistics industry.
    Learn More
  • 5
    Jodd

    Jodd

    Jodd! Lightweight, Java, zero dependencies, use what you like

    Jodd is a set of micro-frameworks and developer-friendly tools and utilities. Code-things-done. Great performances. Tiny dependencies. Use what you like. Welcome to the Jodd, a unique, alternative world of Java. You can find here some micro-frameworks and some handy tools. Nothing more, nothing less. The decision which component is promoted is based on the Maven Central Repo statistics. Version v5 contains remaining libraries and frameworks. They are in maintenance mode.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    LumberJack4Logs
    LumberJack4Logs is a viewer for log and trace files with the ability to extend the recognized data formats by adding text parser plugins.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    The MangaStream Downloader is an open source application written in Java for managing and downloading manga from the site mangastream.com and mangafox.me. It is written under the GNU-GPL license and uses an open source HTML parser - TagSoup. Follow the project page on Facebook for updates: https://www.facebook.com/MangastreamDownloader
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    JSONjuicer

    JSON parser and encoder

    A Java open-source library which makes encoding and decoding Java data-structures to and from JSON text easy and intuitive.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 6 This Week
    Last Update:
    See Project
  • G-P - Global EOR Solution Icon
    G-P - Global EOR Solution

    Companies searching for an Employer of Record solution to mitigate risk and manage compliance, taxes, benefits, and payroll anywhere in the world

    With G-P's industry-leading Employer of Record (EOR) and Contractor solutions, you can hire, onboard and manage teams in 180+ countries — quickly and compliantly — without setting up entities.
    Learn More
  • 10
    NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A java m3u8 playlist parser for Http Live Streaming. See ietf draft http://tools.ietf.org/html/draft-pantos-http-live-streaming-02 for details. See See: http://m3u8parser.chilicat.net for JavaDoc and simple Webstart application to test the parser
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    QueLang

    QueLang is a designing tool to use for Questionnaire Design.

    This is the first implementation of QueLang. QueLang is a language I designed for Questionnaire Design and Implementation. This software can compile your code (written in .ql text files) into a special .qlc format (a kind of database). Then it can read those .qlc files to open them in viewer and export them to PDF format. It can be also used for exam and test designing! Tested on: -Linux Ubuntu 12.04 -Windows 7 64-bit QueLang can run by double clicking the .jar (or .exe)...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    jStyleParser

    Java CSS parser and DOM style assignment library

    jStyleParser is a CSS parser written in Java. It has its own application interface that is designed to allow an efficient CSS processing in Java and mapping the values to the Java data types. It is also able to apply the parsed style sheets to a DOM that represents an HTML or XML document and to compute the resulting style of the individual document elements.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    JLevel

    JLevel

    Kogonuso JLevel - a simple java library that has changed the way html

    Kogonuso JLevel - a simple java library that has changed the way html can be written inside Java(tm) both application and Sevlet. It is a library that is more than your ordinary Html parser applications. It has automated all tagging operations of Html inside java, this means that you are no longer required to learn how to format html String when developing application that requires html in java or even when using Servlet technology to develop your websites. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    SourceDoc is a powerful system for automatic creation, extraction, and verification of embedded documentation. Designed for C code, it features both a C parser and a preprocessor. The default output format is HTML, but other formats are possible to plug in using a public Java interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    HTML Parser
    HTML Parser is a Java library used to parse HTML in either a linear or nested fashion. Primarily used for transformation or extraction, it features filters, visitors, custom tags and easy to use JavaBeans. It is a fast, robust and well tested package.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    TagParser is a java parser based on CSS formulas (like JQuery) and can parse any documents based on tags such as XML, HTML. Furthermore, it doesn't require documents to be well formed and can parse complex documents with embedded scripts or CSS parts
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Kelvina is a platform independent Java HTML parser, which outputs Document(org.w3c.dom.Document) object from any html input, including invalid one.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Lightweight XML processor; XML-POJO mapping via Java5 annotations or DTD; Preprocessing of XML documents using expression language; Binary XML; RMI friendly XML; JSON format support; XML marshall/unmarshall; HTML as XML parser; Swing XML Viewer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    JTidy is a Java port of HTML Tidy, a HTML syntax checker and pretty printer. Like its non-Java cousin, JTidy can be used as a tool for cleaning up malformed and faulty HTML. In addition, JTidy provides a DOM parser for real-world HTML.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 22
    ShaniXmlParser is a small and fast Xml/Html DOM/SAX non validating parser written in java. It can parse invalid xml files. It uses the org.w3c.dom interfaces and the jaxp interfaces. It works also on mono/.net thanks to ikvm.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Lobo is an extensible all-Java web browser and RIA platform. It supports HTML 4, Javascript (AJAX) and CSS 2 plus direct JavaFX and Java (Swing/AWT) rendering. Cobra is the web browser's renderer API; also a Javascript-aware HTML parser.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    MozillaParser is a Java Html parser based on mozilla's html parser. it acts as a bridge from java classes to Mozilla's classes and outputs a java Document object from a raw ( and dirty) HTML input
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Wow Log Parser is a combat log parser for the game World of Warcraft. The purpose of the program is to parse the files generated with the /combatlog command. The source code can be found on: http://www.gurre.eu/wowlogparser/forum
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next