Showing 302 open source projects for "java html parser"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Cloud-based help desk software with ServoDesk Icon
    Cloud-based help desk software with ServoDesk

    Full access to Enterprise features. No credit card required.

    What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
    Try ServoDesk for free
  • 1
    TeXConverter is a converter to parse LaTeX input and create representations of the document in other formats (currently HTML, XDocs(Maven), a specific Wiki-format). Internally a Java object model is used to represent the document, to ease extensibility.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    CurlyML is a super-lightweight alternative to XML, properties files and BBCode. It is a concise data format for representing heirarchical data, and is extremely human-readable. Project provides parser, API and other tools using this format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Includes tools for creating ebooks in xml-format. xTrans helps in creating an XML-Ebook from plain text like RTF, TXT. XTrans converts xml-ebooks into the final format like PDF, HTML, RTF, PDB (various forms), ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    The DataExtractor (HTMLtoXML) extracts data from a HTML page according to a configuration file and puts the data into an XML file according to a specified structure. It is a tool to extract data from HTML pages and to store the data in XML files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cloud data warehouse to power your data-driven innovation Icon
    Cloud data warehouse to power your data-driven innovation

    BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.

    BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
    Try for free
  • 5
    java2web is a java to html source code converter which a wide array of features, including CSS customization, outline generation, automatic javadoc liking, selective highlighting and a lot more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    HtmlClient provides an SGML/HTML/XHTML parser and connection client making web-spidering as easy for developers as actually surfing the web with a premade browser. Based on Apache's HttpClient.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Xanot is an XOM (Xml to Object Mapper). Very similar to well known Apache Digester. But it uses java 5 annotation feature. So that the classes can "tell" the parser how to map xml data into its object model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    XDC is a program that generates HTML documentation from special comments within XML files - much like Javadoc does for documentation in Java source files. It supports various XML dialects (XSLT, Apache Ant build files, ...); others may easily be added.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    An implementation (parser/interpreter) of SPARQL, a query language for RDF data that is the current recommendation of the W3C. Allows easy cusomization of the language and plugs in to several popular RDF servers, like Sesame.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • Desktop and Mobile Device Management Software Icon
    Desktop and Mobile Device Management Software

    It's a modern take on desktop management that can be scaled as per organizational needs.

    Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
    Learn More
  • 10
    Strip out useless tags and other junk from HTML files. Shrink files, enhance readability of HTML source, promote privacy, and clean HTML exported from Microsoft Word (MS-Word). Run HTMLStrip as-is or customize it with your own regular expressions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    dft is a support class or library (depending on language/implementation) to allow dynamic formatting of HTML, data transmissions, print streams or anything else. The system uses templates to separate data views from procedural code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Generate javascript validation from an xml template; you can configure on xml your html forms, giving params (min / max length, etc) and creates an JavaScript code to put in your html to validate your forms. Its an easy way to write js validation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    FOXY is a filtering web proxy. Originally designed to provide device-independent access to the World Wide Web, it may also be used for HTTP-filtering, extraction and reauthoring of existing web content or as security device against web based attacks.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Location Aware Mobile Assistant (LAMA) provides travelers a dynamic directory service. Users receive information on hotels, restaurants, and more, which are within their vicinity.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    With one mouse click you generate an XForms form (within an XHTML site) based on your XML Schema. (Tranformation of XML Schemata to other technologies such as JSF, XUL, Hibernate are planned).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Super Simple XML. A reduced non w3 supported xml parser. It has only the basic parsing abilities. But is small and easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Thor is a Webapplication Framework that is easy to use, modular and not overburdened with complicated xml configuration files. You can easily write and integrate reusable components.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    XSDB XML is to DATA as HTML is to DOCUMENT. Publish and combine data as easily as HTML format and web browsers publish and view documents. Implementations in Python, javascript, java, C#/.NET.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    The BibXML Editor helps to manage bibliography information stored in an XML-based file. Export filters allows to generate BibTex files or HTML pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Abacus Technologies Relational XQuery is an XQJ compatible XQuery processor that provides the ability to execute XQuery against both relational and XML data sources. Relational data can be transformed to any XML schema using XQuery's powerful features.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    PyBookmark manipulates bookmark files. It can sync files (no server required), merge, sort, remove duplicates, and check links. Its library pybookmarklib provides access to these operations, data structures, and parser for further extensibility.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    AJAX JDBC ResultSet in Javascript. Uses XMLHTTP to send xml server data to a web browser HTML form with no refresh, reload or submit (like Google Suggest). Data can be SQL, object relational map (like Hibernate), XML, other mapping (like SOAP)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The project Navigator aims at supporting automated gathering of dynamic information from third party web sites, using their web interface to post queries and to gather replies. Navigator is written in OS-independent java language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    The Nheengatu Project is a Java library that provides HTML markup abstraction allowing you to reutilize it to generate PDF files, OpenOffice documents, image files, etc. The goal of this project is to maximize the use of HTML markup procedures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Connla is a Java library for creating data collections which can be exported to TXT, CSV, HTML, XHTML, XML, PDF and XLS formats.
    Downloads: 0 This Week
    Last Update:
    See Project