Showing 12 open source projects for "java html parser"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Photo and Video Editing APIs and SDKs Icon
    Photo and Video Editing APIs and SDKs

    Trusted by 150 million+ creators and businesses globally

    Unlock Picsart's full editing suite by embedding our Editor SDK directly into your platform. Offer your users the power of a full design suite without leaving your site.
    Learn More
  • 1
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Ssspeak is a framework providing an SSML speech synthesizer wrapper around festival (the synthesizer) and a Firefox extension enabling your browser to read HTML pages to you. Via audible notifications and voice announcements about the state of the page be
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    uEasy is a wysiwyg editor for use with Content Management System (CMS). It features a wysiwyg client (as applet or application) and a server. You can easily develop: - interfaces for any CMS - plugins for any format (HTML, Wiki, ...)
    Downloads: 0 This Week
    Last Update:
    See Project
  • Sales CRM and Pipeline Management Software | Pipedrive Icon
    Sales CRM and Pipeline Management Software | Pipedrive

    The easy and effective CRM for closing deals

    Pipedrive’s simple interface empowers salespeople to streamline workflows and unite sales tasks in one workspace. Unlock instant sales insights with Pipedrive’s visual sales pipeline and fine-tune your strategy with robust reporting features and a personalized AI Sales Assistant.
    Try it for free
  • 5
    ServingXML is an open source, Apache 2.0 licensed, framework for flat/XML data transformations. It defines an extensible markup vocabulary for expressing flat-XML, XML-flat, flat-flat, and XML-XML processing in pipelines.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    TXS provides a basic HTTP server. Its own XML Script language can produce, via XSL-T, XForms or XHTML pages (tested with FireFox). It can also be used to read and generate CSV, XML or ZIP files. HTML page capture and email generation also available.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Jetsam is a xml based Content Management System.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    XmlHttpLib(XHL) is a pure java developers library and cmd line tool for executing http requests with xml-input(, converting response html to xhtml) and transforming it with xslt to grab information from webpages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    jinFORM is a J2EE 1.4 web application that allows filling Microsoft Infopath forms. jinFORM relies on Orbeon Presentation Server (http://www.orbeon.com/) for HTML rendering, for which it transforms XSN files to XForms and acts as a submission handler.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Test your software product anywhere in the world Icon
    Test your software product anywhere in the world

    Get feedback from real people across 190+ countries with the devices, environments, and payment instruments you need for your perfect test.

    Global App Testing is a managed pool of freelancers used by Google, Meta, Microsoft, and other world-beating software companies.
    Try us today.
  • 10
    Helona is a project to provide xml based plugins (modules) and themes for Apache cocoon, forrest and lenya based web applications. The project is created to provide code that are not meeting ASF policies (e.g. including GNU LGPL licenced code).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    FOXY is a filtering web proxy. Originally designed to provide device-independent access to the World Wide Web, it may also be used for HTTP-filtering, extraction and reauthoring of existing web content or as security device against web based attacks.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Transfolio, transform your folders. Full applications in different contexts (Ant, Cocoon, Mozilla, OOo...), using a transformations pack (XSL), to import and export documents in various formats (HTML, XMLs, but also OOo or jpg...).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.