Showing 370 open source projects for "java html parser"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    Ace Operator is a tool for enabling live communications between company representatives and online users from web sites. By clicking on the "live help" button, customers can get access to company representatives and exchange information in real-time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    crawler4j

    crawler4j

    Open source web crawler for Java

    crawler4j is an open source web crawler for Java which provides a simple interface for crawling the Web. Using it, you can setup a multi-threaded web crawler in few minutes. You need to create a crawler class that extends WebCrawler. This class decides which URLs should be crawled and handles the downloaded page. shouldVisit function decides whether the given URL should be crawled or not.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Web Widget Toolkit (WTK): Server-side components for easily creating web-based user interfaces with complex navigation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    JuniCoder is a Java project that uses unicode as a base for decoding and encoding formats that invented workarounds to express characters not covered by ASCII. Decoders translate those inventions to unicode. Encoders encode to these inventions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 5
    cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techinques. A command line executeable is shipped that allows to sort documents by codepage.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Lanproxy

    Lanproxy

    Intranet penetration tool that proxies local area network computers

    Lanproxy is an intranet penetration tool that proxies local area network personal computers and servers to the public network. It supports tcp traffic forwarding and any tcp upper layer protocol (access to intranet websites, local payment interface debugging, ssh access, remote desktop, http proxy) , https proxy, socks5 proxy...). Penetration basic functions, same as the open source version, high performance, can support tens of thousands of penetration connections at the same time. Support...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    The MangaStream Downloader is an open source application written in Java for managing and downloading manga from the site mangastream.com and mangafox.me. It is written under the GNU-GPL license and uses an open source HTML parser - TagSoup. Follow the project page on Facebook for updates: https://www.facebook.com/MangastreamDownloader
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Gecco

    Gecco

    Lightweight Java web crawler framework with jQuery-style extraction

    Gecco is a lightweight web crawler framework written in Java that simplifies the process of building web scraping applications. It is designed to make crawler development straightforward by allowing developers to extract page elements using jQuery-style selectors rather than complex parsing logic. It integrates several well-known Java libraries and frameworks, including tools for HTTP requests, HTML parsing, JSON processing, and application development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    webmongo

    Accessing server-side mongodb through client javascript API.

    Accessing server-side mongodb through client javascript API. This project is a branch of dbcloud You can do almost invoke on mongodb through the javascript API in browser. The client javascript api support IE6.0+ Chrome FireFox and Wechat
    Downloads: 1 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10

    Java-WebTTS

    API-Makes static web pages readable with no coding

    This JAVA API helps create java web applications where static html pages can be read out to viewer. It helps people who are visually challenged partially. It is helpful to common people and children , too, and might work out great in educational site. It's fully customized and does not need developer to write a single line of code . All you need to do is to assign a specific id to the DOM element , whose innerHTML you want to be read out.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Save For Offline

    Save For Offline

    Android app for saving webpages for offline reading

    Android app for saving webpages for offline reading. Save For Offline is an Android app for saving full web pages for offline reading, with lots of features and options. In you web browser selects 'Share', and then 'Save For Offline'. Saves real HTML files which can be opened in other apps/devices. Download & save entire web pages with all assets for offline reading & viewing. Save HTML files in a custom directory. Save in the background, no need to wait for it to finish saving. Night mode,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    filtrando google sheets

    Aplicación que permite filtrar hojas de cálculo de google

    Aplicación que permite conectarse a una hoja de cálculo de google y filtrar sus datos, mediante una fórmula de filtrado. Es un poco complejo utilizarla, pero dispone de documentación detallada sobre cómo funciona paso a paso. You will find more about it at this web site: https://www.frojasg1.com:8443/downloads_web/web/html/googleSheets.html?origin=sourceforge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    ItsNat is a Component based AJAX Java Web Application Framework.No XML programming,no mixed view/code,no custom JavaScript.Only pure HTML,pure Java and server centric Swing-like programming with W3C standards where "The Browser is The Server"
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    Chunk, an HTML Template Engine for Java

    Chunk, an HTML Template Engine for Java

    Clean, powerful templates for Java

    A powerful Java Template Engine, great for building HTML or XML docs. Chunk can handle many other needs and situations as well. In-tag filters & default values, multiple snippets per file, layered themes, macros, conditional includes, localization & more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    CleanCode

    CleanCode

    .NET, PowerShell, SQL, Java, Perl, and Javascript developer libraries

    Develop clean code with our .NET components (plus PowerShell, SQL, Java, Perl, and JavaScript components as well!). CleanCode highlights include user controls, a validation engine, a diagnostic system, an XML/HTML pre-processor, and a variety of articles on code design.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Extra Life Donation Parser

    Parses donations from EL and displays them

    Intended to be used during a live stream, this parser will poll for new donations based on a comma-delimited list of ids and display them in a frameless background with a total. Colors can be changed without closing the display to support streamers using a chroma key. Licensed under GPL version 3
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Jigy Generator Linux

    Jigy Generator Linux

    The best way to generate a Spring Java Web Application. It just works!

    The Jigy Generator is the simplest way to build and code a Java web application with a Spring Framework by using a project creation wizard and now it's been made even simpler by downloading a very small HTML file (337 bytes) and filling out a few form fields and let it construct your project with ease! The resulting project will be a Java web application with a Spring Framework configured to run in Eclipse or Netbeans.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Jigy Generator OSX

    Jigy Generator OSX

    The best way to generate a Spring Java Web Application. It just works!

    The Jigy Generator is the simplest way to build and code a Java web application with a Spring Framework by using a project creation wizard and now it's been made even simpler by downloading a very small HTML file (337 bytes) and filling out a few form fields and let it construct your project with ease! The resulting project will be a Java web application with a Spring Framework configured to run in Eclipse or Netbeans.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    wikihtml

    Converts wikitext documents into HTML documents

    This project is an application that converts wikitext documents into HTML documents. Wiki markup or wikitext is a markup language to write documents in wiki-based systems, such as web sites powered by MediaWiki.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    xowa

    xowa

    A free, open-source, offline Wikipedia application

    XOWA is a desktop application for reading and editing Wikipedia offline (XOWA has moved to http://gnosygnu.github.io/xowa/download.html)
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    Aspose Java for Liferay

    Aspose Java for Liferay

    Provides export options for blogs, journals and dynamic lists

    This is Liferay CMS / Portal plugin released by Aspose pty ltd. Aspose.Total Java for Liferay (hook plugin app) provides options for exporting web-contents and blogs created in html to MS-WORD, MS-EXCEL and PDF file formats using Aspose.Total Java APIs. (Aspose.Words, Aspose.Cells and Aspose.PDF) The Plugin also provides very useful functionality / options for exporting the Dynamic Data Lists to MS-WORD, MS-EXCEL and PDF file formats using Aspose.Total Java APIs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB