Showing 369 open source projects for "java html parser"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Cloud-based help desk software with ServoDesk Icon
    Cloud-based help desk software with ServoDesk

    Full access to Enterprise features. No credit card required.

    What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
    Try ServoDesk for free
  • 1
    Ace Operator is a tool for enabling live communications between company representatives and online users from web sites. By clicking on the "live help" button, customers can get access to company representatives and exchange information in real-time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    crawler4j

    crawler4j

    Open source web crawler for Java

    crawler4j is an open source web crawler for Java which provides a simple interface for crawling the Web. Using it, you can setup a multi-threaded web crawler in few minutes. You need to create a crawler class that extends WebCrawler. This class decides which URLs should be crawled and handles the downloaded page. shouldVisit function decides whether the given URL should be crawled or not.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Web Widget Toolkit (WTK): Server-side components for easily creating web-based user interfaces with complex navigation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techinques. A command line executeable is shipped that allows to sort documents by codepage.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Skillfully - The future of skills based hiring Icon
    Skillfully - The future of skills based hiring

    Realistic Workplace Simulations that Show Applicant Skills in Action

    Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.
    Learn More
  • 5
    JuniCoder is a Java project that uses unicode as a base for decoding and encoding formats that invented workarounds to express characters not covered by ASCII. Decoders translate those inventions to unicode. Encoders encode to these inventions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Lanproxy

    Lanproxy

    Intranet penetration tool that proxies local area network computers

    Lanproxy is an intranet penetration tool that proxies local area network personal computers and servers to the public network. It supports tcp traffic forwarding and any tcp upper layer protocol (access to intranet websites, local payment interface debugging, ssh access, remote desktop, http proxy) , https proxy, socks5 proxy...). Penetration basic functions, same as the open source version, high performance, can support tens of thousands of penetration connections at the same time. Support...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    The MangaStream Downloader is an open source application written in Java for managing and downloading manga from the site mangastream.com and mangafox.me. It is written under the GNU-GPL license and uses an open source HTML parser - TagSoup. Follow the project page on Facebook for updates: https://www.facebook.com/MangastreamDownloader
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    webmongo

    Accessing server-side mongodb through client javascript API.

    Accessing server-side mongodb through client javascript API. This project is a branch of dbcloud You can do almost invoke on mongodb through the javascript API in browser. The client javascript api support IE6.0+ Chrome FireFox and Wechat
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Java-WebTTS

    API-Makes static web pages readable with no coding

    This JAVA API helps create java web applications where static html pages can be read out to viewer. It helps people who are visually challenged partially. It is helpful to common people and children , too, and might work out great in educational site. It's fully customized and does not need developer to write a single line of code . All you need to do is to assign a specific id to the DOM element , whose innerHTML you want to be read out.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Dun and Bradstreet Connect simplifies the complex burden of data management Icon
    Dun and Bradstreet Connect simplifies the complex burden of data management

    Our self-service data management platform enables your organization to gain a complete and accurate view of your accounts and contacts.

    The amount, speed, and types of data created in today’s world can be overwhelming. With D&B Connect, you can instantly benchmark, enrich, and monitor your data against the Dun & Bradstreet Data Cloud to help ensure your systems of record have trusted data to fuel growth.
    Learn More
  • 10
    Save For Offline

    Save For Offline

    Android app for saving webpages for offline reading

    Android app for saving webpages for offline reading. Save For Offline is an Android app for saving full web pages for offline reading, with lots of features and options. In you web browser selects 'Share', and then 'Save For Offline'. Saves real HTML files which can be opened in other apps/devices. Download & save entire web pages with all assets for offline reading & viewing. Save HTML files in a custom directory. Save in the background, no need to wait for it to finish saving. Night mode,...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    ItsNat is a Component based AJAX Java Web Application Framework.No XML programming,no mixed view/code,no custom JavaScript.Only pure HTML,pure Java and server centric Swing-like programming with W3C standards where "The Browser is The Server"
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 15
    Chunk, an HTML Template Engine for Java

    Chunk, an HTML Template Engine for Java

    Clean, powerful templates for Java

    A powerful Java Template Engine, great for building HTML or XML docs. Chunk can handle many other needs and situations as well. In-tag filters & default values, multiple snippets per file, layered themes, macros, conditional includes, localization & more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    CleanCode

    CleanCode

    .NET, PowerShell, SQL, Java, Perl, and Javascript developer libraries

    Develop clean code with our .NET components (plus PowerShell, SQL, Java, Perl, and JavaScript components as well!). CleanCode highlights include user controls, a validation engine, a diagnostic system, an XML/HTML pre-processor, and a variety of articles on code design.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Extra Life Donation Parser

    Parses donations from EL and displays them

    Intended to be used during a live stream, this parser will poll for new donations based on a comma-delimited list of ids and display them in a frameless background with a total. Colors can be changed without closing the display to support streamers using a chroma key. Licensed under GPL version 3
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
    Downloads: 25 This Week
    Last Update:
    See Project
  • 19
    Jigy Generator Linux

    Jigy Generator Linux

    The best way to generate a Spring Java Web Application. It just works!

    The Jigy Generator is the simplest way to build and code a Java web application with a Spring Framework by using a project creation wizard and now it's been made even simpler by downloading a very small HTML file (337 bytes) and filling out a few form fields and let it construct your project with ease! The resulting project will be a Java web application with a Spring Framework configured to run in Eclipse or Netbeans.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Jigy Generator OSX

    Jigy Generator OSX

    The best way to generate a Spring Java Web Application. It just works!

    The Jigy Generator is the simplest way to build and code a Java web application with a Spring Framework by using a project creation wizard and now it's been made even simpler by downloading a very small HTML file (337 bytes) and filling out a few form fields and let it construct your project with ease! The resulting project will be a Java web application with a Spring Framework configured to run in Eclipse or Netbeans.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    wikihtml

    Converts wikitext documents into HTML documents

    This project is an application that converts wikitext documents into HTML documents. Wiki markup or wikitext is a markup language to write documents in wiki-based systems, such as web sites powered by MediaWiki.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    xowa

    xowa

    A free, open-source, offline Wikipedia application

    XOWA is a desktop application for reading and editing Wikipedia offline (XOWA has moved to http://gnosygnu.github.io/xowa/download.html)
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Aspose Java for Liferay

    Aspose Java for Liferay

    Provides export options for blogs, journals and dynamic lists

    This is Liferay CMS / Portal plugin released by Aspose pty ltd. Aspose.Total Java for Liferay (hook plugin app) provides options for exporting web-contents and blogs created in html to MS-WORD, MS-EXCEL and PDF file formats using Aspose.Total Java APIs. (Aspose.Words, Aspose.Cells and Aspose.PDF) The Plugin also provides very useful functionality / options for exporting the Dynamic Data Lists to MS-WORD, MS-EXCEL and PDF file formats using Aspose.Total Java APIs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    wiki2xhtml converts wiki syntax into (X)HTML code and styles the page with CSS. It makes it easy to create good-looking pages without many know-how, and advanced users can use own code. The program can be run either in the console or with a GUI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Site monitoring

    Site monitoring

    Monitoring of websites with spider and email notifications

    Free website monitoring software, easy to set up and use for monitoring web sites. It is a web application programmed in Java programming language. You can monitor HTML pages, JSON and XML, pages in sitemap and even your whole web site using spider. Naturally you can check multiple websites. You can check HTTP result codes and even contents of the checked pages. Website checking is done periodically using build-in cron mechanism. In case of a check failure, application will automatically send email to site administrator.
    Downloads: 0 This Week
    Last Update:
    See Project