Showing 64 open source projects for "pdf 2 html"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI Studio. Switch between models without switching platforms.
    Start Free
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1

    DocJGenerator

    Wiki generator and Java Help System

    Allows to generate a wiki (interlinked HTML files) from a bunch of XML formatted files. It also allows to add a Help-system to a Swing or JavaFX application. Also it is also possible to generate a PDF, Word (docx), or epub document rather than a wiki. The tool also provides a visual editor to edit the wiki. The project also support both the Mediawiki and Markdown syntax.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 2
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3

    JWP

    Java Web Printer

    Java Web Printer interface 0.1 JWP 0.1 is the Java Web Printer interface that lets you print your documents from anywhere, anytime, to any installed printer. No need to install any printer drivers on the client machine, just use a web browser and upload your PDF file. You can print any PDF file with just a few clicks, using one of the available print methods: PDFbox, CUPS, or Filesystem. JWP 0.1 is compatible with Windows and other OS, and requires only a Jakarta EE capable Web...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    jmarkdownviewer

    markdown viewer for java

    This is a Markdown viewer for java. Primarily it tries to display github styled markdown scripts. Download the release jar file and run java -jar jmdviewer-x.y.jar where x.y is the version number. You can pass the markdown filename on the command line java -jar jmdviewer-x.y.jar [filename.md]
    Downloads: 0 This Week
    Last Update:
    See Project
  • Catch Bugs Before Your Customers Do Icon
    Catch Bugs Before Your Customers Do

    Real-time error alerts, performance insights, and anomaly detection across your full stack. Free 30-day trial.

    Move from alert to fix before users notice. AppSignal monitors errors, performance bottlenecks, host health, and uptime—all from one dashboard. Instant notifications on deployments, anomaly triggers for memory spikes or error surges, and seamless log management. Works out of the box with Rails, Django, Express, Phoenix, Next.js, and dozens more. Starts at $23/month with no hidden fees.
    Try AppSignal Free
  • 5

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is the tool to concatenate PDF files. It can concatenate, extract, encrypt, decrypt, configure PDF files, convert image files to PDF. GUI version and CUI version are both available. iText.NET is iText porting on .NET Framework by J#. This library allows you to generate PDF, (X)HTML, XML, RTF files on Microsoft.NET Framework including ASP.NET.
    Downloads: 58 This Week
    Last Update:
    See Project
  • 6
    Web Book Downloader

    Web Book Downloader

    Download websites as e-book: pdf, txt, epub.

    This application allows user to download chapters from website in 3 ways: - from table of contents; - from range: first chapter address, last chapter address; - by crawling from first chapter to n; In settings you can customize language, input(website encoding) for simplicity output is in the same encoding. If you want your language add new class into strings package, and new fields into Settings class and GUI menu(initialize method).
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. ...
    Leader badge
    Downloads: 154 This Week
    Last Update:
    See Project
  • 8
    OpenSearchServer Search Engine

    OpenSearchServer Search Engine

    An open source search engine with RESTFul API and crawlers

    OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    ...Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end. Migrate from on-prem or other clouds with free migration tools.
    Try Free
  • 10
    Aspose Java for Liferay

    Aspose Java for Liferay

    Provides export options for blogs, journals and dynamic lists

    This is Liferay CMS / Portal plugin released by Aspose pty ltd. Aspose.Total Java for Liferay (hook plugin app) provides options for exporting web-contents and blogs created in html to MS-WORD, MS-EXCEL and PDF file formats using Aspose.Total Java APIs. (Aspose.Words, Aspose.Cells and Aspose.PDF) The Plugin also provides very useful functionality / options for exporting the Dynamic Data Lists to MS-WORD, MS-EXCEL and PDF file formats using Aspose.Total Java APIs. (Aspose.Words, Aspose.Cells and Aspose.PDF)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    waterlooFX

    Scientific Charting with JavaFX

    waterlooFX provides a library for scientific charting using JavaFX
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Regain is a Java search engine based on Jakarta Lucene. It provides indexing and searching files for plenty of formats (HTML,XML,doc(x),xls(x),ppt(x),oo,PDF,RTF,mp3,mp4,Java). A TagLibrary eases integrating search results in your JSP based web page.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 13
    IDRA (InDexing and Retrieving Automatically) is a tool which allows indexing a wide range of text (TXT, DOC, PDF) and image annotations files (XML), query-based searching, visualizing an index, saving it for re-usability, evaluation, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    WebDiff

    Graphical tool for visualizing changes in web pages

    WebDiff is a graphical tool for visualizing changes in web pages. It is written in Java and uses Eclipse's SWT toolkit. You can view changes between any two HTML files on your file system or a web server, distinguishing them in a manner of your choice. There is a plan to eventually support viewing changes between Git/Subversion/Mercurial clients.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Java tools for building a web based encyclopedia of articles for a roleplaying campaign from XML source files, including schemas and XSLT stylesheets for transforming the XML into HTML and PDF formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    ssSearchEngine

    keyword search engine for semi-structured data (Tables, lists,...)

    This application implement an approach for doing keyword based search over semi-structured data available in HTML pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    RAD platform for of Ajax applications in java / j2ee. Main features : creation of Ext J.S (extjs) master-detail forms and grids( filter-paging-grouping) without javascript, integration with Spring for the services layer, a Generic Hibernate DAO.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Calenco XML CMS
    Calenco is a Web collaborative platform that enable remote teams of writers, proofreader, graphic designers, translators, etc. to produce together XML documents like user guides, security procedures, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    This little project may help you build simple multilingual web sites for sourceforge open source projects. You won't need to worry about html and JavaScript, but also capabilities will be limited.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Java browser and wysiwyg | source editor of html SFI(Structure-Fragment-Identifier)-files: 1) creates dynamically Table-Of-Contents 2) One-to-One mapping ToC - browser|editors 3) indexes the words 4) developers can use it for their help-system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Reporting engine library written in C. Create one XML file and generate PDF, HTML, TXT, and CSV reports based on queries. Has support for MySQL, PostgreSQL, ODBC. Bindings for PHP, Java, Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    A port of the iTEXT pdf library for Java to PHP (a PHP extension).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    untmpl is a tool for detecting static sections such as headers, footers and menus of HTML documents. Detecting and removing these sections allows access to the actual content of the document.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Experimental browser that natively supports HTML, PDF, man pages, TeX DVI, scanned paper. Annotate with hyperlinks, highlights, notes, executable copy editor markup. [PDF tools for compress, impose, decrypt/encrypt, split/merge have been moved.]
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Identityflow provides a framework for building protocol flows that perform distributed Identity operations. Examples of Identity Operations are Single Sign-On (SSO) and Attribute queries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB