Showing 29 open source projects for "document"

  • 1
    MinerU

    A high-quality tool for converting PDF to Markdown and JSON

    MinerU is an open-source, high-quality document extraction toolkit focused on converting PDFs (and other document formats) into structured Markdown and JSON. It leverages OCR and layout analysis to preserve semantic structure and metadata, ideal for research and data science workflows.
    Downloads: 15 This Week
  • 2
    PDF.js

    A PDF Reader in JavaScript

    PDF.js is a web standards-based platform for parsing and rendering Portable Document Format (PDF) files. Open source and built with HTML5, this PDF viewer is supported by a great community and Mozilla Labs. PDF.js can be used on both modern and older browsers, and is built into Firefox 19 and later.
    Downloads: 82 This Week
  • 3
    QuestPDF

    A library that can help you with generating PDF documents

    ...QuestPDF is a progressive library that helps you generate PDF documents in your .NET application through a friendly, discoverable, and predictable C# fluent API. Do you believe that creating a complete invoice document can take fewer than 200 lines of code? We have prepared step-by-step instructions that show every detail of this implementation and describe the best patterns and practices. This library was created specifically for designing and arranging document layouts, with full paging support. Alternative solutions, such as HTML-based converters, are not designed for this purpose and are therefore often unpredictable and do not produce the desired results.
    Downloads: 9 This Week
  • 4
    PDF4QT

    Open source PDF editor

    ...The applications are primarily used to view, edit, manipulate, or compare PDF documents. Users can preview these applications in the screenshots section of this webpage. They offer basic browsing and many other functions, such as encryption, reading a document, verification of digital signatures, editing of annotations, searching for text using regular expressions, drawing pages into an image, and much more. Several plug-ins are available.
    Downloads: 82 This Week
  • 5
    PDFPatcher

    A versatile toolkit for PDF manipulation

    PDFPatcher (aka “PDF补丁丁”) is a versatile toolkit for PDF manipulation—editing document metadata, bookmarks, page layout, content restrictions, rotation, compression, merging/splitting, image extraction, and more, all within an intuitive interface. Merge/split PDFs or images, preserve or add bookmarks, and set page dimensions. Batch style/color/target changes, regex/XPath search/replace, mid‑page positioning. Modify PDF metadata, page numbers, links, initial view mode, and remove open actions.
    Downloads: 37 This Week
  • 6
    DOCX Document Converter

    Convert .docx to .md/.txt and .html. Free, unlimited, fast.

    ...This is a better version of 'convert docx to txt' since .md files can be opened in Notepad++ just the same and they preserve formatting too! Meanwhile, HTML files will even embed images, allowing you to view them just as they were in the original document.
    Downloads: 13 This Week
  • 7
    Vanilla.PDF

    Cross-platform SDK for creating and modifying PDF documents

    ...Vanilla.PDF supports advanced PDF features such as adding CMS (PKCS#7) digital signatures, modifying content streams and metadata, and working with encryption and permissions based on standard PDF security models. It includes tools for parsing PDF internals like cross-reference tables and objects, providing fine-grained document analysis capabilities. The project is unit-tested with continuous integration pipelines, supporting sanitizers for enhanced code quality and stability.
    Downloads: 6 This Week
  • 8
    borb

    borb is a library for reading, creating and manipulating PDF files

    borb is a pure Python library for reading, writing, creating, and manipulating PDF documents. It represents a PDF document as a JSON-like data structure of nested lists, dictionaries, and primitives (numbers, strings, booleans, etc.). This is currently a one-man project, so the focus will always be on supporting common use cases over rare ones.
    Downloads: 6 This Week
  • 9
    Asciidoctor PDF

    Asciidoctor PDF: A native PDF converter for AsciiDoc

    A fast text processor & publishing toolchain for converting AsciiDoc to HTML5, DocBook & more. Asciidoctor is a fast, open source, Ruby-based text processor for parsing AsciiDoc® into a document model and converting it to output formats such as HTML 5, DocBook 5, manual pages, PDF, EPUB 3, and other formats. Asciidoctor also has an ecosystem of extensions, converters, build plugins, and tools to help you author and publish content written in AsciiDoc.
    Downloads: 9 This Week
  • 10
    converseen

    A batch image converter and resizer

    Converseen is a free cross-platform batch image processor for Windows, Linux, macOS, FreeBSD, and other operating systems. It allows you to convert, resize, rotate, and flip any number of images with a single mouse click. Moreover, Converseen can convert an entire PDF document into a set of images with your preferred characteristics. You can choose from more than 100 formats and set the size, resolution, and filename. Thanks to ImageMagick, the powerful image manipulation library, Converseen can handle more than 100 image formats, including DPX, EXR, GIF, JPEG, JPEG-2000, PNG, SVG, TIFF, WebP, HEIC/HEIF, and many others. ...
    Downloads: 1,249 This Week
  • 11
    bridgex

    Convert files like docx, xlsx, pptx, html, and more to Markdown

    ...Supported Formats 📂
    Bridgex supports conversion of the following file formats:
    - PDF (.pdf)
    - Word (.docx)
    - PowerPoint (.pptx)
    - Excel (.xlsx, .xls, .csv)
    - Outlook Messages (.msg)
    - Text (.txt, .text)
    - Markdown (.md, .markdown)
    - JSON (.json, .jsonl)
    - XML (.xml)
    - RSS/Atom (.rss, .atom)
    - HTML/MHTML (.html, .htm, .mhtml)
    - ePub (.epub)
    - Compressed files (.zip)
    - Jupyter Notebooks (.ipynb)
    - Other formats supported by Markitdown
    Bridgex is not an IDE, text editor, Markdown editor, or document viewer
    Downloads: 7 This Week
  • 12
    PDF to Word Converter
    Coolmuster PDF to Word Converter turns any PDF into a fully editable Microsoft Word document, with perfect preservation of text styles and fonts, images and graphics, tables and columns, hyperlinks, and overall layout and formatting. No more reformatting after conversion.
    Downloads: 8 This Week
  • 13
    pdf combiner merger converter splitter

    PDF Combiner is a user-friendly, GUI-based tool built in

    PDF Combiner is a user-friendly, free, open-source GUI tool for combining, splitting, and annotating PDF files, converting PDF to Excel, PDF to Word, and images to PDF, and zipping and unzipping files. It is easy to use, supports inserting, deleting, and processing multiple files, and lets you adjust the order of files before combining.
    Downloads: 4 This Week
  • 14
    posterdown

    Use RMarkdown to generate PDF Conference Posters via HTML

    ...Many creature comforts from RMarkdown are available in this package such as Markdown section notation, figure captioning, and even citations like this one (Allaire, Xie, McPherson, et al. 2018). The rest of this example poster will show how you can insert typical conference poster features into your own document. Posterdown was created as a proof-of-concept (to myself) that it is possible to make a beautiful poster using open-source reproducible code.
    Downloads: 1 This Week
  • 15
    Free HTML to PDF for .NET and C# Library

    Free HTML to PDF Converter for C# .NET and ASP.NET MVC

    HiQPdf Library for C# .NET offers you a modern, simple, fast, flexible and powerful tool to create complex and stylish PDF documents in your applications with just a few lines of code. Using the high quality HTML to PDF conversion engine you can easily design a document in HTML with CSS3, JavaScript, SVG or Canvas and then convert it to PDF preserving the exact content and style.
    Downloads: 1 This Week
  • 16

    Pdf Split and Merge with Bookmark Import

    PDF merging with bookmarks and pagination

    The PDF Split and Merge tool is implemented using the PdfSharp library and is either GUI or command driven. It is useful for automated document creation. It allows bookmarks from the input PDFs to be imported and flexibly embedded in the destination document. PDF pages can be numbered and annotated with a footer label.
    Downloads: 29 This Week
  • 17
    DinkToPdf

    C# .NET Core wrapper for wkhtmltopdf library that uses Webkit engine

    .NET Core P/Invoke wrapper for the wkhtmltopdf library, which uses the Webkit engine to convert HTML pages to PDF. Copy the native library to the root folder of your project. From there, .NET Core loads the native library when the native method is called with P/Invoke. You can find the latest version of the native library. Select the appropriate library for your OS and platform (64 or 32-bit). The library was not tested with IIS. The library was tested in console applications and with Kestrel web server...
    Downloads: 0 This Week
  • 18
    PDFill PDF Editor
    ...Please try PDFill: PDF Redaction or PDF Eraser to remove the original sensitive content; Group/Ungroup Text, shape or picture; Save, Manage and Use the PDFill Objects; Save all the editing with Information, Encrypt, Initial View, Presentation and Advanced options; Save editing now and Resume later. PDF Document Management to Insert, Replace, Delete, Renumber, Extract, Move, Rotate and Crop PDF Pages, to Add Bookmark, Header, Footer, Watermark, Attachment, Action, JavaScript, Digital Signature, and Bates Numbering, to optimize PDF and extract into Text and Image. Deskew PDF to rotate, move and scale like image. Thumbnail to rearrange pages by drag and drop; PDF Form Maker with Field Recognition, Multiple Copying, Table Order and Calculation Order; PDF Ink Annotator to write, draw, mark or highlight using the pen input; PDF Form Filler to save fill-in PDF forms with data import, export, batch or auto fill; PDF Filler
    Downloads: 1 This Week
  • 19

    Convert HTML to PDF in .NET with C#

    Convert HTML to PDF in .NET with C# using EVO HTML to PDF for .NET

    EVO HTML to PDF Converter for .NET is a library that can be easily integrated and distributed in your ASP.NET and MVC web sites, desktop applications, Windows services and Azure cloud services to convert web pages, HTML strings and streams to PDF, to images or to SVG and to create nicely formatted and easily maintainable PDF reports and documents. The converter has full support for HTML5, CSS3, SVG, Canvas, Web Fonts and JavaScript. Does not require installation or any third party tools. ...
    Downloads: 0 This Week
  • 20
    AlphaCentauri

    AlphaCentauri is a completely free Office tools application.

    AlphaCentauri is a completely free office application, with various utilities resembling those distributed commercially on Windows. Functionality is still quite limited compared to the commercial products, but more features will be added soon. Please note that update v1.1.0 has full-featured programs; however, you will see a watermark with "trial version" details. This is because some features were implemented using Nevron's very useful and awesome libraries. Unfortunately, these...
    Downloads: 0 This Week
  • 21
    DJVU++

    The complete DjVu solution, with OCR technology (Arabic, English).

    ...
    - DjVu++ supports two languages for OCR (Arabic and English).
    - Read multiple documents at the same time with the new tabs feature.
    - DjVu++ supports multiple formats:
      - Convert a PDF document into DjVu format with a smaller file size and the same performance.
      - Convert DjVu into PDF format.
      - Combine images into a single DjVu document. Perform OCR operations on multiple image formats.
    Downloads: 7 This Week
  • 22
    seppdflatex

    Build a large LaTeX book with separate linked chapters

    seppdflatex is a Perl script which automates a lot of the tasks needed to compile PDF documents from LaTeX source for a multi-volume book, or a book with many huge chapters which you may not want as a single document, but which should all be unified by cross-references and external hyperlinks, so a PDF reader will open a link to an external chapter PDF file. The Table of Contents and LOF and LOT are made for all chapters, and the TOC, LOF, LOT are all hyperlinked to the correct external PDF chapter files using zr-ref and xr packages from CTAN. ...
    Downloads: 0 This Week
  • 23
    stymaker

    Create your own LaTeX style.

    Stymaker is a GUI application that helps LaTeX users create their own style packages. After filling in a simple form, you get a new package file corresponding to the chosen settings. This package can be included in the preamble of a LaTeX document with \usepackage{mystyle}. The new package, based on standard LaTeX packages, allows changing the document layout or the appearance of standard environments such as lists. While testing new settings, you can instantly view the actual changes in an external PDF viewer.
    Downloads: 0 This Week
  • 24
    eLectrix PDF Viewer
    eLectrix is an application to view Portable Document Format (PDF) files. The program is written in the C programming language, using GTK+ for the user interface and poppler for rendering pages.
    Downloads: 2 This Week
  • 25
    JPDF Export
    JPDF Export is a Java library built on the famous iText library. It provides simple functions that can be used to build complex PDF files. It also provides simple classes to merge, split, and convert PDF files.
    Downloads: 0 This Week