Showing 37 open source projects for "documents"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 1
    PDF4QT

    PDF4QT

    Open source PDF editor

    PDF4QT is open source PDF editor based on Qt framework. It contains a C++ library, applications for viewing/editing PDF documents, and a command line tool. PDF4QT is an open-source PDF editor for Windows/Linux. It is a modern solution for viewing/editing/rendering PDF documents, for users and developers alike. For developers, there is a C++ library and a command line tool for use in scripts. For users, there are four applications offering many features. The project is hosted on Github and uses the license LGPLv3. ...
    Downloads: 54 This Week
    Last Update:
    See Project
  • 2
    QuestPDF

    QuestPDF

    A library that can help you with generating PDF documents

    Quickly design and generate PDF documents with an open-source, modern, and battle-tested C# library. Forget about limitations, feel confident, enjoy your task and efficiently deliver professional products. QuestPDF is a progressive library that can help you with generating PDF documents in your .NET application by offering a friendly, discoverable and predictable C# fluent API.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    BentoPDF

    BentoPDF

    A Privacy First PDF Toolkit

    BentoPDF is a self-hosted, open-source PDF toolkit that provides a suite of local PDF manipulation features for users who want full control over their documents without relying on cloud PDF services. It offers functionality to merge, split, compress, rotate, and convert PDFs through an easy-to-deploy container or local installation, making it ideal for individuals and teams that handle large volumes of PDF files regularly. Because it runs entirely on your own system or server, BentoPDF helps ensure that sensitive documents never leave your environment — a critical consideration for privacy-conscious users or organizations with strict data policies. ...
    Downloads: 123 This Week
    Last Update:
    See Project
  • 4
    iLovePDF Api

    iLovePDF Api

    iLovePDF Rest Api - PHP Library

    ...We offer a simple and concise API Reference and Guide as well as API Libraries with their own docs too. Our infrastructure uses the best PDF technology for processing PDF files. Merge and split documents with a variety of custom options. Remove, extract or organize PDF pages as you need. Reduce the size of your PDF while maintaining its original quality and formatting. Easily convert Images, MS Word, PowerPoint and Excel files into non-editable PDF documents. Convert PDF documents to JPG images or to PDF/A format.
    Downloads: 3 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    tinypdf

    tinypdf

    Minimal PDF creation library

    ...The library supports essential primitives like writing text, drawing basic shapes, and placing JPEG images, which covers common needs such as invoices, receipts, tickets, and simple reports. It also supports clickable links so generated documents can include interactive URLs, and it can create multi-page documents with custom page sizes. A notable convenience is built-in markdown-to-PDF conversion for common structures like headers and lists, letting you go from formatted text to a PDF layout quickly.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    OCRmyPDF

    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 104 This Week
    Last Update:
    See Project
  • 7
    PDFCraft

    PDFCraft

    PDFCraft is a free, privacy-focused PDF toolkit

    ...But beyond manual editing, it also offers a programmable layer so developers can write scripts to batch process documents, generate templated reports, or extract structured data from PDFs for integration in workflows. The design emphasizes quality and compatibility: output PDFs render accurately across readers, preserve metadata, and support interactive elements like hyperlinks and form fields.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    jsPDF

    jsPDF

    HTML5 client solution for generating PDFs

    The leading HTML5 client solution for generating PDFs. Perfect for event tickets, reports, certificates, you name it! PDFs are ubiquitous across the web, with virtually every enterprise relying on them to share documents. We created jsPDF to solve a major problem with how pdf files were being generated. We decided to make it open-source to allow a community of developers to expand on it.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 9
    py-pdf-parser

    py-pdf-parser

    A Python tool to help extracting information from structured PDFs

    py-pdf-parser is a Python tool designed to help extract information from structured PDFs. It provides a simple interface to define parsing rules and extract data from PDF documents. ​
    Downloads: 2 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    Unredact

    Unredact

    A simple tool for reading in poorly redacted documents

    Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 11
    Vanilla.PDF

    Vanilla.PDF

    Cross-platform SDK for creating and modifying PDF documents

    Vanilla.PDF is a modern, high-performance, open-source C++17 SDK designed for creating, editing, signing, and analyzing PDF documents across multiple platforms. It requires no external runtime dependencies, making it lightweight and ideal for embedding into desktop applications, servers, or automation pipelines. The SDK offers full cross-platform support including Windows, Linux, macOS, and Android, with builds available for major compilers and architectures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    ...The tool combines deterministic parsing methods with an optional hybrid AI-powered mode that improves extraction quality for difficult layouts such as multi-column documents, scanned files, and scientific papers. It includes built-in OCR capabilities supporting dozens of languages, making it suitable for digitizing low-quality or image-based PDFs. A key differentiator is its emphasis on accessibility automation, as it can generate tagged PDFs aligned with accessibility standards, significantly reducing manual remediation effort.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    OCRBase

    OCRBase

    MD/.JSON Document OCR and structured data extraction API

    OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput. The core output is designed for downstream automation, producing structured results like JSON according to user-defined schemas while also providing readable formats like Markdown for human review or indexing. It includes real-time job progress updates via WebSockets, which makes it easier to integrate into UIs, dashboards, or ingestion systems where users need feedback on long-running document processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Snappy PDF

    Snappy PDF

    A ServiceProvider for Snappy

    ...Because it relies on wkhtmltopdf, it supports complex HTML and CSS (including many types of fonts, layouts, and images) much better than many pure-PHP solutions. It’s commonly used for invoices, reports, tickets, labels, and any other server-side generated documents where consistent layout matters. The package is widely used in the Laravel ecosystem and has community documentation, tutorials, and integrations with other tools and CMSs based on Laravel.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    litePDF

    a library to create/modify PDF documents using HDC/TCanvas

    litePDF is a Windows library (DLL), which allows creating new and editing of existing PDF documents with simple API. Page content is drawn with standard GDI functions through a device context (HDC or TCanvas, in case of Delphi or C++ Builder).
    Downloads: 12 This Week
    Last Update:
    See Project
  • 16
    NAPS2 - Not Another PDF Scanner

    NAPS2 - Not Another PDF Scanner

    Scan documents to PDF and other file types, as simply as possible.

    Visit NAPS2's home page at www.naps2.com. NAPS2 is a document scanning application with a focus on simplicity and ease of use. Scan your documents from WIA- and TWAIN-compatible scanners, organize the pages as you like, and save them as PDF, TIFF, JPEG, PNG, and other file formats. Available on Windows, Mac, and Linux. NAPS2 is currently available in over 40 different languages. Want to see NAPS2 in your preferred language? Help translate! See the wiki for more details.
    Leader badge
    Downloads: 626 This Week
    Last Update:
    See Project
  • 17
    workerPdf

    workerPdf

    WorkerPDF is GUI for GhostScript created for PDF conversion

    WorkerPDF uses GhostScript https://www.ghostscript.com/. WorkerPDF created for PDF conversion. Program features: - Compress pdf documents; - Combine pdf; - Moving pdf pages; - Rotating pdf pages; - Creating pdf from images; - Convert pdf to images. - Encrypt, decrypt pdf WorkerPDF использует GhostScript https://www.ghostscript.com/. WorkerPDF создан для преобразования PDF. Возможности программы: - Сжатие pdf документов; - Объединение pdf; - Перестановка страниц pdf; - Поворот страниц pdf; - Создание pdf из изображений; - Преобразование pdf в изображения...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 18

    PDFTK Builder Enhanced

    Enhanced version of the PDFTK Builder GUI for PDF Toolkit on Windows

    Free and open source GUI application for manipulating PDF files using the Windows version of PDF Toolkit (PDFtk) - split, merge, stamp, number pages, rotate, metadata, bookmarks, attachments, etc. This project is a fork of PDFTK Builder by Angus Johnson that enhances the user interface, adds functions, and enables use of later versions of PDFtk. OS: Windows. Author: David King. License: GPLv3.
    Leader badge
    Downloads: 173 This Week
    Last Update:
    See Project
  • 19

    efactuur-pdf-nl

    PDF generation for Dutch UBL and SETU invoices

    The EfactuurNL2PDF project provides the following functionality: - PDF generation for UBL or SETU Invoice documents. - Schematron validation stylesheets - Genericode validation stylesheets The following HR-XML-NL and UBL-NL message versions are currently supported in this project : - NLCIUS (si-ubl-2.0.1) - UBL Invoice 1.9 - UBL Invoice 1.8 - UBL Invoice 1.7 - UBL Invoice 1.6.3 - UBL Invoice 1.6.2 - UBL Invoice 1.1 - SETU Invoice 2.0 - SETU Invoice 1.8.1 - SETU Invoice 1.7 - SETU Invoice 1.6.4 - SETU Invoice 1.1
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Super PDF Editor (a Batch PDF Processor)

    Super PDF Editor (a Batch PDF Processor)

    Create, Edit, Delete, Organize , Convert, Export, Secure & Sign PDF.

    Super PDF Editor - Powerful, superfast, lightweight PDF processor. All-in-one PDF solution, PDF editing with 80+ tools and functions. The easy-to-use software is complete with editing tools for modifying PDF files your way. Most comprehensive, powerful, process-based and lightning-fast batch processor software. OCR PDF. PDF Imposition, Reverse Pages, Resize Page, Scale Page, Booklet, N-up Pages, Merge, Split by page, Extract Page, Rotate Page. Replace Page, Insert Page, Delete Page....
    Leader badge
    Downloads: 17 This Week
    Last Update:
    See Project
  • 21

    Create Index from PDF

    PDF Indexing Script: Searches PDF for words, records page numbers

    ...The final output is a text file with each word followed by the page numbers where it appears, separated by commas. This script is ideal for anyone looking to build an automated index for their PDF documents. With detailed comments and a clear structure, it's easy to customize and use for various indexing projects for researchers, authors, and anyone needing a precise and automated indexing solution.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    VDP Mail Merge 1.7

    VDP Mail Merge 1.7

    High Volume Variable Data Printing & Mail Merge in PDF format

    FREE & user friendly Variable Data Printing tool, to impose Variable Text onto PDF documents. (Mail Merge, Numbered Tickets, Bates numbering, Mass mail Addresses, customized Brochures / Post cards, Contact information, Voters Lists ... etc) Capable of processing more than 2,000,000 Pages or 10,000,000 Records.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Pdf4Tcl is a library for generating PDF documents from Tcl.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 24
    Merge to PDF

    Merge to PDF

    Simple portable Java app to merge single images and documents to PDF

    Portable Java app to merge single PDF documents and JPG images to one PDF document. Standalone app include own Java. * Application is in the BETA version - please report bugs and issues on Tickets page. *
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    PoDoFo

    A PDF parsing, modification and creation library.

    ...It can parse and modify existing PDF files and create new ones from scratch. It also includes several tools to work with PDF files. It features an unique approach which provides access to PDF documents via an object tree. Therefore, PDFs can be created and or manipulated using a simple tree structure. Development of PoDoFo has been moved to GitHub: https://github.com/podofo/podofo Please raise new issues in the GitHub project.
    Leader badge
    Downloads: 40 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB