Showing 27 open source projects for "documents"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 1
    PDF4QT

    PDF4QT

    Open source PDF editor

    PDF4QT is open source PDF editor based on Qt framework. It contains a C++ library, applications for viewing/editing PDF documents, and a command line tool. PDF4QT is an open-source PDF editor for Windows/Linux. It is a modern solution for viewing/editing/rendering PDF documents, for users and developers alike. For developers, there is a C++ library and a command line tool for use in scripts. For users, there are four applications offering many features. The project is hosted on Github and uses the license LGPLv3. ...
    Downloads: 54 This Week
    Last Update:
    See Project
  • 2
    QuestPDF

    QuestPDF

    A library that can help you with generating PDF documents

    Quickly design and generate PDF documents with an open-source, modern, and battle-tested C# library. Forget about limitations, feel confident, enjoy your task and efficiently deliver professional products. QuestPDF is a progressive library that can help you with generating PDF documents in your .NET application by offering a friendly, discoverable and predictable C# fluent API.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    BentoPDF

    BentoPDF

    A Privacy First PDF Toolkit

    BentoPDF is a self-hosted, open-source PDF toolkit that provides a suite of local PDF manipulation features for users who want full control over their documents without relying on cloud PDF services. It offers functionality to merge, split, compress, rotate, and convert PDFs through an easy-to-deploy container or local installation, making it ideal for individuals and teams that handle large volumes of PDF files regularly. Because it runs entirely on your own system or server, BentoPDF helps ensure that sensitive documents never leave your environment — a critical consideration for privacy-conscious users or organizations with strict data policies. ...
    Downloads: 123 This Week
    Last Update:
    See Project
  • 4
    iLovePDF Api

    iLovePDF Api

    iLovePDF Rest Api - PHP Library

    ...We offer a simple and concise API Reference and Guide as well as API Libraries with their own docs too. Our infrastructure uses the best PDF technology for processing PDF files. Merge and split documents with a variety of custom options. Remove, extract or organize PDF pages as you need. Reduce the size of your PDF while maintaining its original quality and formatting. Easily convert Images, MS Word, PowerPoint and Excel files into non-editable PDF documents. Convert PDF documents to JPG images or to PDF/A format.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    tinypdf

    tinypdf

    Minimal PDF creation library

    ...The library supports essential primitives like writing text, drawing basic shapes, and placing JPEG images, which covers common needs such as invoices, receipts, tickets, and simple reports. It also supports clickable links so generated documents can include interactive URLs, and it can create multi-page documents with custom page sizes. A notable convenience is built-in markdown-to-PDF conversion for common structures like headers and lists, letting you go from formatted text to a PDF layout quickly.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    OCRmyPDF

    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 104 This Week
    Last Update:
    See Project
  • 7
    PDFCraft

    PDFCraft

    PDFCraft is a free, privacy-focused PDF toolkit

    ...But beyond manual editing, it also offers a programmable layer so developers can write scripts to batch process documents, generate templated reports, or extract structured data from PDFs for integration in workflows. The design emphasizes quality and compatibility: output PDFs render accurately across readers, preserve metadata, and support interactive elements like hyperlinks and form fields.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    jsPDF

    jsPDF

    HTML5 client solution for generating PDFs

    The leading HTML5 client solution for generating PDFs. Perfect for event tickets, reports, certificates, you name it! PDFs are ubiquitous across the web, with virtually every enterprise relying on them to share documents. We created jsPDF to solve a major problem with how pdf files were being generated. We decided to make it open-source to allow a community of developers to expand on it.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 9
    py-pdf-parser

    py-pdf-parser

    A Python tool to help extracting information from structured PDFs

    py-pdf-parser is a Python tool designed to help extract information from structured PDFs. It provides a simple interface to define parsing rules and extract data from PDF documents. ​
    Downloads: 2 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Unredact

    Unredact

    A simple tool for reading in poorly redacted documents

    Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 11
    Vanilla.PDF

    Vanilla.PDF

    Cross-platform SDK for creating and modifying PDF documents

    Vanilla.PDF is a modern, high-performance, open-source C++17 SDK designed for creating, editing, signing, and analyzing PDF documents across multiple platforms. It requires no external runtime dependencies, making it lightweight and ideal for embedding into desktop applications, servers, or automation pipelines. The SDK offers full cross-platform support including Windows, Linux, macOS, and Android, with builds available for major compilers and architectures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    OCRBase

    OCRBase

    MD/.JSON Document OCR and structured data extraction API

    OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput. The core output is designed for downstream automation, producing structured results like JSON according to user-defined schemas while also providing readable formats like Markdown for human review or indexing. It includes real-time job progress updates via WebSockets, which makes it easier to integrate into UIs, dashboards, or ingestion systems where users need feedback on long-running document processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Snappy PDF

    Snappy PDF

    A ServiceProvider for Snappy

    ...Because it relies on wkhtmltopdf, it supports complex HTML and CSS (including many types of fonts, layouts, and images) much better than many pure-PHP solutions. It’s commonly used for invoices, reports, tickets, labels, and any other server-side generated documents where consistent layout matters. The package is widely used in the Laravel ecosystem and has community documentation, tutorials, and integrations with other tools and CMSs based on Laravel.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    litePDF

    a library to create/modify PDF documents using HDC/TCanvas

    litePDF is a Windows library (DLL), which allows creating new and editing of existing PDF documents with simple API. Page content is drawn with standard GDI functions through a device context (HDC or TCanvas, in case of Delphi or C++ Builder).
    Downloads: 12 This Week
    Last Update:
    See Project
  • 15
    NAPS2 - Not Another PDF Scanner

    NAPS2 - Not Another PDF Scanner

    Scan documents to PDF and other file types, as simply as possible.

    Visit NAPS2's home page at www.naps2.com. NAPS2 is a document scanning application with a focus on simplicity and ease of use. Scan your documents from WIA- and TWAIN-compatible scanners, organize the pages as you like, and save them as PDF, TIFF, JPEG, PNG, and other file formats. Available on Windows, Mac, and Linux. NAPS2 is currently available in over 40 different languages. Want to see NAPS2 in your preferred language? Help translate! See the wiki for more details.
    Leader badge
    Downloads: 626 This Week
    Last Update:
    See Project
  • 16
    workerPdf

    workerPdf

    WorkerPDF is GUI for GhostScript created for PDF conversion

    WorkerPDF uses GhostScript https://www.ghostscript.com/. WorkerPDF created for PDF conversion. Program features: - Compress pdf documents; - Combine pdf; - Moving pdf pages; - Rotating pdf pages; - Creating pdf from images; - Convert pdf to images. - Encrypt, decrypt pdf WorkerPDF использует GhostScript https://www.ghostscript.com/. WorkerPDF создан для преобразования PDF. Возможности программы: - Сжатие pdf документов; - Объединение pdf; - Перестановка страниц pdf; - Поворот страниц pdf; - Создание pdf из изображений; - Преобразование pdf в изображения...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 17

    efactuur-pdf-nl

    PDF generation for Dutch UBL and SETU invoices

    The EfactuurNL2PDF project provides the following functionality: - PDF generation for UBL or SETU Invoice documents. - Schematron validation stylesheets - Genericode validation stylesheets The following HR-XML-NL and UBL-NL message versions are currently supported in this project : - NLCIUS (si-ubl-2.0.1) - UBL Invoice 1.9 - UBL Invoice 1.8 - UBL Invoice 1.7 - UBL Invoice 1.6.3 - UBL Invoice 1.6.2 - UBL Invoice 1.1 - SETU Invoice 2.0 - SETU Invoice 1.8.1 - SETU Invoice 1.7 - SETU Invoice 1.6.4 - SETU Invoice 1.1
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Pdf4Tcl is a library for generating PDF documents from Tcl.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 19

    PoDoFo

    A PDF parsing, modification and creation library.

    ...It can parse and modify existing PDF files and create new ones from scratch. It also includes several tools to work with PDF files. It features an unique approach which provides access to PDF documents via an object tree. Therefore, PDFs can be created and or manipulated using a simple tree structure. Development of PoDoFo has been moved to GitHub: https://github.com/podofo/podofo Please raise new issues in the GitHub project.
    Leader badge
    Downloads: 40 This Week
    Last Update:
    See Project
  • 20
    pdf-editor

    pdf-editor

    Edit your PDFs without needing a subscription or creating accounts

    ...Add a parser for the command line to do multiple commands at once e.g. merge (cut pdf1) pdf2. Tested working with Python 3.8.5. Install venv (py -3.8 -m pip install virtualenv). PDF and Word documents are binary files, which makes them much more complex than plaintext files. In addition to text, they store lots of font, color, and layout information. If you want your programs to read or write to PDFs or Word documents, you’ll need to do more than simply pass their filenames to open().
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Free editor for PDF documents. Complete editing of PDF documents is possible with PDFedit. You can change raw pdf objects (for advanced users) or use many gui functions. Functionality can be easily extended using a scripting language (ECMAScript).
    Leader badge
    Downloads: 122 This Week
    Last Update:
    See Project
  • 22
    PyPDFConvert

    PyPDFConvert

    PyPDFConverter is program to convert Word files and images to pdf

    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    jpeg2pdf

    Create PDF from JPEG scans and photos

    Cross-platform command-line tool for creation of PDF documents from scans/photos of pages in JPEG (.jpg) format and the lightest weight ANSI C library to put multiple JPEG files into one PDF file. You can add handwritten comments to PDF scans (over original images) with xournal: http://xournal.sourceforge.net/ It supports graphics tablets and saves comments to PDFs as vector data.
    Leader badge
    Downloads: 21 This Week
    Last Update:
    See Project
  • 24
    TCPDF - PHP class for PDF

    TCPDF - PHP class for PDF

    PHP class for PDF

    TCPDF is a PHP class for generating PDF documents without requiring external extensions. TCPDF Supports UTF-8, Unicode, RTL languages, XHTML, Javascript, digital signatures, barcodes and much more. IMPORTANT: This version will be soon marked as deprecated and replaced by a new version currently under development: https://github.com/tecnickcom/tc-lib-pdf
    Leader badge
    Downloads: 132 This Week
    Last Update:
    See Project
  • 25
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions of iText build on the success of previous versions and feature an improved document engine, high and low-level programming capabilities, and a more efficient modular structure. iText represents the next level for developers looking to leverage PDF in document workflows. ...
    Leader badge
    Downloads: 146 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB