Showing 420 open source projects for "pdf ocr windows"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    fpdf2

    fpdf2

    Simple PDF generation for Python

    fpdf2 is a library for simple & fast PDF document generation in Python. It is a fork and the successor of PyFPDF. Compared with other PDF libraries, fpdf2 is fast, versatile, easy to learn and to extend (example). It is also entirely written in Python and has very few dependencies: Pillow, defusedxml, & fontTools. It is a fork and the successor of PyFPDF.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    pdfme

    pdfme

    A TypeScript based PDF generator library, made with React

    TypeScript base PDF generator and React-based UI. Open source, developed by the community, and completely free to use under the MIT license. No complex operations are required. Just bring your favorite template and generate all the PDFs you need. Works on node and the browser. Anyone can easily create and modify templates using Designer (UI template editor). Templates have a JSON document representation, which makes theme easy to understand and easy to work with.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    xhtml2pdf

    xhtml2pdf

    A library for converting HTML into PDFs using ReportLab

    xhtml2pdf enables users to generate PDF documents from HTML content easily and with automated flow control such as pagination and keeping text together. The Python module can be used in any Python environment, including Django. The Command line tool is a stand-alone program that can be executed from the command line.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    LibPDF

    LibPDF

    A modern PDF library for TypeScript

    LibPDF-js/core is a modern, TypeScript-first PDF processing library that provides a comprehensive toolkit for parsing, modifying, and generating PDF documents with a clean, intuitive API designed to handle real-world files safely and robustly. Unlike many existing JavaScript PDF libraries, it emphasizes lenient parsing that can gracefully handle malformed structures and fallback strategies where typical parsers fail, making it useful for production environments that encounter unpredictable...
    Downloads: 1 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 5
    HummusJS

    HummusJS

    Node.js module for high performance creation and modification of PDFs

    PDFWriter latest release (4.5.12) includes support for fonts that contain Emojis. Notable examples for Emoji fonts are Windows Segoe UI emoji and Google Noto font. This means that writing text that include emojis will result in lovely colorful emojis, rather than black and white representations. PDFHummus is a fast and free PDF Writing, Parsing and Modification library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    zpdf

    zpdf

    Zero-copy PDF text extraction library written in Zig

    zpdf is a high-performance PDF text extraction library written in Zig that focuses on speed, low overhead, and modern parsing techniques. It leans heavily on memory-mapped file reading and zero-copy patterns where possible, so it can scan large PDFs without repeatedly copying data around in memory. The library supports streaming extraction using efficient arena allocation, making it well suited for workloads that need to process big documents quickly or in batches. It implements multiple PDF...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    go-wkhtmltopdf

    go-wkhtmltopdf

    Golang commandline wrapper for wkhtmltopdf

    We needed a way to generate PDF documents from Go. These vary from invoices with highly customizable layouts to reports with tables, graphs and images. In our opinion, the best way to do this was by using HTML/CSS templates as a source for our PDFs. Using CSS print media types and millimeters instead of pixel units we can generate very accurate PDF documents using wkhtmltopdf. It takes care of setting the correct order of options as these can become very long with multiple pages where you...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Laravel DataTables Buttons Plugin

    Laravel DataTables Buttons Plugin

    Laravel DataTables Buttons Plugin

    A Laravel DataTables plugin for handling server-side exporting of tables such as CSV, excel, pdf, etc. This package is a plugin of Laravel DataTables for handling server-side functions of exporting the table as CSV, excel, pdf, and printing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    carbone

    carbone

    Fast and simple report generator, from JSON to pdf, xslx, docx, odt

    Turn your JSON into PDF, DOCX, XLSX, PPTX, ODS and many more. Fast, Simple and Powerful report generator in any format PDF, DOCX, XLSX, ODT, PPTX, ODS, XML, CSV using templates and your JSON data as input.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    PHP-SVG-Lib

    PHP-SVG-Lib

    SVG file parsing / rendering library

    php-svg-lib is a PHP library designed for parsing and rendering SVG files. It provides functionalities to rasterize SVG content to various surfaces, such as images or PDFs, and is utilized by projects like Dompdf for SVG rendering within PDF documents.​
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    PEASS-ng

    PEASS-ng

    Privilege Escalation Awesome Scripts SUITE

    ...Any misuse of this software will not be the responsibility of the author or of any other collaborator. Use it at your own machines and/or with the owner's permission. Here you will find privilege escalation tools for Windows and Linux/Unix and MacOS. Find the latest versions of all the scripts and binaries in the releases page. Check the parsers directory to transform PEASS outputs to JSON, HTML and PDF.
    Downloads: 109 This Week
    Last Update:
    See Project
  • 12
    Asciidoc Editor based on JavaFX 20

    Asciidoc Editor based on JavaFX 20

    Asciidoc Editor and Toolchain written with JavaFX 19

    Asciidoc FX is a WYSIWYG editor for the Asciidoc markup language. You can build PDF, Epub, and HTML books, documents, and slides. Supported Operating Systems and Builds shows the list of available builds with links for reference. If you are looking for the very latest version, visit the link in the note above to be guaranteed of downloading the latest and greatest version of AsciidocFX. AsciidocFX converts documents via the AsciidoctorJ library.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    OmniTools

    OmniTools

    Self-hosted collection of powerful web-based tools for everyday tasks

    OmniTools is a self-hosted web application that bundles a large collection of everyday utilities into a single clean interface you can run on your own infrastructure. It’s designed to replace the random assortment of “free online tools” people use for quick tasks, while avoiding ads, tracking, and the need to upload sensitive files to unknown servers. A key design choice is that file processing happens entirely on the client side, meaning your data stays in your browser instead of being sent...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14
    ripgrep

    ripgrep

    Regex pattern directory search tool that respects your .gitignore

    ripgrep is a line-oriented search tool that actively searches the directory you're currently in for a regex pattern. By default, ripgrep will ignore your .gitignore and skip hidden files or directories and binary files automatically. ripgrep has first class support on Windows, macOS and Linux, with binary downloads available for every release. ripgrep is similar to other popular search tools like The Silver Searcher, ack and grep. ripgrep supports arbitrary input preprocessing filters which could be PDF text extraction, less supported decompression, decrypting, automatic encoding detection and so on. ...
    Downloads: 85 This Week
    Last Update:
    See Project
  • 15
    Keychron Hardware Design

    Keychron Hardware Design

    Industrial design files for Keychron keyboards and mice

    The Keychron-Keyboards-Hardware-Design project is a source-available repository that provides production-grade industrial design files for a wide range of Keychron keyboards and mice. It includes over one hundred device models with detailed CAD assets, allowing users to explore real-world hardware engineering practices and understand how commercial peripherals are constructed. The repository contains multiple file formats such as STEP, DXF, DWG, and PDF, enabling compatibility with...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    PageIndex

    PageIndex

    Document Index for Vectorless, Reasoning-based RAG

    PageIndex is an innovative open-source framework that reimagines retrieval-augmented generation (RAG) by eliminating conventional vector similarity search and instead building hierarchical semantic indexes that mirror a document’s natural structure. Rather than chunking text and embedding it into a vector database, PageIndex constructs a tree-structured index — similar to a detailed, AI-enhanced table of contents — that a large language model can traverse to locate the most relevant sections...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Sphinx

    Sphinx

    Main repository for the Sphinx documentation builder

    ...It was originally created for the Python documentation, and it has excellent facilities for the documentation of software projects in a range of languages. Of course, this site is also created from reStructuredText sources using Sphinx! HTML (including Windows HTML Help), LaTeX (for printable PDF versions), ePub, Texinfo, manual pages, plain text. Semantic markup and automatic links for functions, classes, citations, glossary terms and similar pieces of information. Easy definition of a document tree, with automatic links to siblings, parents and children. General index as well as a language-specific module index. ...
    Downloads: 28 This Week
    Last Update:
    See Project
  • 18
    Google Open Source Project Style Guide

    Google Open Source Project Style Guide

    Chinese version of Google open source project style guide

    Each larger open source project has its own style guide, a series of conventions on how to write code for the project (sometimes more arbitrary). When all the code maintains a consistent style, it is more important when understanding large code bases. easy. The meaning of "style" covers a wide range, from "variables use camelCase" to "never use global variables" to "never use exceptions". The English version of the project maintains the programming style guidelines used in Google. If the...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    node-canvas

    node-canvas

    Node canvas is a Cairo backed Canvas implementation for NodeJS

    If you don't have a supported OS or processor architecture, or you use --build-from-source, the module will be compiled on your system. This requires several dependencies, including Cairo and Pango. Note that libgif/giflib, librsvg and libjpeg are optional and only required if you need GIF, SVG and JPEG support, respectively. Cairo v1.10.0 or later is required. This project is an implementation of the Web Canvas API and implements that API as closely as possible. For API documentation,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    PromptingTools.jl

    PromptingTools.jl

    Streamline your life using PromptingTools.jl

    PromptingTools.jl is a Julia-based toolkit designed to simplify prompt engineering and unify interactions with multiple large language model providers through a consistent interface. It focuses on reducing the complexity of prompt creation by introducing templating systems, macros, and reusable functions that standardize how prompts are constructed and executed. The library provides a family of ai* functions that handle tasks such as generation, embeddings, classification, and data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    reveal.js

    reveal.js

    The HTML Presentation Framework

    reveal.js is a framework for creating beautiful interactive presentations using HTML. It comes with a wide range of features, including nested slides, auto-sliding, touch navigation, Markdown support, PDF export, speaker notes, theming and more. It also comes with a JavaScript API that allows you to control various other options, and a list of plugins that can be used to extend reveal.js further. reveal.js currently offers full support for any recently released version of the following...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    ImageMagick

    ImageMagick

    ImageMagick 7

    ...It is written in C and can be used on a variety of operating systems, including Linux, Windows, and macOS.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 23
    docToolchain

    docToolchain

    AsciiDoc Toolchain for technical Software Documentation

    docToolchain is a toolkit built around the “docs as code” philosophy, designed to make writing, maintaining, and publishing technical documentation as smooth as software development itself. It provides a suite of scripts and integrations to generate documentation from AsciiDoc (and related formats), run automated checks, incorporate diagrams, and publish outputs in different formats (HTML, PDF, etc.). The framework encourages documentation and architecture artifacts to live alongside source...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    SBCL

    SBCL

    Mirror of Steel Bank Common Lisp (SBCL)'s repository

    Steel Bank Common Lisp (SBCL) is a high-performance Common Lisp compiler. It is open-source/free software, with a permissive license. In addition to the compiler and runtime system for ANSI Common Lisp, it provides an interactive environment including a debugger, a statistical profiler, a code coverage tool, and many other extensions. SBCL runs on Linux, various BSDs, macOS, Solaris, and Windows. See the download page for supported platforms, and the getting started guide for additional...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 25
    Calibre-Web

    Calibre-Web

    Web app for browsing, reading and downloading eBooks stored in Calibre

    Calibre-Web is a web app providing a clean interface for browsing, reading and downloading eBooks using an existing Calibre database. User management with fine-grained per-user permissions. User Interface in Brazilian, Czech, Dutch, English, Finnish, French, German, Greek, Hungarian, Italian, Japanese, Khmer, Polish, Russian, simplified and traditional Chinese, Spanish, Swedish, Turkish, Ukrainian. Filter and search by titles, authors, tags, series and language. Support for editing eBook...
    Downloads: 21 This Week
    Last Update:
    See Project
Auth0 Logo