Showing 420 open source projects for "pdf ocr windows"

View related business solutions
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    DOMPDF Wrapper for Laravel

    DOMPDF Wrapper for Laravel

    A DOMPDF Wrapper for Laravel

    Require this package in your composer.json and update composer. This will download the package and the dompdf + fontlib libraries also. After updating composer, add the ServiceProvider to the providers array in config/app.php. You can optionally use the facade for shorter code. To change the configuration, copy the config file to your config folder and enable it in bootstrap/app.php. You can create a new DOMPDF instance and load a HTML string, file or view name. You can save it to a file, or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Webifier

    Webifier

    A GitHub Action to deploy Notebooks, Markdowns

    Webifier is a stand-alone build tool for converting any repository into a deployable jekyll website. You can define your pages via yaml files and provide notebooks, markdown and pdf and other files for Webifier to render. It uses python markdown providing additional control over attributes and other extensive functionalities. It lets you define and direct how your web pages feel and automatically manages your assets, making it a perfect solution for fast static website development and a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Brighter

    Brighter

    A framework for building messaging apps with .NET and C#

    This project is a Command Processor & Dispatcher implementation with support for task queues that can be used as a lightweight library. It can be used for implementing Ports and Adapters and CQRS (PDF) architectural styles in .NET. It can also be used in microservices architectures for decoupled communication between the services. Brighter supports Task Queues. As such it can be used to improve performance by introducing concurrency using a queue, and/or as an integration strategy between...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    DocsGPT

    DocsGPT

    Private AI platform for agents, enterprise search and RAG pipelines

    DocsGPT is an open-source AI platform for deploying private RAG pipelines, AI agents, and enterprise search on your own infrastructure. Connect any data source (PDFs, DOCX, CSV, Excel, HTML, audio, GitHub, databases, URLs) and get accurate, hallucination-free answers with source citations. Choose your LLM: OpenAI, Anthropic, Google Gemini, or local models. Works with Qdrant, MongoDB, and Elasticsearch and more. Deploy via Docker or Kubernetes with full data sovereignty. Build...
    Downloads: 2 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 5
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    Candid PDF Table

    CandidPDFTable – Deterministic TCPDF Table Builder

    CandidPDFTable (Candid PDF Table Builder) is a deterministic, colspan-aware table builder designed specifically for TCPDF. It provides a clean and predictable API to construct HTML tables for TCPDF::writeHTML() using explicit, cell-owned borders and late-stage layout computation. The library is built for programmatic table generation where precise control over rows, columns, colspans, borders, and serial numbering is essential. Building complex tables directly in TCPDF becomes difficult...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    OculiX

    OculiX

    Visual Automation IDE — automate anything you see on screen

    OculiX is the evolution of SikuliX, actively maintained with the full agreement of its original creator RaiMan. Automate any desktop application using image recognition (OpenCV) and OCR (Tesseract + PaddleOCR). No access to source code or DOM required — if you can see it, you can automate it. Key features: - Guided step-by-step recorder with live code preview - Image recognition via OpenCV 4.10 - Dual OCR: Tesseract (built-in) + PaddleOCR (neural, high precision) - Local and remote automation via integrated VNC - SSH tunnels via embedded JSch - Cross-platform: Windows, macOS (Apple Silicon M1-M4), Linux - Scripting: Jython, JRuby, Java, PowerShell, AppleScript - Java 17 recommended (Java 8+ supported) - Full CI/CD with automated builds for all platforms Used worldwide for test automation, RPA, and visual regression testing. ...
    Leader badge
    Downloads: 154 This Week
    Last Update:
    See Project
  • 8

    toPDF

    Online service for PDF conversion (to PDF)

    A simple online service for PDF conversion. This project is a simple library and also a web application. It offers a REST service and a simple upload service for synchronous conversion. This library/application doesn't contain conversion libraries because it's a wrapper for existing tools. toPDF currently supports the open source tool PDF Creator (http://www.pdfforge.org) and the commercial solution, easy PDF, from BCL (http://www.pdfonline.com/easypdf/sdk/).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Bridgex

    Bridgex

    Convert files like docx, xlsx, pptx, html, and more to MarkDown

    Bridgex is a graphical user interface (GUI) developed in Rust (using FreyaUI) designed to easily convert various file types into Markdown format. Its main objective is to simplify access to the Markitdown library through a simple and modular visual experience, allowing for the rapid and efficient extraction of text from documents (such as PDFs). It is an ideal tool for those seeking a direct and uncomplicated file converter.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    java-pdf-table-extractor-lib

    java-pdf-table-extractor-lib

    Java Pdf Table extraction library

    The command line application is an example of usage of the Java library. The library is based on pdfbox library and works by looking for the layout of each selected pdf page, and looking for table structure patterns. After calling the library (passing the pdf filename, and the page range), the result is a List<PdfTextElement>. PdfTextElement is an interface that has two implementations. * A basic text (outside the tables) * And PdfTextTabulaElement, for table structures. That...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    JasperReports Library

    JasperReports Library

    Free Java Reporting Library

    JasperReports Library is the world's most popular open source business intelligence and reporting engine. It is entirely written in Java and it is able to use data coming from any kind of data source and produce pixel-perfect documents that can be viewed, printed or exported in a variety of document formats including HTML, PDF, Excel, OpenOffice and Word. The project is also available at: https://github.com/TIBCOSoftware/jasperreports Jaspersoft Studio is the open source report...
    Leader badge
    Downloads: 1,487 This Week
    Last Update:
    See Project
  • 13

    Vanga

    Structure-routed visual compiler: recognize and automate GUIs in YAML

    Vanga reads a screen — live or captured — and resolves it to an action. You describe the interface in plain YAML: tokens are pictures matched under a discipline, lexemes their spatial arrangements, with OCR for the content that changes. On a match it drives the GUI (click, type, route text); the logic turns a recognized screen into what to do — a watched value with prose rules, or a neighbourhood rule table over a tiled grid. The shipped examples are pure grammar, no Java: a calculator...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Static EBooks' Website Maker

    Static EBooks' Website Maker

    Create your Free Ebooks and EWriting WebSite

    On Linux only. Read the EPUB or PDF headers, creating static and lite HTML WebSite. Pascal Lazarus Sources and Components. For www.livrels.fr, www.livrels.com and www.scribels.org
    Downloads: 8 This Week
    Last Update:
    See Project
  • 15
    ESignPDF

    ESignPDF

    Sign PDF with Digital Signature Certificate

    Brought to you by parvesh88 System Requirements 1. A Computer running Windows 7 Service Pack 1 or higher 2. Java Runtime Environment (JRE) 8 or higher if JRE is not installed on your Computer, then download and install from here JRE 8 for Windows x86 -> https://www.azul.com/core-post-download/?endpoint=zulu&uuid=55abea0c-2aa5-4316-aafb-e90847f6ee21 JRE 8 for Windows x64 -> https://www.azul.com/core-post-download/?
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    OmegaT - multiplatform CAT tool

    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Leader badge
    Downloads: 2,286 This Week
    Last Update:
    See Project
  • 17
    FastReport Open Source

    FastReport Open Source

    Free Open Source Reporting tool for .NET

    Free Open Source Reporting tool for .NET Core/.NET Framework that helps your application generate document-like reports.
    Downloads: 60 This Week
    Last Update:
    See Project
  • 18

    Report Manager

    Reporting tool, libraries and designer

    Report Manager is both a print scheme designer (report) and a high level printing (reporting) engine. Also a TCP Report Server and a Web Report Server (PDF on the fly), supports Windows and Linux. Connectivity to almost all databases is provided.
    Downloads: 77 This Week
    Last Update:
    See Project
  • 19
    AuroreNR
    Software developed for the analysis of Neutron Reflectivity data.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    C# ECG Toolkit

    C# ECG Toolkit

    ECG Toolkit support for: SCP-ECG, DICOM, HL7 aECG, ISHNE & MUSE-XML

    C# ECG Toolkit is an open source software toolkit to convert, view and print electrocardiograms. The toolkit is developed using C# .NET Framework 2.0 and later (code also supports netstandard2.0). Support for ECG formats: SCP-ECG, DICOM, HL7 aECG, ISHNE, MUSE-XML and OmronECG.
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 21
    Small Python library with various things such as Configuration file parsing (in Python syntax), HTML and PDF parsing. Used in others of my projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Typst LSP

    Typst LSP

    A brand-new language server for Typst, plus a VS Code extension

    A brand-new language server for Typst. Syntax highlighting, error reporting, code completion, and function signature help. Compiles to PDF on save (configurable to as-you-type, or can be disabled)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Admidio

    Admidio

    Manage your organization online

    Admidio is a free open source user management system for websites of organizations and groups. The system has a flexible role model so that it’s possible to reflect the structure and permissions of your organization. You can create an individual profile for your members by adding or removing fields. Additional to these functions the system contains several modules like member lists, event manager, guestbook, photo album or download area.
    Leader badge
    Downloads: 59 This Week
    Last Update:
    See Project
  • 24
    jPicEdt

    jPicEdt

    Another drawing editor for LaTeX with PSTricks & TikZ

    jPicEdt is an extensible internationalized vector-based drawing editor for LaTeX and related packages (TikZ, PsTricks,...), written in Java. It is also a library of reusable high-level graphic primitives.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    dktools - Dirk Krauses tools

    dktools - Dirk Krauses tools

    Drawing, graphics conversion, software development, administration.

    GUI and command line tools for advanced users and administrators: wxdkdraw - Minimalistic drawing application for use with LaTeX, wxd2lat - Convert wxdkdraw files to LaTeX, bitmap2pp - Convert PNG/JPEG/TIFF/NetPBM to (E)PS or PDF, fig2lat - Convert XFig files to LaTeX, htmlbook - publish HTML like a book, dkcpre - C debugging and tracing preprocessor, itadmin - manage your IT using a MySQL/MariaDB database, dk-fic - file integrity checker, dk-ls - list files, output column order is configurable, dk-cat, dk-sort, dk-lines - text tools for *x and Windows, dk-send, dk-recv - transmit data stream, dk-t2h, dk-t2l - text to HTML or LaTeX conversion.
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
Auth0 Logo