Showing 100 open source projects for "pdf para ocr"

View related business solutions
  • Digital Payments by Deluxe Payment Exchange Icon
    Digital Payments by Deluxe Payment Exchange

    A single integrated payables solution that takes manual payment processes out of the equation, helping reduce risk and cutting costs for your business

    Save time, money and your sanity. Deluxe Payment Exchange+ (DPX+) is our integrated payments solution that streamlines and automates your accounts payable (AP) disbursements. DPX+ ensures secure payments and offers suppliers alternate ways to receive funds, including mailed checks, ACH, virtual credit cards, debit cards, or eCheck payments. By simply integrating with your existing accounting software like QuickBooks®, you’ll implement efficient payment solutions for AP with ease—without costly development fees or untimely delays.
  • Innovate faster with enterprise-ready generative AI—enhanced by Gemini Icon
    Innovate faster with enterprise-ready generative AI—enhanced by Gemini

    Build, deploy, and scale machine learning (ML) models faster, with fully managed ML tools for any use case.

    Vertex AI offers everything you need to build and use generative AI—from AI solutions, to Search and Conversation, to 130+ foundation models, to a unified AI platform.
  • 1

    Tesseract OCR

    Open Source OCR Engine

    Tesseract is an open source OCR or optical character recognition engine and command line program. OCR is a technology that allows for the recognition of text characters within a digital image. With the latest version of Tesseract, there is a greater focus on line recognition, however it still supports the legacy Tesseract OCR engine which recognizes character patterns. Tesseract can recognize over 100 languages out-of-the-box, and can be trained to recognize other languages. It supports...
    Downloads: 1,271 This Week
    Last Update:
    See Project
  • 2
    Super-PDF-Editor

    Super-PDF-Editor

    World's most comprehensive, powerful, process-based PDF editor

    World's most comprehensive, powerful, process-based and lighting fast PDF reader, editor and batch processor. PDF editing with 60+ features rich tools and function like OCR pdf and images and produce output like searchable PDF, Text, Hocr, Box, Unlv. Also, improve image enhancement before OCR operation for better OCR performance. pdf Imposition, etc. Super PDF Editor is best for bulk pdf processing, especially for the printing industry. Easy pdf imposition, booklet, n ups pages, and more. OCR...
    Downloads: 54 This Week
    Last Update:
    See Project
  • 3
    Super-PDF-Editor-Lite

    Super-PDF-Editor-Lite

    World's most comprehensive, powerful, process-based PDF editor

    World's most comprehensive, powerful, process-based and lighting fast PDF reader, editor and batch processor. Includes features like Create PDF from Images, HTML, Text files. Create a processing log file. Extract Page, Split Page, Rotate Page, Merge Page, Duplicate page, Move Page, Printing, and Compress Page. Improve image enhancement before OCR operation for better OCR performance. pdf Imposition, etc. Super PDF Editor is best for bulk pdf processing, especially for the printing industry...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 4
    NAPS2 - Not Another PDF Scanner

    NAPS2 - Not Another PDF Scanner

    Scan documents to PDF and other file types, as simply as possible.

    Visit NAPS2's home page at www.naps2.com. NAPS2 is a document scanning application with a focus on simplicity and ease of use. Scan your documents from WIA- and TWAIN-compatible scanners, organize the pages as you like, and save them as PDF, TIFF, JPEG, PNG, and other file formats. Available on Windows, Mac, and Linux. NAPS2 is currently available in over 40 different languages. Want to see NAPS2 in your preferred language? Help translate! See the wiki for more details.
    Leader badge
    Downloads: 1,265 This Week
    Last Update:
    See Project
  • Find out just how much your login box can do for your customer | Auth0 Icon
    Find out just how much your login box can do for your customer | Auth0

    With over 53 social login options, you can fast-track the signup and login experience for users.

    From improving customer experience through seamless sign-on to making MFA as easy as a click of a button – your login box must find the right balance between user convenience, privacy and security.
  • 5
    Stirling-PDF

    Stirling-PDF

    #1 Locally hosted web application that allows you to work on PDFs

    This is a robust, locally hosted web-based PDF manipulation tool using Docker. It enables you to carry out various operations on PDF files, including splitting, merging, converting, reorganizing, adding images, rotating, compressing, and more. This locally hosted web application has evolved to encompass a comprehensive set of features, addressing all your PDF requirements. Stirling PDF does not initiate any outbound calls for record-keeping or tracking purposes. All files and PDFs exist...
    Downloads: 178 This Week
    Last Update:
    See Project
  • 6
    OCRmyPDF

    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 7
    Papermerge

    Papermerge

    Open Source Document Management System for Digital Archives

    ...-source software which means that transparency is the core value of our software development. Source code can be reviewed and improved by anyone from anywhere. Papermerge supports multiple users. Each user can be assigned different permissions to perform only a specific kind of action e.g. view only documents from a specific folder. OCR technology is vital part of Papermerge. It extracts text information from scanned documents, PDF, JPEG, TIFF files.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 8
    Teedy

    Teedy

    Lightweight document management system

    ...-oriented document management system, the user interface is not cluttered with buttons and menus and works both on desktop and mobile. Document searching has never been easier thanks to the powerful full-text search engine in Teddy. You can search in images (embedded OCR), DOCX, ODT, TXT, PDF, and more. Verify or validate your documents with people of your organization using workflows.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 9
    docconv

    docconv

    Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

    A Go wrapper library to convert PDF, DOC, DOCX, XML, HTML, RTF, ODT, Pages documents and images (see optional dependencies below) to plain text. See go help install for details on the installation location of the installed docd executable. Make sure that the full path to the executable is in your PATH environment variable. To add image support to the docconv library you first need to install and build gosseract. Now you can add -tags ocr to any go command when building/fetching/testing docconv...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Propelling Payments for Software Platforms Icon
    Propelling Payments for Software Platforms

    For SaaS businesses to monetize payments through its turnkey PayFac-as-a-Service solution.

    Exact Payments delivers easy-to-integrate embedded payment solutions enabling you to rapidly onboard merchants, instantly activate a variety of payment methods and accelerate your revenue — delivering an end-to-end payment processing platform for SaaS businesses.
  • 10
    Paperless-ngx

    Paperless-ngx

    A community-supported supercharged version of paperless

    Paperless-ngx is a community-supported open-source document management system that transforms your physical documents into a searchable online archive so you can keep, well, less paper.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    OCR Manga Reader for Android

    OCR Manga Reader for Android

    Android Manga reader with Japanese OCR and dictionary capabilities

    OCR Manga Reader is a free and open source Android app that allows you to quickly OCR and lookup Japanese words in real-time. It does not have ads or telemetry/spyware and does not require an Internet connection. Supports both EDICT and EPWING dictionaries. Requires Android 4.0 (Ice Cream Sandwich) or higher. See http://ocrmangareaderforandroid.sourceforge.net/ for details.
    Leader badge
    Downloads: 51 This Week
    Last Update:
    See Project
  • 12
    Ana PDF

    Ana PDF

    Junte PDF's e Imagens

    Software realizado em homenagem a minha mãe que realiza trabalhos juntando PDF's mesclados com Imagens. Desenvolvido em C# realiza Merge/Junção de Arquivos PDF e Imagens de um jeito simples , prático e totalmente FREE. Para execução basta baixar o Zip Ana_PDF e executar Ana_PDF_Solution.exe , os PDF's gerados serão salvos na mesma estrutura de pasta onde estiver o executável. Abraços
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Form OCR Testing Tool

    Form OCR Testing Tool

    A set of tools to use in Microsoft Azure Form Recognizer

    An open source labeling tool for Form Recognizer, part of the Form OCR Test Toolset (FOTT). This is a MAIN branch of the Tool. It contains all the newest features available. This is NOT the most stable version since this is a preview. The purpose of this repo is to allow customers to test the tools available when working with Microsoft Forms and OCR services. Currently, Labeling tool is the first tool we present here. Users could provide feedback, and make customer-specific changes to meet...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    AnyTXT Searcher

    AnyTXT Searcher

    A Powerful Desktop Full-Text Search Engine, Just Like Local Google.

    .... You can quickly find any text in any file on your disk by Anytxt almost in 0.1 second. It works on Windows 11,10, 8, 7, Vista, XP, 2008, 2012, 2016,2022... AnyTXT Searcher supports the following file formats: Plain text (txt, cpp, py, html, etc.) Microsoft OneNote (one) Microsoft Word (doc, docx) Microsoft Excel (xls, xlsx) Microsoft PowerPoint (ppt, pptx) PDF WPS Office (wps, et, dps) EBook (epub, mobi, azw3, fb2 etc.) Mind Map Format (lighten, mmap, mm, xmind etc.) OFD .....
    Leader badge
    Downloads: 2,268 This Week
    Last Update:
    See Project
  • 15
    Provides optical character recognition (OCR) solutions for Vietnamese language.
    Leader badge
    Downloads: 504 This Week
    Last Update:
    See Project
  • 16
    A GUI to ease the process of producing a multipage PDF from a scan. gscan2pdf should work on almost any Linux/BSD machine.
    Leader badge
    Downloads: 261 This Week
    Last Update:
    See Project
  • 17
    PDF TO WORD (.docx)

    PDF TO WORD (.docx)

    Convertir PDF a WORD (docxs)

    Se debe seleccionar un pdf ya modificado, el tamaño del archivo puede ser de gran amaño en comparación al PDF, pero estoy trabajando en eso. Por ahora disfrútenlo. (Inge. SRM) Este proyecto en realidad lo hice para un compañero de Universidad que no sabía convertir de PDF a DOCXS, pero quiero compartir con el que lo necesite. Este proyecto usa "Tkinter" V8.6 y usa "pdf2docx" V0.5.8 para realizar las conversiones de PDF a DOCX. Si alguien conoce una librería más estable y mejor que "pdf2docx...
    Downloads: 22 This Week
    Last Update:
    See Project
  • 18
    Super PDF Editor Lite

    Super PDF Editor Lite

    Create, Edit, Delete, Organize , Convert, Export, Secure & Sign.

    Super PDF Editor Lite is a robust and versatile PDF management software designed to streamline your document handling needs. Whether you're an individual, student, or professional, this software offers a comprehensive suite of tools to create, edit, and manage your PDFs with ease. Key Features: Extract Page: Easily extract specific pages from a PDF document. Split Page: Divide a single PDF page into multiple smaller pages. Rotate Page: Rotate pages to adjust their orientation. Merge Page...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 19
    OpenKM Document Management - DMS

    OpenKM Document Management - DMS

    Document Management System and Content Management System

    ... technological architecture design, OpenKM meets the document management needs of businesses of all sizes (from SMEs to big corporations). Thanks to its elegant and intuitive interface, OpenKM transforms complex operations into easy tasks. The most relevant functions of OpenKM is the indexing of the most common types of files: text, Office, Office 2007, OpenOffice, PDF, HTML, XML, MP3, JPEG, etc. For a complete feature list take a look at http://goo.gl/au8cQy
    Leader badge
    Downloads: 635 This Week
    Last Update:
    See Project
  • 20
    Hathi Download Helper

    Hathi Download Helper

    Download books from the hathitrust website in a fast and easy manner

    2023-08-15: Hathi Download Helper 1.2.3 released Hathi Download Helper is a tool for downloading public domain books from hathitrust.org. Dependencies: - for merging of pdf files the freeware "pdftk" (http://www.pdflabs.com/) is used. Miscellaneous: - Developed under Qt 5.14.2 (mingw) Win 10 Pro - Compatible to Qt5 with Webkit - tested with Win 10 Pro, Mac OS 10.15 Catalina, Ubuntu 16.04-18.04 - binary installer for windows: HDH_vX.Y.Z_setup.exe - installer for Mac OS X
    Leader badge
    Downloads: 43 This Week
    Last Update:
    See Project
  • 21

    Image To Text tools

    ITTT is a Free tool designed to Scan and extract Text from Images.

    Image To Text Tools is a 100% Free user-friendly tool designed to Scan and extract containing text in images into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.
    Leader badge
    Downloads: 88 This Week
    Last Update:
    See Project
  • 22
    chessPDFBrowser

    chessPDFBrowser

    Chess application whichs allows working with chess PDF books and PGNs.

    Chess application which allows working with PDFs and PGNs. You can work with the chess games of the PDF and edit their tree of variants. Graphical environment. Standard PGN TAGs. PGN comments. Ocr like (Fen string detection from chess board position images). Connection to Uci chess engines (like stockfish). Position analysis, full game analysis. You can now play games against uci engines. pdf2pgn command line command included. Detailed documentation. Multilanguage currently support...
    Downloads: 52 This Week
    Last Update:
    See Project
  • 23
    MyBox

    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media https://github.com/Mararsh/MyBox Self-contain packages need not java env nor installation. Jar packages need Java 16 or higher.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 24
    Multiuser HylaFAX PHP/MySQL Web interface for viewing faxes online, downloading & emailing in PDF format, and categorizing & archiving all sent and received faxes.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25

    pdf2txt

    Script to convert PDF files to TXT files

    Two scripts using calibre and poppler to convert PDF files to TXT (plain text) files. Only use PDF files without spaces in the name. Dois scripts que usam calibre e poppler para converter arquivos PDF para arquivos TXT (texto simples). Apenas usem arquivos PDF sem espaços no nome.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next