Showing 25 open source projects for "pdf ocr windows"

View related business solutions
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    PyMuPDF

    PyMuPDF

    Python bindings for MuPDF's rendering library.

    MuPDF is a lightweight PDF, XPS, and E-book viewer. MuPDF consists of a software library, command line tools, and viewers for various platforms. The renderer in MuPDF is tailored for high-quality anti-aliased graphics. It renders text with metrics and spacing accurate to within fractions of a pixel for the highest fidelity in reproducing the look of a printed page on the screen. The viewer is small, fast, yet complete. It supports many document formats, such as PDF, XPS, OpenXPS, CBZ, EPUB,...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    PDF Split and Merge

    PDF Split and Merge

    Split and merge PDF files on any platform

    Split and merge PDF files with PDFsam, an easy-to-use desktop tool with graphical, command line and web interface.
    Leader badge
    Downloads: 201 This Week
    Last Update:
    See Project
  • 3
    Academicons

    Academicons

    An icon font for academics

    Academicons is a specialist icon font for academics. It contains icons for websites and organizations related to academia that are often missing from mainstream font packages. It can be used by itself, but its primary purpose is to be used as a supplementary package alongside a larger icon set. Go here to view the full icon set along with instructions for their use. The organization in question is already using a logo/icon of appropriate dimensions (roughly square). If that doesn't exist,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    DocSearcher
    DocSearcher is a search tool for indexing and searching files on a personal computer. It uses API's to provide search functionality for common document formats. currently: Word, Excel, PDF, Libre/Open/StarOffice, RTF, Text, and HTML
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Delphi : VRCalc++ and more Binary Exec

    Delphi : VRCalc++ and more Binary Exec

    Delphi Java - VRCalc++ OOSL (Script) and + (Binary Exec Distro)

    Vincent Radio {Adrix.NT} Embarcadero : Delphi : Executable Binaries Delphi : VRCalc++ Object Oriented Scripting Language : Engine + Ext Libraries VRCalc++ OOSL Visual Stage Project : VCL & FMX (FireMonkey) VRCalc++ Script Executor: - VCL Console - Terminal Console - FMX Console + VRCalc++ OOSL : VR System Scripted Standard Runtime Library Delphi Applics - VR Multi Editor : Smart Text Editor - VR Lazy Code Editor : Smart RTF Multi Lang Code Text Editor - VR Astro Vision...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    gImageReader

    gImageReader

    A graphical frontend to tesseract-ocr

    gImageReader is a simple Gtk/Qt front-end to tesseract. Features include: - Import PDF documents and images from disk, scanning devices, clipboard and screenshots - Process multiple images and documents in one go - Manual or automatic recognition area definition - Recognize to plain text or to hOCR documents - Recognized text displayed directly next to the image - Post-process the recognized text, including spellchecking - Generate PDF documents from hOCR documents **Note**:...
    Leader badge
    Downloads: 158 This Week
    Last Update:
    See Project
  • 7
    qiji-font

    qiji-font

    Typeface from Ming Dynasty woodblock printed books

    Typeface from Ming Dynasty woodblock printed books. A Ming typeface. Extracted from Ming Dynasty woodblock printed books (凌閔刻本). Using semi-automatic computer vision and OCR. Open-source. A work in progress. Named in honor of 閔齊伋, a 16th-century printer. Intended to be used with Kenyan-lang, the Classical Chinese programming language. Download high-resolution PDFs and split pages into images. Manually lay a grid on top of each page to generate bounding boxes for characters (potentially...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    A free OCR-A font, conformant to ANSI X3.17-1977, in TrueType format, with sources.
    Leader badge
    Downloads: 73 This Week
    Last Update:
    See Project
  • 9
    Quick Hash GUI

    Quick Hash GUI

    Linux, Windows and Apple Mac File Hashing GUI Tool

    This project has moved to www.quickhash-gui.org as of 2016-12-04. I kept v2.6.9.2 and below hosted here since Dec 16 but too many people were ignoring the fact that no updates were being posted here. For the latest QuickHash v2.8.4 release (Aug 28th 2017), go to www.quickhash-gui.org, and note that as of 29/12/16 a Debian package is also available
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    PdfTrick

    PdfTrick

    Pdf images extractor

    PdfTrick is a graphical selective pdf images extractor, for mac and windows platform, 64/32 bit.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11

    Pdf Text Extractor

    A Java Application that extracts text from pdf files.

    A Java Application that extracts text from pdf files. User can select different areas on the pdf file and can extract text from those areas.Extraction of text can be done for single or multiple pages. Generate Bookmarks on the basis of Font Heights entered by the user.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Regain is a Java search engine based on Jakarta Lucene. It provides indexing and searching files for plenty of formats (HTML,XML,doc(x),xls(x),ppt(x),oo,PDF,RTF,mp3,mp4,Java). A TagLibrary eases integrating search results in your JSP based web page.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 13

    PatentX - EPOScan extra utilities

    EPOScan ext folder utilities

    This is a software to operate some functions over the "ext" folder created by EPOScan(European Patent Office software for indexing and scanning patent document images) when the downloading option is selected. This folder is usually used by the ST33 software to convert the indexed images into ST33 standard.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ippfp is a set of php-classes aimed at building interfaces quickly and indepently from output format (xhtml, pdf, gtk, ncurses). You can create input masks or other user interfaces. This is an auxiliary site for http://sourceforge.net/projects/ippfp
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A plugin for the Gnome editor »gedit« that supports you in editing LaTeX documents and BibTeX bibliographies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A document post production tool for the Gnome Environment. Merge, split, reorganize pages and documents to print or export to PDF format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    jPdfCalendar is a tool which allows you to create printable calendar pages as PDF document from any user's images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    A Java implementation of a desktop search engine based on Apache Lucene. It indexes HTML-, XML-, OpenOffice- (Writer, Calc, Impress), MS Word-, and PDF- documents as well as plain text files. For other, arbitrary file types the file name can be indexed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    As the name of the application suggests, its very simple. User just have to provide the list of images and text files, whose contents they want to have in the PDF in a configuration file. The application reads the configuration file and generates the PDF
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Visual xsltproc is a tool which help to write xslt file, and debug it to find errors. It writes xml, and generates xml (Syntax highlighting of XML & line Nr.). Finally if the result is XSL-FO it generates the pdf on Apache FOP java. Build on QT4.2.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Yadoda is a personal digital library: user can create his own ontology and a db of digital documents (pdf,ps,mp3,images) that can be enriched with metadata (author,date,title). User can create semantic relations between documents and navigate them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    jRSVP is a tool for Rapid Serial Visual Presentation, a technique for extremely fast reading, written in Java. It runs under Java 1.4 and uses the Multivalent library for parsing of input documents. It reads PDF, PS, HTML, man pages and others.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Xom (Xml Object Model) is a lightweight, powerful, and extensible framework for representing Java gui elements. Also, general java gui tools/frameworks for creating impressive user interfaces. Includes a basic PDF viewer, and various gui components.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    When translating becomes a game ! Text to translate can be graphically selected. Several dictionnaries can be sorted according to the context. A large choice of matching strategies is available. The OCR engine is tunable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    pdfspot is a small desktop application to manage PDF files. Documents are strongly structured (title, authors, references) and the PDF format has hooks for meta data. The idea is to manage PDFs like MP3s ot digital photos. The name is adapted from f-spot
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo