Showing 67 open source projects for "text"

  • 1
    Ozyr

    Ozyr is a simple and easy to use OCR snipping tool

    Ozyr is a simple, easy-to-use OCR snipping tool that extracts text from images so you can copy and edit it. Source code: https://github.com/PETEROLO291/Ozyr Installer: 117 MB. Program: 524 MB. Version: 1.0
    Downloads: 0 This Week
  • 2
    gImageReader

    A graphical frontend to tesseract-ocr

    ...Features include:
    - Import PDF documents and images from disk, scanning devices, clipboard and screenshots
    - Process multiple images and documents in one go
    - Manual or automatic recognition area definition
    - Recognize to plain text or to hOCR documents
    - Recognized text displayed directly next to the image
    - Post-process the recognized text, including spellchecking
    - Generate PDF documents from hOCR documents
    **Note**: This page is only a mirror for the downloads. Development is happening on GitHub at https://github.com/manisandro/gImageReader, release binaries are also posted there.
    Downloads: 179 This Week
  • 3
    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have to...
    Downloads: 0 This Week
  • 4
    Manga Rikai OCR
    Manga Rikai is the first consumer-ready multi-page manga OCR/translation engine. It is a spiritual successor to Capture2Text, Visual Novel Reader, and Textractor. At the moment, the engine can capture and translate a single text box, or detect all text boxes in a page or in as many pages as you want. You can also edit the text, save your progress, and export your work as an HTML file. Got problems? Join our Discord: https://discord.com/invite/BuNuanw
    Downloads: 6 This Week
  • 5
    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    Linux-Intelligent-Ocr-Solution (Lios) is free and open source software for converting print into text using either a scanner or a camera. It can also produce text from scanned images from other sources, such as PDFs, images, folders containing images, or screenshots. The program is fully accessible to visually impaired users. A Tesseract Trainer GUI is also shipped with this package. Forum: https://groups.google.com/forum/#!forum/lios Video tutorial: https://www.youtube.com/playlist?...
    Downloads: 10 This Week
  • 6

    Merge PDF Files

    It is a Windows library that merges standard PDFs into a final PDF

    ...As a benchmark, the library was used to create a PDF from single-page scanned images processed by an OCR SDK (not included in our library; you can use any on the market): 20,000 images (the OCR SDK creates single-page, text-searchable PDFs, running 50 threads) in 80 minutes. The final searchable PDF was 800 MB. If you download the library, we provide a sample that covers all supported scenarios (synchronous and asynchronous).
    Downloads: 1 This Week
  • 7
    pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which contain only images (but no editable text) will be processed by optical character recognition (OCR) and the text will be added to each page invisibly "behind" the images. pdfsandwich is a command line tool which is supposed to be useful to OCR scanned books or journals. It is able to recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries: convert, unpaper, tesseract, gs, and hocr2pdf (if tesseract < 3.03). ...
    Downloads: 331 This Week
  • 8

    Lords Mobile Player Statistics

    Compare yourself with others by analyzing Lords Mobile Screenshots

    Lords Mobile Player Statistics (Lords Mobile Stats for short) is a Windows application that parses Lords Mobile screenshots to extract player statistics for yourself and other players. After extracting the data you can compare players and view the data as a large sortable table. You can also export the results as a text file or render the table as an image (for sharing with your guild, for example). This project is still at an early development stage; please read the Wiki (see menu above) for details about what is working and what is not. There are still many features on the to-do list for future releases. Please consider supporting the project with a donation via the Donation link in the menu!
    Downloads: 0 This Week
  • 9
    OCR Web based

    OCR web based for Browser Firefox & PC

    ...Finally, I wish to inform you that you can write or draw directly on the canvas to get the subsequent character recognition and text extraction
    Downloads: 0 This Week
  • 10

    cbrTekStraktor

    an application to automatically extract text from comic books.

    cbrTekStraktor is an application that automatically extracts text from the text bubbles or speech balloons present in comic book reader files (CBR). Its prime goal is to support analysis of the texts of comic books. cbrTekStraktor can, however, also be used for scanlation or similar purposes. The application also lets you manually define text areas in CBR files, and it includes a simple graphical editor for further processing the extracted text. ...
    Downloads: 0 This Week
  • 11
    DoAllWithPDF_servicemenu

    KDE servicemenu for pdf

    Lets KDE users perform many operations on a PDF file from the right-click menu.
    Downloads: 2 This Week
  • 12

    WebDjVuTextEd

    Edit the OCR text layer of DjVu documents in a web browser

    WebDjVuTextEd lets you edit the text layer of OCR'ed DjVu documents in a web browser. You can modify the structure (paragraphs, lines, words...), create, delete, and edit text nodes, adjust their container boxes with the mouse, and run a spellchecker. The program does not read DjVu files directly; it requires exported XML text data and images. When used without a web server, you can open and save local files, but cannot take advantage of auto-save and spell checking. ...
    Downloads: 0 This Week
  • 13
    Transparent NotePad for Linux
    Useful when you have to re-type text from a picture that cannot be OCR-ed by FineReader. Just start the app, select transparency with the slide bar on the right, and type the text, looking at the picture with the text through the notepad.
    Downloads: 0 This Week
  • 14
    DJVU++

    The DjVu complete solution, with OCR Technology (Arabic, English).

    DjVu++ is a user-friendly program for manipulating DjVu files, such as eBooks, with plenty of editing features. The program offers a free replacement for the proprietary PDF format, with similar resolution and smaller file size. DjVu++ also supports OCR to handle text in scanned books and images. The program shows good performance for English as well as for Arabic, where it aims to lead free and commercial software in this area. The main features of DjVu++ are:
    o Manipulate DjVu files.
    o Support smaller size than PDF with the same performance.
    o DjVu++ supports two languages in the OCR technique (Arabic and English). ...
    Downloads: 5 This Week
  • 15
    Terese is a tool for proofreading OCR text. Terese tries to map the text back to the scanned image, and visually shows the differences. See the homepage for further details.
    Downloads: 0 This Week
  • 16
    yagf

    YAGF is a Tesseract and Cuneiform wrapper and helper

    ...Once you have a scanned image you can prepare it for recognition, select particular image areas for recognition, set the recognition language and so on. Recognized text is displayed in an editor window where it can be corrected, saved to disk or copied to the clipboard. YAGF also provides some facilities for multi-page recognition (see the online help for more details).
    Downloads: 5 This Week
  • 17
    Eye is an experimental OCR (image-to-text) application.
    Downloads: 0 This Week
  • 18
    Sanskrit / Hindi - Tesseract OCR

    Devanagari fonts traineddata for Tesseract OCR

    ...However, the Hindi training texts, images and box files are not provided, so it is difficult to improve accuracy by further refining the traineddata. Note that recognition is more accurate and faster if the training is done with the same or a similar font to the one used in the text to be OCRed. See https://sourceforge.net/p/tesseracthindi/wiki/OCR%20for%20Devanagari/ for more details.
    Downloads: 0 This Week
  • 19
    DrawPad

    Pattern recognition tool for images, PDFs and handwriting

    The tool is an optical recognition tool that runs in the following three modes:
    1. Drawing Pad: the user draws a character and the tool recognizes which character it is.
    2. Image OCR: an image-based OCR tool that recognizes text and barcodes present in the image. It also supports saving the OCR output.
    3. PDF OCR: a more advanced form of OCR, where the PDF is parsed into images and OCR is run on the result. At present, PDF OCR is not yet mature.
    Downloads: 0 This Week
  • 20
    hocr - Hebrew OCR c/c++ library
    Downloads: 0 This Week
  • 21
    CD+Graphics Magic
    Timeline based editor for creating Compact Disc Subcode Graphics (also known as CD+G or CDG). Both karaoke and multimedia styles of content are supported. Please visit cdgmagic.sf.net for examples playable directly in the HTML5 CD+G player. CD+Graphics Scribe utility (separate download -- click "Browse All Files" above) can now convert existing CDG karaoke content to CMP (CD+Graphics Magic Project), LRC (Enhanced Lyrics), and ASS (Advanced SubStation Alpha) format.
    Downloads: 21 This Week
  • 22

    edocias

    Electronic Document Index And Search

    EDocIAS (Electronic Document Index And Search) is a PHP-based tool for indexing and searching files of various types. Third-party tools (tesseract, xpdf, etc.) can be configured to support any type of file.
    Downloads: 0 This Week
  • 23

    ANPR for National Borders

    ANPR for National Borders Systems

    ...This is going to be achieved by taking the license plate image from the camera and processing it using the software I’m going to develop together with an open source OCR system. The output is going to be a text version of the plate number. The software will then look up that number in a database and retrieve the vehicle owner's information. Note: you need emgucv installed. You also need to move the project files to: C:\Emgu\emgucv-windows-x86XXX\Emgu.CV.Example\ Also note that the project was created using Visual Studio 2003 on Windows XP and might not run smoothly on newer versions; please try to update it.
    Downloads: 0 This Week
  • 24

    alto_search

    Search full text in ALTO xml and return context

    ALTO files are XML files that store the output of OCR. Alto_search searches for terms in these ALTO files. A term can be several words and multiple terms can be searched at the same time. The output is in XML format and contains the coordinates of the words that were found as well as textual context around the hits.
    Downloads: 0 This Week
  • 25
    Tested for Ubuntu Maverick - Create Audiobooks from eBooks, text or pictures. - Read eBooks or text aloud while scrolling through pages
    Downloads: 1 This Week