Showing 48 open source projects for "text"

  • 1

    cbrTekStraktor

    An application to automatically extract text from comic books.

    cbrTekStraktor is an application to automatically extract text from the text bubbles or speech balloons present in comic book reader (CBR) files. Its prime goal is to support analysis of comic book texts, but it can also be used for scanlation or similar purposes. The application also lets you define text areas in CBR files manually and includes a simple graphical editor for further processing of the extracted text. ...
    Downloads: 0 This Week
  • 2

    DoAllWithPDF_servicemenu

    KDE servicemenu for PDF

    Lets KDE users perform many operations on a PDF file from the right-click menu.
    Downloads: 0 This Week
  • 3

    WebDjVuTextEd

    Edit the OCR text layer of DjVu documents in a web browser

    WebDjVuTextEd lets you edit the text layer of OCR'ed DjVu documents in a web browser. You can modify the structure (paragraphs, lines, words...), create, delete, and edit text nodes, adjust their container boxes with the mouse, and run a spellchecker. The program does not read DjVu files directly; it requires exported XML text data and images. When used without a web server, you can open and save local files but cannot take advantage of auto-save and spell checking. ...
    Downloads: 0 This Week
  • 4

    Elucidate

    Straightforward OCR for Mac to create searchable PDFs.

    Elucidate is now available on the Mac App Store: https://itunes.apple.com/us/app/elucidate/id1066088407?ls=1&mt=12
    Downloads: 0 This Week
  • 5
    Terese is a tool for proofreading OCR text. Terese tries to map the text back to the scanned image, and visually shows the differences. See the homepage for further details.
    Downloads: 0 This Week
  • 6
    Eye is an experimental OCR (image-to-text) application.
    Downloads: 0 This Week
  • 7

    Sanskrit / Hindi - Tesseract OCR

    Devanagari fonts traineddata for Tesseract OCR

    ...However, the Hindi training texts, images, and box files are not provided, so it is difficult to improve accuracy by further refining the traineddata. Recognition is more accurate and faster when training uses the same font as, or a similar font to, the text to be OCRed. See https://sourceforge.net/p/tesseracthindi/wiki/OCR%20for%20Devanagari/ for more details.
    Downloads: 0 This Week
  • 8

    DrawPad

    Pattern recognition tool for images, PDFs, and handwriting

    The tool is an optical recognition tool that runs in three modes:
    1. Drawing Pad: the user draws a character and the tool recognizes which character it is.
    2. Image OCR: recognizes text and barcodes present in an image, and supports saving the OCR output.
    3. PDF OCR: a more advanced form of OCR in which the PDF is converted to images and OCR is run on the result. PDF OCR is currently at a low level of maturity.
    Downloads: 0 This Week
  • 9
    CD+Graphics Magic
    Timeline based editor for creating Compact Disc Subcode Graphics (also known as CD+G or CDG). Both karaoke and multimedia styles of content are supported. Please visit cdgmagic.sf.net for examples playable directly in the HTML5 CD+G player. CD+Graphics Scribe utility (separate download -- click "Browse All Files" above) can now convert existing CDG karaoke content to CMP (CD+Graphics Magic Project), LRC (Enhanced Lyrics), and ASS (Advanced SubStation Alpha) format.
    Downloads: 21 This Week
  • 10

    edocias

    Electronic Document Index And Search

    EDocIAS (Electronic Document Index And Search) is a PHP-based tool for indexing and searching files of various types. Third-party tools (tesseract, xpdf, etc.) can be configured to support any type of file.
    Downloads: 0 This Week
  • 11

    ANPR for National Borders

    ANPR for National Borders Systems

    ...This will be achieved by taking the license plate image from the camera and processing it with the software I am going to develop, together with an open source OCR system. The output will be a text version of the plate number. The system will then look that number up in a database to retrieve the vehicle owner's information. Note: Emgu CV must be installed, and the project files must be moved to: C:\Emgu\emgucv-windows-x86XXX\Emgu.CV.Example\ The project was created with Visual Studio 2003 on Windows XP and might not run smoothly on newer versions; please try to update it.
    Downloads: 0 This Week
  • 12

    alto_search

    Search full text in ALTO xml and return context

    ALTO files are XML files that store the output of OCR. Alto_search searches for terms in these ALTO files. A term can be several words and multiple terms can be searched at the same time. The output is in XML format and contains the coordinates of the words that were found as well as textual context around the hits.
    Downloads: 0 This Week
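The kind of search described for alto_search can be sketched roughly as follows. This is a minimal illustration, not alto_search's actual code: the function name `search_alto`, the `context` parameter, and the dictionary output are hypothetical (the real tool emits XML). It relies only on the ALTO convention that each recognized word is a `String` element carrying `CONTENT`, `HPOS`, `VPOS`, `WIDTH`, and `HEIGHT` attributes, and it matches whole words exactly, ignoring case.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the XML namespace so any ALTO schema version is accepted."""
    return tag.rsplit("}", 1)[-1]


def search_alto(xml_text, term, context=2):
    """Find a (possibly multi-word) term; return word boxes plus nearby text.

    Hypothetical helper for illustration only.
    """
    root = ET.fromstring(xml_text)
    # Collect every recognized word in document order with its coordinates.
    words = [
        {
            "text": el.get("CONTENT", ""),
            "hpos": el.get("HPOS"),
            "vpos": el.get("VPOS"),
            "width": el.get("WIDTH"),
            "height": el.get("HEIGHT"),
        }
        for el in root.iter()
        if local_name(el.tag) == "String"
    ]
    needle = term.lower().split()
    hits = []
    if not needle:
        return hits
    # Slide a window of len(needle) words over the page and compare.
    for i in range(len(words) - len(needle) + 1):
        window = words[i:i + len(needle)]
        if [w["text"].lower() for w in window] == needle:
            lo = max(0, i - context)
            hi = min(len(words), i + len(needle) + context)
            hits.append({
                "words": window,
                "context": " ".join(w["text"] for w in words[lo:hi]),
            })
    return hits


SAMPLE = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v2#">
  <Layout><Page><PrintSpace><TextBlock><TextLine>
    <String CONTENT="The" HPOS="10" VPOS="20" WIDTH="30" HEIGHT="12"/>
    <String CONTENT="quick" HPOS="45" VPOS="20" WIDTH="40" HEIGHT="12"/>
    <String CONTENT="brown" HPOS="90" VPOS="20" WIDTH="42" HEIGHT="12"/>
    <String CONTENT="fox" HPOS="137" VPOS="20" WIDTH="25" HEIGHT="12"/>
    <String CONTENT="jumps" HPOS="167" VPOS="20" WIDTH="44" HEIGHT="12"/>
  </TextLine></TextBlock></PrintSpace></Page></Layout>
</alto>"""

if __name__ == "__main__":
    for hit in search_alto(SAMPLE, "quick brown", context=1):
        print(hit["context"], [(w["hpos"], w["vpos"]) for w in hit["words"]])
```

Searching several terms at once, as the description mentions, would amount to calling the helper once per term. Punctuation attached to a word is not stripped in this sketch, so a real implementation would likely need to normalize tokens first.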
  • 13
    charface
    ...It supports automatic detection of the following engines when installed: cuneiform with its languages, tesseract with its language database files, and gocr. It also supports adding custom engines, batch processing of images, and text postprocessing.
    Downloads: 0 This Week
  • 14
    This application can be used to read printed text in English. Scan the text to produce an image file. The image file is processed by the OCR engine, which converts it to a text file. The text file can then be read with screen reader software such as Orca.
    Downloads: 0 This Week
  • 15
    TCR Neuroph - Text Character Recognition
    TCR Neuroph - Text Character Recognition is a Java tool developed to recognize scanned text using the Java neural network framework Neuroph.
    Downloads: 0 This Week
  • 16
    ...Include: contour recognition; vectorisation; matrix letter feature recognition; automatic page segmentation and rotation detection; SS3 ASM core; XML base; web-based GUI; 99.6% printed Unicode text recognition; letter base up to 1200 letters.
    Downloads: 0 This Week
  • 17
    The purpose of this program is to take metadata and full-text OCR from ContentDM and export them into a database for use in other applications. The application is set up to generate a JPG derivative from either a TIF or JP2 associated with an object.
    Downloads: 1 This Week
  • 18
    Akshara Malayalam OCR is a project to develop an OCR for printed and handwritten documents in the Malayalam language. It is inspired by similar OCR software for other languages.
    Downloads: 0 This Week
  • 19
    Classnotes is an OCR tool intended to convert handwritten scans into text. Before the program can convert scans, the user must create a handwriting profile by training the OCR with scans.
    Downloads: 0 This Week
  • 20
    Got any emails with obnoxious inline text? Long text stories with bad formatting? Files that an OCR didn't quite translate right? RTF format files and no easy way to read or modify them? Then eBookFormatter is for you!
    Downloads: 0 This Week
  • 21
    Software to fit whole-sentence language models using the principle of maximum entropy. For developers of speech recognizers, text prediction interfaces, OCR, machine translation software.
    Downloads: 0 This Week
  • 22
    OpenOCR will be a commercial-quality OCR engine with tools for pre- and post-processing of images and the resulting text.
    Downloads: 0 This Week
  • 23
    When translating becomes a game! The text to translate can be selected graphically. Several dictionaries can be sorted according to context. A large choice of matching strategies is available. The OCR engine is tunable.
    Downloads: 0 This Week