Showing 153 open source projects for "ocr"

View related business solutions
  • Context for your AI agents Icon
    Context for your AI agents

    Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

    Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
    Try for free
  • Skillfully - The future of skills based hiring Icon
    Skillfully - The future of skills based hiring

    Realistic Workplace Simulations that Show Applicant Skills in Action

    Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.
    Learn More
  • 1
    An image postprocessor for the DIY Book Scanner described on instructables.com and diybookscanner.org. Gets images ready for OCR or for PDF. Written in Java based on a partial port of the Leptonica image processing library.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Graphical interface for Cuneiform OCR
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    JOcrad is a graphical frontend for GNU/Ocrad written in Java. GNU Ocrad is an OCR (Optical Character Recognition) program based on a feature extraction method.JOcrad supports italian and english languages, JPG,PNG and GIF images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    The purpose of this program is to take metadata and full text OCR from ContentDM and export into a database for use in other applications. The application is setup to generate a JPG derivative from either a TIF or JP2 associated with an object.
    Downloads: 0 This Week
    Last Update:
    See Project
  • All-in-one security tool helps you prevent ransomware and breaches. Icon
    All-in-one security tool helps you prevent ransomware and breaches.

    SIEM + Detection and Response for IT Teams

    Blumira’s detection and response platform enables faster resolution of threats to help you stop ransomware attacks and prevent data breaches. We surface real threats, providing meaningful findings so you know what to prioritize. With our 3-step rapid response, you can automatically block known threats, use our playbooks for easy remediation, or contact our security team for additional guidance. Our responsive security team helps with onboarding, triage and ongoing consultations to continuously help your organization improve your security coverage.
    Learn More
  • 5
    Tx2Px is an open source program for rendering text unreadable with OCR programs. Such text may be used on web pages for verification purposes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Open Bangla OCR - A BDOSDN (Bangladesh Open Source Development Network) project to develop a Bangla OCR
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Joshi is a program that tries to recognize shapes on an image with a focus on OCR. It converts the image to vector graphics (polylines) and then tries to project these on stored vector graphics, calculating the best match.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Conjecture is a modular, extensible, open-source C++ framework for Optical Character Recognition (OCR). It is not a single OCR, but rather an extensible collection of OCRs that can be explored, compared, extended and modified within a unified environment
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    DigitalEyes is an OCR (Optical Character Recognizer) developed in C/Caml released under GNU GPL by SuM42, as a sophomore project in EPITA.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Nonprofit Budgeting Software Icon
    Nonprofit Budgeting Software

    Martus Solutions provides seamless budgeting, reporting, and forecasting tools that integrate with accounting systems for real-time financial insights

    Martus' collaborative and easy-to-use budgeting and reporting platform will save you hundreds of hours each year. It's designed to make the entire budgeting process easier and create unlimited financial transparency.
    Learn More
  • 10
    Akshara Malayalam OCR is a project for the development of an OCR for printed and handwritten documents in Malayalam language. The inspiration is from similar OCR softwares in other languages etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Tesseract OCR

    Commercial quality OCR.

    A commercial quality OCR engine originally developed at HP between 1985 and 1995. In 1995, this engine was among the top 3 evaluated by UNLV. It was open-sourced by HP and UNLV in 2005. (NOTE: We're migrating to code.google.com. Please see the forums.)
    Downloads: 10 This Week
    Last Update:
    See Project
  • 12
    Classnotes is an OCR intended to translate handwritten scans into text. In order for the program to translate the scans the user must create a handwriting profile by training the OCR with scans.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Optical Character Recognition (OCR) utility
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    An optical character recognition filter for use with postifix or sendmail as a milter. The filter focuses on only processing images that it absolutely has to, this is to conserve computing power and not allow the spammer to use to many resources. Feature
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Waygoer is an OCR program. It is based on contour extraction and momentum transformations. This allows for rotation- and scale-invariant recognition. Waygoer is still at an early stage of development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Handwriting recognition and OCR in Indic languages
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Got any emails with obnoxious inline text? Long text stories with bad formatting? Files that an OCR didn't quite translate right? RTF format files and no easy way to read or modify them? Then eBookFormatter is for you!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Artificial vision library. Objectives are to make an OCR, fingerprint and face identification as some applications through a general purpose learning and pattern relationships algorithm (Currently performs very basic identification).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    An omnifont OCR software for KDE. Due to the fact that each step of the OCR process can be visualized you can get a quick idea of how OCR works and where the problems lie. However the program may be of minor/no use for end users in its current state.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Software to fit whole-sentence language models using the principle of maximum entropy. For developers of speech recognizers, text prediction interfaces, OCR, machine translation software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    OOCR is a open source character recognition program, it is used to convert images to editable text.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Tifftool is a high-performance tool to clean scanned documents in preparation for onscreen display or for OCR. Features include skew correction, orientation correction, despeckle, page alignment, split pages and batch processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Primary goal of Imated is development of handwritten/machine printed - OCR system. And second goal is development text editor, that will be in a position to import scanned documents OCR them on-the-fly, edit them and print/save as a picture again.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    It's a tool who shows the concepts of a type of neuronal networks (multi-layers percetron). It's not a real ocr, it's just a little didactical application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    OpenOCR will be a commercial quality ocr engine with tools for pre- and post-processing of images and resulting text.
    Downloads: 0 This Week
    Last Update:
    See Project