Showing 210 open source projects for "ocr application python"

View related business solutions
  • Context for your AI agents Icon
    Context for your AI agents

    Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

    Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
    Try for free
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 1

    PDF2EpubMaker

    Convert PDF to epub by OCR

    Qt Application to convert PDF in EPub format with several step : - convert PDF to png with libpoppler - convert pnf to txt by libtesseract - suppress hyphenate - spell checkinng
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    CTS Surveyor

    CTS Surveyor

    Foot traffic and facial analytics for your business and home

    Surveyor is a software solution that monitors its environment via camera and gathers demographic information about the public in the surrounding area, providing important statistics such as number of people passing by as well as providing facial analytics to classify the pedestrians based on their age and gender. The statistical data is stored in a local database and is made available via RESTful API’s, and easy integration with other applications can be accomplished via a WebSocket...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3

    OCR Template Creator

    Create template for images or PDF files to be OCR'ed and databased

    An OCR application which allows for tag/value templates to be created with a Web GUI for handling the automatic processing of PDF's or images of documents/receipts/contracts etc. Interface to Tesseract
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Pragmatic AI

    Pragmatic AI

    [Book-2019] Pragmatic AI: An Introduction to Cloud-based ML

    Pragmatic AI is the first truly practical guide to solving real-world problems with contemporary machine learning, artificial intelligence, and cloud computing tools. Writing for business professionals, decision-makers, and students who aren’t professional data scientists, Noah Gift demystifies all the tools and technologies you need to get results. He illuminates powerful off-the-shelf cloud-based solutions from Google, Amazon, and Microsoft, as well as accessible techniques using Python...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 5

    Face Recognition

    World's simplest facial recognition api for Python & the command line

    Face Recognition is the world's simplest face recognition library. It allows you to recognize and manipulate faces from Python or from the command line using dlib's (a C++ toolkit containing machine learning algorithms and tools) state-of-the-art face recognition built with deep learning. Face Recognition is highly accurate and is able to do a number of things. It can find faces in pictures, manipulate facial features in pictures, identify faces in pictures, and do face recognition on a...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    MDictate

    MDictate

    Speech to text using python, pocketsphinx, ready to deploy

    Automated speech recognition software is extremely cumbersome. This project's aim is to incrementally improve the quality of an open-source and ready to deploy speech to text recognition system. Runs on Windows using the mdictate.exe, but the core workings are found in the mdictate.py script which should work on Windows/Linux/OS X. In version 1.0, we use pocketsphinx' default setup with a basic graphic interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Subtitle Workshop

    Subtitle Workshop

    Free subtitle editor

    Subtitle Workshop is a free application for creating, editing, and converting text-based subtitle files. It supports all the subtitle formats you need and has all the features you would want.
    Leader badge
    Downloads: 1,198 This Week
    Last Update:
    See Project
  • 8
    Lip Reading

    Lip Reading

    Cross Audio-Visual Recognition using 3D Architectures

    The input pipeline must be prepared by the users. This code is aimed to provide the implementation for Coupled 3D Convolutional Neural Networks for audio-visual matching. Lip-reading can be a specific application for this work. Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenarios. The approach of AVR systems is to leverage the...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    OCR Web based

    OCR Web based

    OCR web based for Browser Firefox & PC

    Optical Character Recognition in JS for Browser is based on ocrad.js. OCR for Browser is a free extension and You can use this application to extract text from any image you supply. Just upload your image files. OCR for Browser takes either a JPG, GIF, TIFF, BMP, PNG. ========= Get OCR for Android (Beta release) - https://play.google.com/store/apps/details?id=com.ulm.ocr ========= Add-on for Opera: http://bit.ly/1F0E0wP ========= Release 1.0.1 For safety reasons, I disabled the possibility to import an image from url. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Desktop and Mobile Device Management Software Icon
    Desktop and Mobile Device Management Software

    It's a modern take on desktop management that can be scaled as per organizational needs.

    Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
    Learn More
  • 10

    Lords Mobile Player Statistics

    Compare yourself with others by analyzing Lords Mobile Screenshots

    Lords Mobile Player Statistics (or short Lords Mobile Stats) is a Windows application that allows you to parse Lords Mobile Screenshots to extract player statistics of yourself and other players. After extracting the data you can compare players and view the data as a large sortable table. You can also export the results as text file or render the table as an image (for sharing with your guild for example). This project is still in a early development stage, please read the Wiki (see menu...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    cbrTekStraktor

    an application to automatically extract text from comic books.

    cbrTekStraktor is an application to automatically extract text from the text bubbles or speech balloons present in comic book reader files (CBR). Its prime goal is to perform analysis on the texts of comic books. cbrTekStraktor can however also be used for scanlation or similar purposes. The application also enables to manually define text areas in CBR files. The application comprises a simple graphical editor for further processing the extracted text. The text extraction is...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    RadicalSpam Virtual Appliance

    RadicalSpam Virtual Appliance

    Virtual Appliance of RadicalSpam

    RadicalSpam Virtual Appliance takes full solution of RadicalSpam Community Edition , pre-installed in a OVF virtual machine ( Open Virtual Format ) compatible with the best virtualization platforms on the market , including VMware ESX Server. More information : http://www.radical-spam.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    RadicalSpam

    RadicalSpam

    Open Source Anti-Spam and Anti-Virus Gateway

    RadicalSpam is a free and open source package distributed under GPL v2, including products such as Postfix, SpamAssassin Amavisd-new, Clamav, Razor, DCC, Postgrey, Bind; providing a secure SMTP relay, ready to use with linux and docker environement. More information : http://www.radical-spam.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Malware Analysis Network in Taiwan <Man in Taiwan, MiT> Welcome to contact us (TonTon@TWMAN.ORG) if you are interested in collaborating with us. This project is open source and distributed under the GNU General Public License version 3. Please feel free to add to or modify this source and propose changes or new converters. Developer & Copyrighted by : TonTon Hsien-De Huang Prompter: Jazz Yao-Tsung Wang, Figaro Chen-Ho Yang | Logo Desinger:Temaki Guo Community on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Hydroponic Automation Platform (HAPI)

    Hydroponic Automation Platform (HAPI)

    Technologies for automating food production on various scales

    The Hydroponic Automation Platform Initiative (HAPI) develops and provides hardware and software components for automating food production using hydroponic, aquaponics, and precision agriculture techniques. High-yield production in urban settings is one of the primary goals. Artifacts include hardware design (mainly Arduino-based), firmware, management software and reporting modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Eye is an experimental OCR (image-to-text) application.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18

    bnlviewer

    METS / ALTO viewer written in Java and Javascript

    The National library of Luxembourg's viewer for METS (http://www.loc.gov/standards/mets/) files with OCR files in the ALTO format. The viewer needs a tomcat application server to run in. It can be deployed so that it reads the METS files from a local folder. Its main use is for digitized newspapers and postcards but can be adapted to other METS profiles as well. The viewer can be seen in action at: http://www.eluxemburgensia.lu Other known users include: National library of Latvia (http://www.periodika.lv) University library of Belgrade (http://arhiva.unilib.rs/unilib/istorijskenovine/index.php?...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    Turn your scanner into a free document reader for invoices (e.g. for e-banking) with the help of tesseract-ocr available for many unix (and also windows) platforms.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20

    ProximityForest

    Efficient Approximate Nearest Neighbors for General Metric Spaces

    A proximity forest is a data structure that allows for efficient computation of approximate nearest neighbors of arbitrary data elements in a metric space. See: O'Hara and Draper, "Are You Using the Right Approximate Nearest Neighbor Algorithm?", WACV 2013 (best student paper award). One application of a ProximityForest is given in the following CVPR publication: Stephen O'Hara and Bruce A. Draper, "Scalable Action Recognition with a Subspace Forest," IEEE Conference on Computer...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    LynxSight Mobile

    LynxSight Mobile

    An OCR assistant for visually impaired people

    LynxSight mobile is an android application that serves as OCR assistant. Application scans pictures taken by camera for text and reads it to user. LynxSight mobile is designed for use by visually impaired people. It contains voice assistant, voice commands and simple UI to make using easier.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    AarTon
    AarTon is an automated text-to-speech application. It allows user to enter text in a web-based front-end and render these texts via a multi-channel sound card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Tesseract-gui
    Tessract-GUI is not a front-end for tesseract-ocr. It is just a graphical way to use it with simple image manipulation thru ImageMagick.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 24

    Smart Grid Co-Simulator

    Windows code with Omnet++ and OpenDSS simulator

    ...This is done through a software interface between the communication and power networks simulators. We have tested our co-simulation framework with a renewable energy integration application. * This work was partly supported by an Open Collaborative Research (OCR) project from IBM. The authors are thankful to Deva Seetharam, Koushik Das and their team at IBM, for their valuable suggestions and comments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    voicecommand

    voicecommand

    Run Bash commands using Google voice recognition

    This simple pygtk application uses ffmpeg and arecord; to record sound Google's unofficial text to speech service; to convert sound to text The python subprocess module to run the text as a shell command. The text to speech service used by this application is unofficial, and this program should therefore be considered a complete hack.
    Downloads: 0 This Week
    Last Update:
    See Project