Showing 29 open source projects for "umi-ocr"

  • 1
    Video-subtitle-extractor

    A GUI tool for extracting hard-coded subtitles (hardsub) from videos

    Extracts hard-coded video subtitles and generates an srt file. There is no need to apply for a third-party API; text recognition runs locally. A deep learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files. It uses local OCR recognition, with no need to set up or call any API or to access online OCR services such as Baidu...
    Downloads: 41 This Week
  • 2
    Papermerge

    Open Source Document Management System for Digital Archives

    ...-source software, which means that transparency is the core value of our software development. Source code can be reviewed and improved by anyone from anywhere. Papermerge supports multiple users. Each user can be assigned different permissions to perform only a specific kind of action, e.g. view only documents from a specific folder. OCR technology is a vital part of Papermerge. It extracts text information from scanned documents and PDF, JPEG, and TIFF files.
    Downloads: 34 This Week
  • 3
    DocTR

    Library for OCR-related tasks powered by Deep Learning

    DocTR provides an easy and powerful way to extract valuable information from your documents. Seamlessly process documents for Natural Language Understanding tasks: we provide OCR predictors to parse textual information (localize and identify each word) from your documents. Robust 2-stage (detection + recognition) OCR predictors with pretrained parameters. User-friendly: 3 lines of code to load a document and extract text with a predictor. State-of-the-art performance on public document...
    Downloads: 0 This Week
  • 4
    Doctor Dok

    Doctor Dok is an AI based medical data framework

    ... - digitalized - accessible anywhere from mobile or desktop. Using AI, you can translate your health records into one of 50+ languages, making health services abroad more accessible. Doctor Dok uses AI to OCR even a barely readable photo of your health documents, then stores it in the cloud with a Zero Trust security architecture (nobody but you can decrypt the data).
    Downloads: 1 This Week
  • 5
    qiankun

    Blazing fast, simple and complete solution for micro frontends

    Qiankun enables you and your teams to build next-generation and enterprise-ready web applications leveraging Micro Frontends. It is inspired by and based on single-spa. Qiankun was birthed internally in our group during the time web app development by distributed teams had turned to complete chaos. We faced every problem the micro frontend was conceived to solve, so naturally, it became part of our solution. After solving these common problems of micro frontends and lots of polishing and...
    Downloads: 0 This Week
  • 6
    Ant Design Pro

    Out-of-box UI solution for enterprise applications

    ... knowledge, and understand umi and Ant Design. If you are a complete novice, it is recommended to read "Beginner's Need to Know" first. Sharpening the knife does not delay the woodcutting: knowing some basics can make the learning curve smoother. Create umi offers a wide range of options to meet the scaffolding needs of different users, and you can choose JavaScript or TypeScript as the development language.
    Downloads: 1 This Week
  • 7
    DocWire SDK

    Award-winning modern data processing in C++17/20

    DocWire SDK, a standout C++17/20 data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. The upcoming integration of C++17 and C++20 will bring advanced functionalities, particularly in areas like HTTP capabilities and web data extraction. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document...
    Downloads: 4 This Week
  • 8
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 3 This Week
  • 9
    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have to...
    Downloads: 0 This Week
  • 10
    TNN

    Uniform deep learning inference framework for mobile

    TNN, a high-performance, lightweight neural network inference framework open sourced by Tencent Youtu Lab. It also has many outstanding advantages such as cross-platform, high performance, model compression, and code tailoring. The TNN framework further strengthens the support and performance optimization of mobile devices on the basis of the original Rapidnet and ncnn frameworks. At the same time, it refers to the high performance and good scalability characteristics of the industry's...
    Downloads: 0 This Week
  • 11
    SwiftOCR

    Fast and simple OCR library written in Swift

    SwiftOCR is a fast and simple OCR library written in Swift. It uses a neural network for image recognition. As of now, SwiftOCR is optimized for recognizing short, one-line alphanumeric codes (e.g. DI4C9CM). We currently support iOS and OS X. If you want to recognize normal text like a poem or a news article, go with Tesseract, but if you want to recognize short, alphanumeric codes (e.g. gift cards), I would advise you to choose SwiftOCR because that's where it excels. Tesseract...
    Downloads: 0 This Week
  • 12
    Mob

    A handsome Himalayan desktop client for Mac, Win and Linux

    The official Himalaya PC client has been released. Mob (モブ), the male lead with a superpower of 100 (strongly recommended this time). Built on Electron, Umi, Dva, and Antd. Integrates the Himalaya podcast API, with multi-language support, custom styles, keyboard shortcut settings, download history, local music, play history, album reviews, and adding multiple tracks to lists.
    Downloads: 0 This Week
  • 13
    dva

    React and redux based, lightweight and elm-style framework

    React and redux-based, lightweight and elm-style framework. Lightweight front-end framework based on redux, redux-saga and react-router (inspired by elm and choo). Easy to learn, easy to use: only 6 APIs, very friendly to redux users, and the API count reduces to 0 when used with umi. Elm concepts: organize models with reducers, effects and subscriptions. HMR support: supports HMR for components, routes and models with babel-plugin-dva-hmr. Plugin system: e.g. we have dva-loading plugin to handle loading...
    Downloads: 0 This Week
  • 14
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions...
    Downloads: 451 This Week
  • 15
    pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which contain only images (but no editable text) will be processed by optical character recognition (OCR) and the text will be added to each page invisibly "behind" the images. pdfsandwich is a command line tool which is supposed to be useful to OCR scanned books or journals. It is able to recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries...
    Downloads: 324 This Week
  • 16
    A Java JNA wrapper for Tesseract OCR API
    Downloads: 154 This Week
  • 17
    nunn

    This is an implementation of a machine learning library in C++17

    nunn is a collection of ML algorithms and related examples written in modern C++17.
    Downloads: 0 This Week
  • 18
    Tesseract OCR iOS

    Tesseract OCR iOS is a Framework for iOS7+

    Use Tesseract OCR in iOS 9.0+ projects written in either Objective-C or Swift. Easy and fast. Tesseract OCR iOS and TesseractOCR.framework are distributed under the MIT license. These are the current versions of the upstream bundled libraries within the framework that this repository provides, Tesseract 3.03-rc1, Leptonica 1.72, Image libraries, Libtiff 4.0.4, Libpng 1.6.18, and Libjpeg 9a.
    Downloads: 0 This Week
  • 19
    OCR For Visually Challenged Person

    Provides a GUI for Tesseract OCR

    It converts a scanned image into text, braille and audio formats. The image should be scanned at at least 300 dpi for better accuracy.
    Downloads: 0 This Week
  • 20
    DJVU++

    The complete DjVu solution, with OCR technology (Arabic, English).

    DjVu++ is a user-friendly program for manipulating DjVu files such as eBooks, with plenty of editing features. The program offers a free replacement for the proprietary PDF format, with similar resolution and smaller file size. DjVu++ also supports OCR to handle text in scanned books and images. The program shows good performance for English, as well as for Arabic, where it aims to lead free and commercial software in this area. The main features of the DjVu++ program are: o...
    Downloads: 1 This Week
  • 21
    DjVuPlus

    DjVu document reader with OCR technology (Arabic, English), small size

    The DjVu Reference Library 3.5 was released by Lizardtech under the GNU General Public License version 2. DjVuLibre-3.5 was developed by Leon Bottou and others as a "Derived Work" of the DjVu Reference Library 3.5. As such, it is also subject to the GNU General Public License version 2. Several patents apply to two very specific aspects of DjVu and DjVuLibre. The patents cover a particular aspect of the ZP-coder (the arithmetic coder used in DjVu and implemented in libdjvu/ZPCodec.cpp)...
    Downloads: 3 This Week
  • 22

    Immutable Sparse Wave Trees (WaveTree)

    Realtime bigdata tool for bit strings up to 2^63 based on AVL forest

    Realtime bigdata tool at the bit level based on immutable AVL forest which can be run in memory or, in future versions, as a merkle forest like a blockchain. Main object is a sparse bit string (Bits) that efficiently scales up to 2^63 bits normally compressed as forest has duplicated substrings. Bits objects support reading bit, byte, short, int, or long (Java primitives) at any bit index in 64 bit range. Example: instead of building a class to hold a header and then data, represent all of...
    Downloads: 0 This Week
  • 23
    For several purposes of our OCR research we needed a converter that can generate an image from any given Bangla/Bengali text. One of the reasons was to prepare artificial training and testing data. For this reason we finally created a very simple text-to-image converter. System requirement: .NET Framework 4.0 and up. 1. Select your font type, style and size. 2. Browse to your file, which will contain your text (supported file type: .txt). 3. Select...
    Downloads: 2 This Week
  • 24
    Software library for performing Optical Character Recognition (OCR) targeting the .NET platform. It's ported from the JavaOCR project.
    Downloads: 0 This Week
  • 25
    Neuroph OCR - Handwriting Recognition
    Neuroph OCR - Handwriting Recognition is developed to recognize handwritten letters and characters. Its engine derives from the Java Neural Network Framework - Neuroph, and as such it can be used as a standalone project or a Neuroph plug-in.
    Downloads: 4 This Week