Showing 19 open source projects for "tool"

  • 1
    MinerU

    A high-quality tool for converting PDF to Markdown and JSON

    MinerU is an open-source, high-quality document extraction toolkit focused on converting PDFs (and other document formats) into structured Markdown and JSON. It leverages OCR and layout analysis to preserve semantic structure and metadata, ideal for research and data science workflows.
    Downloads: 7 This Week
  • 2
    Umi-OCR

    OCR software, free and offline

    Umi-OCR is a free and open-source optical character recognition (OCR) tool designed to provide fast, offline text extraction from images, screenshots, PDFs, and more without requiring a network connection. It includes a highly efficient offline OCR engine with built-in multilingual recognition libraries, so users can extract text across multiple languages with high accuracy directly on their machines. The software supports flexible usage patterns including screenshot capture OCR, batch processing of large sets of images or documents, PDF parsing, QR code detection, and layout-aware paragraph output. ...
    Downloads: 48 This Week
  • 3
    DeepSeek-OCR

    Contexts Optical Compression

    DeepSeek-OCR is an open-source optical character recognition solution built as part of the broader DeepSeek AI vision-language ecosystem. It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body...
    Downloads: 6 This Week
  • 4
    Hathi Download Helper

    Download books from the hathitrust website in a fast and easy manner

    ...https://sourceforge.net/p/hathidownloadhelper/alternative/

    Hathi Download Helper was a tool for downloading public domain books from hathitrust.org. E-mail contact: hathidownloadhelper@hotmail.com
    Downloads: 21 This Week
  • 5
    LWOCR

    LightWeight OCR

    A lightweight and simple command-line OCR tool for extracting text from images. Ideal for developers and users who only require basic image-to-text OCR functionality. Console-only application (no GUI).

    USAGE:
        C:\Progs>LWOCR
        Usage: LWOCR.exe <image_path> [text_output] [options]

    Options:
        --brightness=X      (-1.0 to 1.0, default 0.0)
        --contrast=X        (0.0 to 5.0, default 1.0)
        --gamma=X           (0.1 to 5.0, default 1.0)
        --digits            Only output digits
        --save-image=path   Save processed image

    Examples:
        LWOCR.exe image.png                                      # Output to console
        LWOCR.exe image.png output.txt                           # Output to file
        LWOCR.exe image.png --brightness=0.2                     # Adjust brightness
        LWOCR.exe image.png output.txt --save-image=processed.png

    Support: mrbeepbeepp@gmail.com
    Downloads: 0 This Week
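    The LWOCR usage text above can be turned into a small command builder for scripting. This is only a sketch: the helper name `build_lwocr_command` and the `exe` parameter are hypothetical, and the flags and value ranges are taken from the listing's usage text rather than verified against the program itself.

```python
from typing import List, Optional


def build_lwocr_command(
    image_path: str,
    text_output: Optional[str] = None,
    *,
    brightness: Optional[float] = None,
    contrast: Optional[float] = None,
    gamma: Optional[float] = None,
    digits: bool = False,
    save_image: Optional[str] = None,
    exe: str = "LWOCR.exe",  # assumption: on PATH, or pass a full path
) -> List[str]:
    """Build the argument list for one LWOCR call (hypothetical helper)."""

    def checked(name: str, value: float, low: float, high: float) -> str:
        # Ranges are the ones quoted in the usage text.
        if not low <= value <= high:
            raise ValueError(f"{name} must be between {low} and {high}, got {value}")
        return f"--{name}={value}"

    cmd = [exe, image_path]
    if text_output is not None:
        cmd.append(text_output)  # optional second positional argument
    if brightness is not None:
        cmd.append(checked("brightness", brightness, -1.0, 1.0))
    if contrast is not None:
        cmd.append(checked("contrast", contrast, 0.0, 5.0))
    if gamma is not None:
        cmd.append(checked("gamma", gamma, 0.1, 5.0))
    if digits:
        cmd.append("--digits")
    if save_image is not None:
        cmd.append(f"--save-image={save_image}")
    return cmd


if __name__ == "__main__":
    print(build_lwocr_command("image.png", "output.txt", brightness=0.2))
```

    On a machine where LWOCR is installed, the returned list can be passed to `subprocess.run(cmd, capture_output=True, text=True)`; validating the ranges up front avoids launching the program with values the usage text rules out.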
  • 6
    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20 AI-driven data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, enabling efficient text extraction, web data extraction, and document analysis. For businesses, it promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using AI.
    Downloads: 7 This Week
  • 7
    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media
    https://github.com/Mararsh/MyBox
    Self-contained packages need neither a Java environment nor installation. Jar packages need Java 16 or higher.
    Downloads: 0 This Week
  • 8

    Image To Text tools

    ITTT is a free tool designed to scan and extract text from images.

    Image To Text Tools is a 100% free, user-friendly tool designed to scan images and extract the text they contain into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.
    Downloads: 13 This Week
  • 9
    cintruder

    CIntruder - OCR Bruteforcing Toolkit

    Captcha Intruder is an automatic pentesting tool to bypass captchas.

    CIntruder-v0.4 (.zip): md5 = 6326ab514e329e4ccd5e1533d5d53967
    CIntruder-v0.4 (.tar.gz): md5 = 2256fccac505064f3b84ee2c43921a68
    Downloads: 0 This Week
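    The MD5 values published in the CIntruder entry can be checked against a downloaded archive before unpacking it. The sketch below uses Python's standard `hashlib`; the helper names and the local file name `CIntruder-v0.4.zip` are assumptions for illustration, and only the digest string itself comes from the listing.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_md5(path: str, expected_hex: str) -> bool:
    """Compare a file's MD5 with a published value, ignoring case and spaces."""
    return md5_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Hypothetical local file name; the digest is the one listed for the .zip.
    archive = "CIntruder-v0.4.zip"
    published = "6326ab514e329e4ccd5e1533d5d53967"
    try:
        print("checksum ok" if matches_md5(archive, published) else "checksum MISMATCH")
    except FileNotFoundError:
        print(f"{archive} not found; download it first")
```

    MD5 only guards against corrupted or incomplete downloads; it is not a strong defence against deliberate tampering.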
  • 10

    Devanagari OCR

    Devanagari Optical Character Recognition, Annotation tool

    The project has source code and data related to the following tools: 1. Optical Character Recognition. Recognize machine-printed Devanagari with or without a dictionary. 2. Document Image Analysis. Automatic page segmentation of document images in multiple Indian languages. Identifies pictures, lines, and words in a document scanned at 300 dpi. 3. Multi-lingual annotation. An interface that has transliteration and a soft keyboard with which multiple languages can be input....
    Downloads: 7 This Week
  • 11

    comic-translator

    A tool to translate comic books, based on the Russian project "Overlay"

    Based on the Russian project "Overlay", this tool helps translate comic books. Added functions include ZIP and RAR support, a color picker, and OCR, with more coming soon.
    Downloads: 6 This Week
  • 12
    EliteOCR

    OCR tool for market screenshots in Elite: Dangerous

    EliteOCR allows you to OCR market screenshots from Elite: Dangerous and export the data to various formats and services.
    Downloads: 4 This Week
  • 13
    Terese is a tool for proofreading OCR text. Terese tries to map the text back to the scanned image, and visually shows the differences. See the homepage for further details.
    Downloads: 0 This Week
  • 14

    Immutable Sparse Wave Trees (WaveTree)

    Realtime bigdata tool for bit strings up to 2^63 based on AVL forest

    Realtime bigdata tool at the bit level, based on an immutable AVL forest that can be run in memory or, in future versions, as a Merkle forest like a blockchain. The main object is a sparse bit string (Bits) that scales efficiently up to 2^63 bits and is normally compressed, since the forest has duplicated substrings. Bits objects support reading a bit, byte, short, int, or long (Java primitives) at any bit index in the 64-bit range.
    Downloads: 0 This Week
  • 15
    DrawPad

    Pattern recognition tool for images, PDFs, and handwriting

    An optical recognition tool that runs in the following three modes:
    1. Drawing Pad: the user draws a character and the tool recognizes which character it is.
    2. Image OCR: an image-based OCR tool that recognizes text and barcodes present in the image. It also supports saving the OCR output.
    3. PDF OCR: the advanced form of OCR, where the PDF is parsed into an image and OCR is run on that result.
    Downloads: 0 This Week
  • 16

    qt-box-editor

    QT4 editor of tesseract-ocr box files

    QT Box Editor is a tool for adjusting tesseract-ocr box files. The aim of this project is to provide an easy and efficient way of editing them regardless of file size.
    Downloads: 0 This Week
  • 17

    edocias

    Electronic Document Index And Search

    EDocIAS (Electronic Document Index And Search) is a PHP-based tool for indexing and searching files of various types. Third-party tools (tesseract, xpdf, etc.) can be configured to support any type of file.
    Downloads: 0 This Week
  • 18
    SubHub

    Post-OCR correction tool for SRT subtitles

    A post-OCR correction tool for SRT subtitle files.
    Downloads: 0 This Week
  • 19
    TCR Neuroph - Text Character Recognition
    TCR Neuroph - Text Character Recognition is a Java tool developed to recognize scanned text, using the Java neural network framework Neuroph.
    Downloads: 0 This Week