Showing 13 open source projects for "x-art"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    HunyuanOCR

    HunyuanOCR

    OCR expert VLM powered by Hunyuan's native multimodal architecture

    ...It’s designed to unify the entire OCR pipeline, detection, recognition, layout parsing, information extraction, translation, and even subtitle or structured output generation, into a single model inference instead of a cascade of separate tools. Despite being fairly lightweight (about 1 billion parameters), it delivers state-of-the-art performance across a wide variety of OCR tasks, outperforming many traditional OCR systems and even other multimodal models on benchmark suites. HunyuanOCR handles complex documents: multi-column layouts, tables, mathematical formulas, mixed languages, handwritten or stylized fonts, receipts, tickets, and even video-frame subtitles. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    dots.ocr

    dots.ocr

    Multilingual Document Layout Parsing in a Single Vision-Language Model

    ...Unlike traditional OCR pipelines that rely on multiple specialized components, dots.ocr integrates these processes end-to-end, reducing error propagation and improving consistency across tasks. The model is designed to recognize virtually any human script, making it highly effective for global and low-resource language scenarios. It achieves state-of-the-art performance on document parsing benchmarks while maintaining a relatively compact model size, demonstrating efficiency without sacrificing accuracy. Beyond standard OCR tasks, it extends its capabilities to parse complex visual elements such as charts, diagrams, and web interfaces, converting them into structured outputs like SVG code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    LWOCR

    LWOCR

    LightWeight OCR

    ...Ideal for developers and users who only require basic image to text OCR functionality . Console only application (No GUI) USAGE: C:\Progs>LWOCR Usage: LWOCR.exe <image_path> [text_output] [options] Options: --brightness=X (-1.0 to 1.0, default 0.0) --contrast=X (0.0 to 5.0, default 1.0) --gamma=X (0.1 to 5.0, default 1.0) --digits Only output digits --save-image=path Save processed image Examples: LWOCR.exe image.png # Output to console LWOCR.exe image.png output.txt # Output to file LWOCR.exe image.png --brightness=0.2 # Adjust brightness LWOCR.exe image.png output.txt --save-image=processed.png Support: mrbeepbeepp@gmail.com `
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Dual Clip Translator
    Translation of Selected text or Clipboard contents powered by Google. HotKeys Paste/Change Text auto translated. View in Balloon/Window the result of translation, besides being sent to the clipboard. Screen Capture of Desktop/Game > OCR > Translated.
    Downloads: 30 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 5
    LayoutParser

    LayoutParser

    A Unified Toolkit for Deep Learning Based Document Image Analysis

    With the help of state-of-the-art deep learning models, Layout Parser enables extracting complicated document structures using only several lines of code. This method is also more robust and generalizable as no sophisticated rules are involved in this process. A complete instruction for installing the main Layout Parser library and auxiliary components. Learn how to load DL Layout models and use them for layout detection.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    PandaOCR

    PandaOCR

    Multifunctional OCR Image and Text Recognition

    ...The normal version will continue to be maintained, and all interfaces will be retained but no new functions will be added. The reason why the version number of the professional version starts from 5.x is that the normal version will be updated in the future, so a period of version number is reserved. You can continue to use the regular version for free as before, without worrying about deactivating the regular version after the launch of the professional version. If you have higher needs, you can try the professional version. You can also use the Baidu API interface without activation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Linux-Intelligent-Ocr-Solution

    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    ...list=PLn29o8rxtRe1zS1r2-yGm1DNMOZCgdU0i Tesseract Training Tutorial (beta) : https://www.youtube.com/watch?v=qLpCld4cdtk Source Code Github : https://github.com/Nalin-x-Linux/lios-3 Gitlab : https://gitlab.com/Nalin-x-Linux/lios-3 User guide is available in download page
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    cintruder

    cintruder

    CIntruder - OCR Bruteforcing Toolkit

    Captcha Intruder is an automatic pentesting tool to bypass captchas. -> CIntruder-v0.4 (.zip) -> md5 = 6326ab514e329e4ccd5e1533d5d53967 -> CIntruder-v0.4 (.tar.gz) ->md5 = 2256fccac505064f3b84ee2c43921a68 --------------------------------------------
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ...Essentially, pdfsandwich is a wrapper script which calls the following binaries: convert, unpaper, tesseract, gs, and hocr2pdf (if tesseract < 3.03). It is known to run on Unix systems and has been tested on Linux and MacOS X. It supports parallel processing on multiprocessor systems. In contrast to most competing sandwich programs, it performs preprocessing of the scanned images, such as de-skewing or removal of dark edges etc. For further information please read the manual: http://www.tobias-elze.de/pdfsandwich/index.html
    Leader badge
    Downloads: 289 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 10
    Vedvarsha is an application for 2 purposes: 1. Handwariting script recognition that extracts recognized letters into documents. 2. OCR (Optical Character Recogniton) that works only for non-cursive and isolated characters. It depends upon libsyntactic,
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Tested for Ubuntu Maverick - Create Audiobooks from eBooks, text or pictures. - Read eBooks or text aloud while scrolling through pages
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Artificial vision library. Objectives are to make an OCR, fingerprint and face identification as some applications through a general purpose learning and pattern relationships algorithm (Currently performs very basic identification).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    It's a tool who shows the concepts of a type of neuronal networks (multi-layers percetron). It's not a real ocr, it's just a little didactical application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next