Showing 18 open source projects for "character recognition code"

View related business solutions
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 1
    OCRmyPDF

    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 94 This Week
    Last Update:
    See Project
  • 2
    Unredact

    Unredact

    A simple tool for reading in poorly redacted documents

    Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and linguistic patterns to produce candidate reconstructions. It accepts a variety of input formats, automatically identifies redacted regions, and then generates text suggestions that are presented alongside visual overlays so users can choose or refine outputs.
    Downloads: 42 This Week
    Last Update:
    See Project
  • 3
    Rapid LaTeX OCR

    Rapid LaTeX OCR

    Formula recognition based on LaTeX-OCR and ONNXRuntime

    Formula recognition based on LaTeX-OCR and ONNXRuntime. rapid_latex_ocr is a tool to convert formula images to latex format. The reasoning code in the repo is modified from LaTeX-OCR, the model has all been converted to ONNX format, and the reasoning code has been simplified, Inference is faster and easier to deploy. The repo only has codes based on ONNXRuntime or OpenVINO inference in onnx format and does not contain training model codes.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    pytablewriter

    pytablewriter

    pytablewriter is a Python library to write a table in various formats

    pytablewriter is a Python library to write a table in various formats: AsciiDoc / CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pandas / Python / reStructuredText / SQLite / TOML / TSV / YAML.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Catch Bugs Before Your Customers Do Icon
    Catch Bugs Before Your Customers Do

    Real-time error alerts, performance insights, and anomaly detection across your full stack. Free 30-day trial.

    Move from alert to fix before users notice. AppSignal monitors errors, performance bottlenecks, host health, and uptime—all from one dashboard. Instant notifications on deployments, anomaly triggers for memory spikes or error surges, and seamless log management. Works out of the box with Rails, Django, Express, Phoenix, Next.js, and dozens more. Starts at $23/month with no hidden fees.
    Try AppSignal Free
  • 5
    latexcv

    latexcv

    A collection of cv and resume templates written in LaTeX

    A collection of user-friendly LaTeX CV and résumé templates (packaged within the R Markdown vitae ecosystem), offering simple themes and templates for creating professional CVs without heavy TeX coding. Supports multiple display themes such as classic, modern, sidebar layouts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    JSON implementation for Ruby

    JSON implementation for Ruby

    JSON implementation for Ruby

    ...You can think of it as a low-fat alternative to XML if you want to store data on disk or transmit it over a network rather than use a verbose markup language. Both variants of the JSON generator generate UTF-8 character sequences by default. If an:ascii_only option with a true value is given, they escape all non-ASCII and control characters with \uXXXX escape sequences, and support UTF-16 surrogate pairs in order to be able to generate the whole range of Unicode code points. All strings, that are to be encoded as JSON strings, should be UTF-8 byte sequences on the Ruby side. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    realwatermark

    A Python application to add watermarks (text or image) to PDF files

    A Python application to add watermarks (text or image) to PDF files, converts them into image and back to PDF with options for OCR and compression.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    IO91pm

    Command-line calculator for Maidenhead- (QTH-, QRA-) Locators.

    Keywords: Hamradio, Radio-amateurs, Maidenhead-, QTH-, (QRA-), Locator, Distance and Bearing Calculator, GNU/Linux command-line, LaTeX,- UTF-8-, ASCII-output, GPLv3; based on GM4ANB's book. "IO91pm is the Maidenhead-Locator for Maidenhead" ========================================= Background --------------- * For another little project, I wanted a QTH-Locator calculator for the UNIX (GNU/Linux, bash) command-line. * There are many, some very good, online QTH resources, but I wanted...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    HTMLViewer
    HTML Viewer Components for Delphi, Lazarus and C++Builder The HtmlViewer component set consists of the THtmlViewer, TFrameViewer, and TFrameBrowser components. All three are HTML document display components: THtmlViewer * The basic component. * THtmlViewer displays single (non-frame) documents. * It also forms the basis for the other two components. TFrameViewer * Displays both frame and single HTML documents. * TFrameViewer is oriented more for local file system...
    Leader badge
    Downloads: 71 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    Code Guide

    Code Guide

    Standards for developing consisten and sustainable HTML and CSS

    Code Guide is a set of standards for developing consistent, flexible, and sustainable HTML and CSS. It comes from years of experience writing code on projects of all sizes. It's not the end-all be-all, but it's a start. Don't capitalize tags, including the doctype. Use soft tabs with two spaces, they're the only way to guarantee code renders the same in any environment. Nested elements should be indented once (two spaces). Always use double quotes, never single quotes, on attributes. Don't...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    CONVERTCP

    CONVERTCP

    Text File Codepage Converter for the Windows command line

    This command line utility is a codepage converter to be used to change the character encoding of text. It fully supports charsets such as ANSI code pages, UTF-8, UTF-16 LE/BE, UTF-32 LE/BE, and EBCDIC. It's designed to convert big text files, too. It runs on Windows XP onwards (tested on XP, Windows 7, Windows 8.1, Windows 10, and Windows 11). The "readme.txt" file and the Wiki gives you some more information.
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    JSONCrush

    JSONCrush

    Compress JSON into URL friendly strings

    This simple system allows for excellent compression of uri encoded JSON strings using the JSCrush algorithm.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    he

    he

    A robust HTML entity encoder/decoder written in JavaScript

    he is a JavaScript library that provides robust HTML entity encoding and decoding, with full Unicode support. It supports all standardized named character references (e.g., ©, —), handles numeric and hex entities, and deals properly with astral Unicode symbols (i.e., code points outside the BMP). The library is designed so that he.decode(input) will safely convert HTML-entity encoded strings into proper Unicode text, and he.encode(text, options) will encode non-ASCII or special characters into safe entity references. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Highlight

    Highlight

    Source code to formatted text converter

    Highlight converts source code to HTML, XHTML, RTF, ODT, LaTeX, TeX, SVG, BBCode, Pango markup, and terminal escape sequences with colored syntax highlighting. Language definitions and color themes are customizable. Highlight was designed to offer a flexible but easy-to-use syntax highlighter for several output formats. No syntax or coloring information is hardcoded, instead all relevant data is stored in configuration scripts. These Lua scripts may be altered and enhanced with plug-in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    MathOCR

    MathOCR

    A scientific document recognition system

    MathOCR is a printed scientific document recognition system. MathOCR is still in the pre-alpha stage, recognition result may not be good enough for practical purposes. MathOCR is a printed scientific document recognition system written in pure Java. MathOCR has the functionality of image preprocessing, layout analysis and character recognition, especially the ability to recognize mathematical expression.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    MPPerl::Convert::ASCII85

    MPPerl::Convert::ASCII85

    Convert to/from ASCII base-85 encoded text

    This module provides functions for conversion between binary data and ASCII base-85 encoded text as described by Adobe in the PostScript Language Reference. There is also a non‐standard mode that encodes a group of four consecutive ASCII SPACE characters as a single ‘y’ character, and omits the ‘~>’ EOD marker. Please note that this module is written in pure Perl, and is as such very slow. A better alternative is to use the MPPerl::Convert::ASCII85::XS module, in which the most time-critical code has been written in C. It can be found at https://sourceforge.net/projects/mpperl-convert-ascii85-xs/.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    UniCode C++ library, a replacement for libunicode. This library allows to use / create multiple code pages, convert to HTML, using the latest UniCode version as reference. Max flexibility and re-usability. Cross-platform applications published: xpfweb
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Easily produce custom XML from Java code with validity checking and namespace support. Character stream and SAX event outputs are supported now. More can be added via a plug-in interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB