Showing 27 open source projects for "simple-xml"

View related business solutions
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    Tesseract.js

    Tesseract.js

    A pure Javascript Multilingual OCR

    Tesseract.js is a pure Javascript port of the popular Tesseract OCR engine. Tesseract.js' library supports more than 100 languages, automatic text orientation and script detection, a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser and on a server with NodeJS. Tesseract.js is a javascript library that gets words in almost any spoken language out of images. The main Tesseract.js functions (ex. recognize, detect) take an image parameter, which should be something that is like an image. ...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    GLM-OCR

    GLM-OCR

    Accurate × Fast × Comprehensive

    ...The model’s multimodal capabilities allow it to reason across image and text content holistically, capturing structured and unstructured information from pages that include dense tables, seals, code snippets, and varied document graphics. GLM-OCR integrates a comprehensive SDK and inference toolchain that makes it easy for developers to install, invoke, and embed into production pipelines with simple commands or APIs.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Papermerge

    Papermerge

    Open Source Document Management System for Digital Archives

    Papermerge is an open source document management system (DMS) primarily designed for archiving and retrieving your digital documents. Instead of having piles of paper documents all over your desk, office or drawers - you can quickly scan them and configure your scanner to directly upload to Papermerge DMS. Store, organize and index scanned documents in PDF, JPEG and TIFF formats. Instantly find relevant information using full text, tags and metadata-based search. Papermerge is free and...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    DeepDetect

    DeepDetect

    Deep Learning API and Server in C++14 support for Caffe, PyTorch

    ...While the Open Source Deep Learning Server is the core element, with REST API, and multi-platform support that allows training & inference everywhere, the Deep Learning Platform allows higher level management for training neural network models and using them as if they were simple code snippets. Ready for applications of image tagging, object detection, segmentation, OCR, Audio, Video, Text classification, CSV for tabular data and time series. Neural network templates for the most effective architectures for GPU, CPU, and Embedded devices. Training in a few hours and with small data thanks to 25+ pre-trained models. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 5
    Scribe.js

    Scribe.js

    JavaScript OCR and text extraction for images and PDFs

    ...The library can take image files (such as PNG or JPEG) and recognize the text they contain, and it can also extract text from PDF files that either already contain text or are image-based scans, using modern web standards and WebAssembly under the hood. In addition to simple text extraction, Scribe.js supports writing or injecting a high-quality invisible text layer back into PDFs, effectively making them searchable and improving usability for indexing or accessibility. It is written in modern ECMAScript Modules (ESM), so it can be imported in both browser and Node.js environments without a build step, though browser usage requires same-origin hosting of the files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    OpenKM Document Management - DMS

    OpenKM Document Management - DMS

    Document Management System and Content Management System

    OpenKM Community Edition is a free Document Management System (DMS) that helps businesses control the production, storage, management and distribution of electronic documents, boosting effectiveness and productivity. It integrates document management, collaboration and advanced search into one easy-to-use solution, including administration tools for user roles, access control, security levels, activity logs and automation setup. With OpenKM Community Edition you can: Collect information...
    Leader badge
    Downloads: 282 This Week
    Last Update:
    See Project
  • 7
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    LWOCR

    LWOCR

    LightWeight OCR

    A lightweight and simple command‑line OCR tool for extracting text from images. Ideal for developers and users who only require basic image to text OCR functionality . Console only application (No GUI) USAGE: C:\Progs>LWOCR Usage: LWOCR.exe <image_path> [text_output] [options] Options: --brightness=X (-1.0 to 1.0, default 0.0) --contrast=X (0.0 to 5.0, default 1.0) --gamma=X (0.1 to 5.0, default 1.0) --digits Only output digits --save-image=path Save processed image Examples: LWOCR.exe image.png # Output to console LWOCR.exe image.png output.txt # Output to file LWOCR.exe image.png --brightness=0.2 # Adjust brightness LWOCR.exe image.png output.txt --save-image=processed.png Support: mrbeepbeepp@gmail.com `
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Chalk

    Chalk

    All-In-One Digital Teaching Assistant

    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    e-Dokyumento

    e-Dokyumento

    e-Dokyumento is web-based Document Management System (DMS)

    e-Dokyumento is opensource web-based Document Management System (DMS) A Document Management which automates the basic office document workflow such as receiving, filing, routing, and approving through capturing (scanning), digitizing (OCR Reading), storing, tagging, and electronically routing and approving (e-signature) of electronic documents. # Demo : https://e-dokyumento.herokuapp.com/ https://edokyu.seillig.com/ (refer to Readme.md for the...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Ozyr

    Ozyr

    Ozyr is a simple and easy to use OCR snipping tool

    Ozyr is a simple and easy to use OCR snipping tool to get text from images so you can copy and edit it. Source Code: https://github.com/PETEROLO291/Ozyr Installer: 117MB Program: 524MB Version: 1.0
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    gImageReader

    gImageReader

    A graphical frontend to tesseract-ocr

    gImageReader is a simple Gtk/Qt front-end to tesseract. Features include: - Import PDF documents and images from disk, scanning devices, clipboard and screenshots - Process multiple images and documents in one go - Manual or automatic recognition area definition - Recognize to plain text or to hOCR documents - Recognized text displayed directly next to the image - Post-process the recognized text, including spellchecking - Generate PDF documents from hOCR documents **Note**: This page is only a mirror for the downloads. ...
    Leader badge
    Downloads: 143 This Week
    Last Update:
    See Project
  • 13
    Paperless-ng

    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    OCR Image Simply

    OCR Image Simply

    Simple Windows application to OCR images

    ...Languages recognized: German, English, French, Italian, Polish, Spanish Just download ZIP file Then unzip archive And feel free to use everywhere - Solution published under MIT license Description can be found at: https://coolautomations.com/ocr-as-simple-as-it-can-be/
    Downloads: 9 This Week
    Last Update:
    See Project
  • 15

    cbrTekStraktor

    an application to automatically extract text from comic books.

    ...Its prime goal is to perform analysis on the texts of comic books. cbrTekStraktor can however also be used for scanlation or similar purposes. The application also enables to manually define text areas in CBR files. The application comprises a simple graphical editor for further processing the extracted text. The text extraction is achieved by a combination of statistical and graphical processing operations. It is based on the following 3 major algorithms - Binarization of color images (Niblak and other methods) - Connected components - K-Means clustering Apache Tesseract is used to perform Optical Character Recognition on the extracted text. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16

    HRCloud2

    A full-featured home hosted Cloud Drive, Personal Assistant, App Launc

    HRCloud2 - A fully featured home-hosted Cloud drive and personal assistant that allows users to create and manage user accounts for friends and family, access files from anywhere, convert files and archives to other formats, perform simple image resizing and editing, stream media, create playlists, search for files, OCR images and PDF's, share files with friends and more! Building off the observed industry standards for commercial Cloud storage, HRCloud2 protects server permission levels, hashes sensitive data, enforces API security policies, and can even scan itself and it's controlled directories with ClamAV for security. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    EliteOCR

    EliteOCR

    OCR tool for market screenshots in Elite: Dangerous

    EliteOCR allows you to OCR market screenshots from Elite: Dangerous and export the data to various formats and services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    WebDjVuTextEd

    Edit the OCR text layer of DjVu documents in a web browser

    ...You can modify the structure (paragraphs, lines, words...) create, delete, edit text nodes, modify their container box by mouse, and run a spellchecker. The program does not directly read the DjVu files, it requires exported XML text data and images. When using without a webserver, you can open and save local files, but cannot take advantages of auto-save and spell checking. Note that current SVN version has much more features than V1.0!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Vision2u

    Vision2u

    free image processing software

    Vision2u offers a free image processing software for personal use and research. Primary tasks of the image processing can be realized during simple operation of the software. Every Web cam owner can have simplest measuring, counting or tasks of monitoring done without high capital outlays.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    LynxSight Mobile

    LynxSight Mobile

    An OCR assistant for visually impaired people

    ...Application scans pictures taken by camera for text and reads it to user. LynxSight mobile is designed for use by visually impaired people. It contains voice assistant, voice commands and simple UI to make using easier.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Tesseract-gui
    Tessract-GUI is not a front-end for tesseract-ocr. It is just a graphical way to use it with simple image manipulation thru ImageMagick.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    For several purposes of our OCR research we needed a converter that can generate an image from any given Bangla/ Bengali text. One of the reasons was to prepare artificial training and testing data. For this reason we finally created a very simple text to image converter. A Text to Image Converter. Systrem Requirement: .net Framework 4.0 and up 1. Select your Font Type, style and size. 2. First Browse your file, which will contain your text (Supporetd File Type .txt) 3. Select Single or Multi Line option. 4. Select Your output image type (Supported image type .bmp, .jpg and .png.). 5. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    alto_search

    Search full text in ALTO xml and return context

    ALTO files are XML files that store the output of OCR. Alto_search searches for terms in these ALTO files. A term can be several words and multiple terms can be searched at the same time. The output is in XML format and contains the coordinates of the words that were found as well as textual context around the hits.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Simple Document management software written in PHP for linux-operating-systems supporting Bank-Account-Linking and OCR
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ocrlib
    OCR c++ library of computer optical recognition methods. In library: contour recognition; contour vectorisation; matrix letters feature recognition; web based GUI; assembler core on SS3 instruction; xml support; detect page rotation and segmentation;
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo