Search Results for "document layout recognition"

Showing 89 open source projects for "document layout recognition"

View related business solutions
  • Red Hat Enterprise Linux on Microsoft Azure Icon
    Red Hat Enterprise Linux on Microsoft Azure

    Deploy Red Hat Enterprise Linux on Microsoft Azure for a secure, reliable, and scalable cloud environment, fully integrated with Microsoft services.

    Red Hat Enterprise Linux (RHEL) on Microsoft Azure provides a secure, reliable, and flexible foundation for your cloud infrastructure. Red Hat Enterprise Linux on Microsoft Azure is ideal for enterprises seeking to enhance their cloud environment with seamless integration, consistent performance, and comprehensive support.
  • All-in-One Payroll and HR Platform Icon
    All-in-One Payroll and HR Platform

    For small and mid-sized businesses that need a comprehensive payroll and HR solution with personalized support

    We design our technology to make workforce management easier. APS offers core HR, payroll, benefits administration, attendance, recruiting, employee onboarding, and more.
  • 1
    PDF4QT

    PDF4QT

    Open source PDF editor

    ... the license LGPLv3. The applications are primarily used by target users to view, edit, manipulate or compare PDF documents. Users can preview these applications in the screenshots section of this webpage. Basic browsing and lots of other functionalities, such as encryption, reading a document, verification of digital signatures, editing of annotations, searching for text using regular expressions, drawing pages into an image, and much more. Several plug-ins are available.
    Downloads: 37 This Week
    Last Update:
    See Project
  • 2
    EasyOCR

    EasyOCR

    Ready-to-use OCR with 80+ supported languages

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc. EasyOCR is a python module for extracting text from image. It is a general OCR that can read both natural scene text and dense text in document. We are currently supporting 80+ languages and expanding. Second-generation models: multiple times smaller size, multiple times faster inference, additional characters and comparable accuracy to the first...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    SiYuan

    SiYuan

    Build your eternal digital garden

    SiYuan is a local-first personal knowledge management system that supports complete offline use, as well as end-to-end encrypted synchronization. Fuse blocks, outlines, and bidirectional links to build your eternal digital garden. SiYuan, the only important core concept is Content block. The content block can be formed through the formatting format, so that we can organize our thoughts and knowledge at the block-level granularity, and it is also convenient for reading and outputting long...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    DocTR

    DocTR

    Library for OCR-related tasks powered by Deep Learning

    DocTR provides an easy and powerful way to extract valuable information from your documents. Seemlessly process documents for Natural Language Understanding tasks: we provide OCR predictors to parse textual information (localize and identify each word) from your documents. Robust 2-stage (detection + recognition) OCR predictors with pretrained parameters. User-friendly, 3 lines of code to load a document and extract text with a predictor. State-of-the-art performances on public document...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Digital Payments by Deluxe Payment Exchange Icon
    Digital Payments by Deluxe Payment Exchange

    A single integrated payables solution that takes manual payment processes out of the equation, helping reduce risk and cutting costs for your business

    Save time, money and your sanity. Deluxe Payment Exchange+ (DPX+) is our integrated payments solution that streamlines and automates your accounts payable (AP) disbursements. DPX+ ensures secure payments and offers suppliers alternate ways to receive funds, including mailed checks, ACH, virtual credit cards, debit cards, or eCheck payments. By simply integrating with your existing accounting software like QuickBooks®, you’ll implement efficient payment solutions for AP with ease—without costly development fees or untimely delays.
  • 5
    Frescobaldi

    Frescobaldi

    LilyPond sheet music text editor

    Frescobaldi is a free and open source LilyPond sheet music text editor. Designed to be powerful yet lightweight and easy-to-use, Frescobaldi offers great functionality and a host of useful features such as music view with advanced two-way Point & Click, Midi capturing to enter music, a Snippet Manager and many more. Frescobaldi is named after Girolamo Frescobaldi (1583-1643), an Italian composer of keyboard music in the late Renaissance and early Baroque period.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    Pix2Text

    Pix2Text

    Open-Source Python3 tool for recognizing layouts, tables, and math

    An Open-Source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported. Pix2Text (P2T) aims to be a free and open-source Python alternative to Mathpix, and it can already accomplish Mathpix's core functionality. Pix2Text (P2T) can recognize layouts, tables, images, text, and mathematical...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    pdfmake

    pdfmake

    Client/server side PDF printing in pure JavaScript

    Print PDFs directly in the browser or delegate it to your NodeJS backend. Use the same document definition in both cases. Forget about manual x, y calculations. Declare document structure and let pdfmake do the rest. Use paragraphs, columns, lists, tables, canvas, etc. Declare your own styles, use custom fonts, build a DSL and extend the framework. Provides a set of options to disable font layout cache and to control when pages are flushed to the output file. Pdfmake is runnable in browser...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Collabora Online

    Collabora Online

    Collabora Online is a collaborative online office suite

    Collabora Online is a powerful online office suite that you can integrate into your own infrastructure or access via one of our trusted hosting Partners. Your digital sovereignty is our priority. We provide you with all the tools to keep your data secure, without compromising on features. Collabora Online’s text document editor provides a true WYSIWYG editing experience, making visualizing your document layout incredibly easy. Open any document, add comments and track changes from anywhere...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    iText Core/Community

    iText Core/Community

    iText for .NET is the .NET version of the iText library

    ... of the output PDF files will be incorrectly displayed by the GitHub previewer, so be sure to download them to see the correct results. The iText Suite builds on over a decade of lessons learned from iText 5 (and iTextSharp) development. The latest release of iText, version 8, is a simpler, more performant and extensible library that is ready to handle the increased challenges of today's document workflows, one add-on at a time.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Discover Multiview ERP: The Financial Management Revolution Icon
    Discover Multiview ERP: The Financial Management Revolution

    Reclaim precious moments with loved ones while our robust cloud accounting software streamlines your financial processes.

    Built for growing businesses and well-established enterprises alike, Multiview is a highly scalable and robust ERP.
  • 10
    Leku

    Leku

    Map location picker component for Android

    Map location picker component for Android. Based on Google Maps. An alternative to Google Place Picker. Component library for Android that uses Google Maps and returns a latitude, longitude and an address based on the location picked with the Activity provided. Note that you have the voice_search_extra_language that is used for the language of the voice recognition. Replace it with the allowed voice recognition locale for your language. We encourage you to add these languages to this component...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    SILE

    SILE

    The SILE Typesetter — Simon’s Improved Layout Engine

    SILE is a typesetting system; its job is to produce beautiful printed documents. Conceptually, SILE is similar to TeX—from which it borrows some concepts and even syntax and algorithms—but the similarities end there. Rather than being a derivative of the TeX family SILE is a new typesetting and layout engine written from the ground up using modern technologies and borrowing some ideas from graphical systems such as InDesign.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    flair

    flair

    A very simple framework for state-of-the-art NLP

    A very simple framework for state-of-the-art NLP. Developed by Humboldt University of Berlin and friends. A powerful NLP library. Flair allows you to apply our state-of-the-art natural language processing (NLP) models to your text, such as named entity recognition (NER), sentiment analysis, part-of-speech tagging (PoS), special support for biomedical texts, sense disambiguation and classification, with support for a rapidly growing number of languages. A text embedding library. Flair has simple...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ShowDoc

    ShowDoc

    ShowDoc is a tool greatly applicable for an IT team to share documents

    When we take over a module or project which has been developed by others, we always feel crazy watching at the codes without notes or comments. Programmers often want others to write technical documents, but not by themselves. Writing technical documents need lots of time and works, such as it needs to consider how to handle the format and layout of the documents. Beside, the man who writes the documents has to think all kinds of non-technical details. For example, which catalog should...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Cleanlab

    Cleanlab

    The standard data-centric AI package for data quality and ML

    cleanlab helps you clean data and labels by automatically detecting issues in a ML dataset. To facilitate machine learning with messy, real-world data, this data-centric AI package uses your existing models to estimate dataset problems that can be fixed to train even better models. cleanlab cleans your data's labels via state-of-the-art confident learning algorithms, published in this paper and blog. See some of the datasets cleaned with cleanlab at labelerrors.com. This package helps you...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    HtmlSanitizer

    HtmlSanitizer

    Cleans HTML to avoid XSS attacks

    HtmlSanitizer is a .NET library for cleaning HTML fragments and documents from constructs that can lead to XSS attacks. It uses AngleSharp to parse, manipulate, and render HTML and CSS. Because HtmlSanitizer is based on a robust HTML parser it can also shield you from deliberate or accidental "tag poisoning" where invalid HTML in one fragment can corrupt the whole document leading to broken layout or style. In order to facilitate different use cases, HtmlSanitizer can be customized at several...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    mPDF

    mPDF

    PHP library generating PDF files from UTF-8 encoded HTML

    ..., DIV) including margins, borders, padding, line height, background colors, etc. Right-to-left languages are supported, with automatic detection of RTL characters within a document. Transposes tables, lists, text justification, and table cell alignment, as well as a full-text reversal for RTL characters. Automatically detects non-RTL characters and displays these in the original order.
    Downloads: 112 This Week
    Last Update:
    See Project
  • 17
    LayoutParser

    LayoutParser

    A Unified Toolkit for Deep Learning Based Document Image Analysis

    With the help of state-of-the-art deep learning models, Layout Parser enables extracting complicated document structures using only several lines of code. This method is also more robust and generalizable as no sophisticated rules are involved in this process. A complete instruction for installing the main Layout Parser library and auxiliary components. Learn how to load DL Layout models and use them for layout detection. The full list of layout models currently available in Layout Parser...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    OmegaT - multiplatform CAT tool

    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Leader badge
    Downloads: 1,228 This Week
    Last Update:
    See Project
  • 19
    Umi-OCR

    Umi-OCR

    Free OCR Software: No internet required, easy to use.

    Support screenshots/pasting/batch importing of images, paragraph layout/excluding watermarks, scanning/generating QR codes. No need for internet connection throughout the entire process, with built-in multi language recognition library. 支持截屏/粘贴/批量导入图片,支持段落排版/排除水印,扫描/生成二维码。全程无需联网,内置多国语言识别库。
    Leader badge
    Downloads: 365 This Week
    Last Update:
    See Project
  • 20
    easy-canvas

    easy-canvas

    easy-canvas is a powerful tool helps us easy to layout with canvas

    Use the render function to create a document flow in the canvas and quickly implement the layout. Support document flow, refer to the web, no need to set x, y, width and height. Compatible with applets and the web, no third-party dependencies. Support componentization, global components, and local components. High performance, scroll-view supports dirty rectangles, and only draws the visible part. Support element operation, similar to operation dom to modify document flow.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Msc-generator

    Msc-generator

    Draws signalling charts, block diagrams and graphs from text input.

    ... the appearance and has a rich feature set complete with detailed documentation. On Windows, you can embed the charts in a document or presentation and simply double-click it in Office to edit them. On Linux and the Mac, a command-line version is available, and a GUI, as well. A .deb package is available starting from Debian Bookworm (currently testing) and Ubuntu Jammy Jellyfish (22.04) from the official repositories. For older releases see the Wiki. A Mac homebrew package is available.
    Leader badge
    Downloads: 46 This Week
    Last Update:
    See Project
  • 22
    EspoCRM - Open Source CRM

    EspoCRM - Open Source CRM

    Moving in the right direction together!

    EspoCRM software is that it’s fully customizable. We strive to create a solution that fits different business and industry needs, without having to rely on a “one size fits all” approach or make you spend a fortune on customization. Demo: https://www.espocrm.com/demo/ Installation: https://docs.espocrm.com/administration/installation/ Customer relationship management (CRM) software is developing every day due to ever-changing global business environment and rapid advances in...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 23
    eXo Platform - Digital Workplace

    eXo Platform - Digital Workplace

    The open-source digital workplace for growing teams and enterprises.

    eXo Platform is an open-source digital workplace solutions for growing teams and enterprises, featuring: ✅ Internal Communications ✅ Team Collaboration ✅ Knowledge Management ✅ Productivity and Employee Recognition use cases. eXo stands out by: 👍 its fluid and integrated employe experience, on desktop and mobile 👍 the platform’s ease of use 👍 innovative employee engagement features. eXo Platform is developed on open-source technology and supports open standards...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 24
    cerberuscms2

    cerberuscms2

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 25

    gsSetInitialView

    GUI to set PDF initial view using Ghostscript on Windows

    Free and open-source GUI application for setting the initial view property of a PDF document. The GUI allows for user selection of the initial view settings (navigation pane, page layout, zoom, etc.), generates a snippet of corresponding PostScript code (pdfmark), and then runs the Ghostscript command line program to apply the pdfmark to the PDF file. OS: Windows. Author: David King. License GPLv3.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next