Search Results for "document layout recognition"

Showing 19 open source projects for "document layout recognition"

1

EasyOCR

Ready-to-use OCR with 80+ supported languages

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic. EasyOCR is a Python module for extracting text from images. It is a general OCR that can read both natural scene text and dense text in documents. We are currently supporting 80+ languages and expanding. Second-generation models: multiple times smaller size, multiple times faster inference, additional characters and comparable accuracy to the first...

Downloads: 9 This Week

Last Update: 2023-09-04
See Project
2

DocTR

Library for OCR-related tasks powered by Deep Learning

DocTR provides an easy and powerful way to extract valuable information from your documents. Seamlessly process documents for Natural Language Understanding tasks: we provide OCR predictors to parse textual information (localize and identify each word) from your documents. Robust 2-stage (detection + recognition) OCR predictors with pretrained parameters. User-friendly: 3 lines of code to load a document and extract text with a predictor. State-of-the-art performance on public document...

Downloads: 3 This Week

Last Update: 2024-08-16
See Project
3

flair

A very simple framework for state-of-the-art NLP

A very simple framework for state-of-the-art NLP. Developed by Humboldt University of Berlin and friends. A powerful NLP library. Flair allows you to apply our state-of-the-art natural language processing (NLP) models to your text, such as named entity recognition (NER), sentiment analysis, part-of-speech tagging (PoS), special support for biomedical texts, sense disambiguation and classification, with support for a rapidly growing number of languages. A text embedding library. Flair has simple...

Downloads: 0 This Week

Last Update: 2024-07-31
See Project
4

Cleanlab

The standard data-centric AI package for data quality and ML

cleanlab helps you clean data and labels by automatically detecting issues in an ML dataset. To facilitate machine learning with messy, real-world data, this data-centric AI package uses your existing models to estimate dataset problems that can be fixed to train even better models. cleanlab cleans your data's labels via state-of-the-art confident learning algorithms, published in this paper and blog. See some of the datasets cleaned with cleanlab at labelerrors.com. This package helps you...

Downloads: 0 This Week

Last Update: 2024-06-25
See Project
5

LayoutParser

A Unified Toolkit for Deep Learning Based Document Image Analysis

With the help of state-of-the-art deep learning models, Layout Parser enables extracting complicated document structures using only a few lines of code. This method is also more robust and generalizable, as no sophisticated rules are involved in the process. Complete instructions for installing the main Layout Parser library and auxiliary components. Learn how to load DL Layout models and use them for layout detection. The full list of layout models currently available in Layout Parser...

Downloads: 0 This Week

Last Update: 2022-08-04
See Project
6

Umi-OCR

Free OCR Software: No internet required, easy to use.

Supports screenshots, pasting, and batch import of images; paragraph layout and watermark exclusion; and scanning and generating QR codes. No internet connection is needed at any point in the process, and a multi-language recognition library is built in.

Downloads: 365 This Week

Last Update: 2024-07-28
See Project
7

Common Resource Grep - crgrep

Common Resource Grep

CRGREP searches for matching text in databases, various document formats, archives and other difficult-to-access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, Maven dependencies and web resources. CRGREP will search resources within resources in any combination and to any depth, for example text within a document within a zip archive. Here you will find...

3 Reviews

Downloads: 4 This Week

Last Update: 2023-04-23
See Project
8

OpenKYC - FaceOnLive Community Project

FaceOnLive Open KYC: Streamlining Identity Verification with AI

Immerse yourself in the groundbreaking realm of the FaceOnLive Open KYC Project, a trailblazing endeavor at the forefront of redefining identity verification paradigms. With a commitment to leveraging the latest advancements in biometric technology, our platform presents a comprehensive solution encompassing cutting-edge features such as face recognition, face liveness detection, and ID document recognition. By seamlessly integrating these powerful tools, we empower businesses across...

149 Reviews

Downloads: 1 This Week

Last Update: 2024-04-02
See Project
9

DocWire SDK

Award-winning modern data processing in C++17/20

DocWire SDK, a standout C++17/20 data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. The upcoming integration of C++17 and C++20 will bring advanced functionalities, particularly in areas like HTTP capabilities and web data extraction. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document...

2 Reviews

Downloads: 1 This Week

Last Update: 2023-11-13
See Project
10

DynaQ

Innovative text document search. http://dynaq.opendfki.de for details.

The goal of DynaQ is to develop an inquiry system for exploring the personal information space, supporting you with the 'orienteering' search paradigm. DynaQ is a (desktop) search engine with enhanced functionality for file, email and blog search. See our GitLab homepage for source code and documentation: http://dynaq.opendfki.de

Downloads: 1 This Week

Last Update: 2021-08-05
See Project
11

NLP-progress

Repository to track the progress in Natural Language Processing (NLP)

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks. This document aims to track the progress in Natural Language Processing (NLP) and give an overview of the state-of-the-art (SOTA) across the most common NLP tasks and their corresponding datasets. It aims to cover both traditional and core NLP tasks such as dependency parsing and part-of-speech tagging as well as more recent ones...

Downloads: 0 This Week

Last Update: 2024-07-31
See Project
12

pdfsandwich

pdfsandwich generates "sandwich" OCR PDF files: PDF files that contain only images (and no editable text) are processed by optical character recognition (OCR), and the recognized text is added to each page invisibly "behind" the images. pdfsandwich is a command line tool intended for OCR of scanned books or journals. It is able to recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries...

8 Reviews

Downloads: 279 This Week

Last Update: 2018-08-12
See Project
13

Devanagari OCR

Devanagari Optical Character Recognition, Annotation tool

The project has source code and data related to the following tools: 1. Optical Character Recognition. Recognizes machine-printed Devanagari with or without a dictionary. 2. Document Image Analysis. Automatic page segmentation of document images in multiple Indian languages. Identifies pictures, lines, and words in a document scanned at 300 dpi. 3. Multi-lingual annotation. An interface with transliteration and a soft keyboard through which multiple languages can be input. The UI also...

1 Review

Downloads: 3 This Week

Last Update: 2019-07-25
See Project
14

Gamera

Gamera is a framework for the creation of structured document analysis applications by domain experts. It combines a programming library with GUI tools for the training and interactive development of recognition systems.

Downloads: 0 This Week

Last Update: 2016-05-11
See Project
15

libcrn

libcrn is a document image processing library written in C++11 for Linux, Windows, Mac OS X and Google Android. It is a toolbox that makes it easy to create software such as OCR engines and layout analysis tools.

Downloads: 0 This Week

Last Update: 2016-10-23
See Project
16

Extract Objects from Image

Connected Component Labeling Algorithm - Extracting Objects from an Image

A fast connected component labeling algorithm, implemented as a Java application, for extracting objects from an image.

Downloads: 0 This Week

Last Update: 2015-07-07
See Project
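
The listing does not show how this project implements connected component labeling, so the following is only a minimal Python sketch of the general technique, not the project's Java code. It assumes a binary image given as a list of rows of 0 (background) and 1 (foreground), uses 4-connectivity, and labels each region with a breadth-first flood fill. The function names `label_components` and `bounding_boxes` are hypothetical and chosen for illustration.

```python
from collections import deque


def label_components(grid):
    """Label 4-connected foreground regions of a binary grid.

    grid: list of rows, each a list of 0 (background) or 1 (foreground).
    Returns (labels, count): labels has the same shape as grid and holds
    0 for background or a component id in 1..count.
    """
    rows = len(grid)
    cols = len(grid[0]) if rows else 0
    labels = [[0] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            # Skip background pixels and pixels that already have a label.
            if grid[r][c] == 0 or labels[r][c] != 0:
                continue
            # New component found: flood-fill it breadth-first.
            count += 1
            labels[r][c] = count
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and grid[ny][nx] and labels[ny][nx] == 0:
                        labels[ny][nx] = count
                        queue.append((ny, nx))
    return labels, count


def bounding_boxes(labels):
    """Return {component id: (top, left, bottom, right)}, bounds inclusive."""
    boxes = {}
    for r, row in enumerate(labels):
        for c, lab in enumerate(row):
            if lab == 0:
                continue
            if lab not in boxes:
                boxes[lab] = (r, c, r, c)
            else:
                top, left, bottom, right = boxes[lab]
                boxes[lab] = (min(top, r), min(left, c),
                              max(bottom, r), max(right, c))
    return boxes


# Hypothetical 4x4 test image with three separate objects.
image = [
    [1, 1, 0, 0],
    [0, 1, 0, 1],
    [0, 0, 0, 1],
    [1, 0, 0, 0],
]
labels, count = label_components(image)
boxes = bounding_boxes(labels)
```

Each bounding box can then be used to crop one "object" out of the image. Switching to 8-connectivity would only require adding the four diagonal neighbours to the inner loop.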
17

KALIMAT Multipurpose Arabic Corpus

A corpus that could be of help for researchers working on Arabic NLP

KALIMAT: a Multipurpose Arabic Corpus. We are pleased to announce the immediate availability of KALIMAT 1.0. KALIMAT is an Arabic natural language resource that consists of: 1) 20,291 Arabic articles collected from the Omani newspaper Alwatan by (Abbas et al. 2011). 2) 20,291 Extractive Single-document system summaries. 3) 2,057 Extractive Multi-document system summaries. 4) 20,291 Named Entity Recognised articles. 5) 20,291 Part of Speech Tagged articles. 6) 20,291 Morphologically...

Downloads: 30 This Week

Last Update: 2015-04-09
See Project
18

Psaltiki Gamera Toolkit

A toolkit for the optical recognition of 19th-century Psaltiki music notation. It is based on and requires the Gamera document image analysis framework (http://gamera.sf.net).

Downloads: 0 This Week

Last Update: 2012-12-07
See Project
19

Socr3

Socr3 is a plugin-oriented, open source platform upon which I'm building an OCR suite. The name Socr3 stands for "Open Source Optical Character Recognition, Reading, Rendering, and Exporting", and is subject to change in the future.

Downloads: 0 This Week

Last Update: 2016-11-29
See Project
