Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Formats and Protocols
Data Formats Software
Search Results

Search Results for "recognition"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 15
Windows 14
Mac 12
More...
BSD 4
ChromeOS 2
Mobile Operating Systems 1

Category

Formats and Protocols 17
- Data Formats 17
  - CSV 1
  - JSON 1
  - PDF 4
  - PostScript 1
  - TeX/LaTeX 4
  - XML 2
Artificial Intelligence 5
Multimedia 3
Scientific/Engineering 2
Software Development 2
Business 1
Internet 1
System 1
Text Editors 1

License

OSI-Approved Open Source 15
Public Domain 1

Translations

English 2
Brazilian Portuguese 1
Catalan 1
Chinese (Simplified) 1
More...
Chinese (Traditional) 1
French 1
German 1
Italian 1
Russian 1
Spanish 1
Ukrainian 1

Programming Language

Python 7
C++ 3
Java 2
TypeScript 2
More...
C 1
C# 1
Unix Shell 1

Status

Beta 2
Production/Stable 2
Alpha 1

Showing 17 open source projects for "recognition"

View related business solutions

Data Formats Clear Filters & Widen Search

$300 Free Credits to Build on Google Cloud
New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.

Claim $300 Free
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
1

Pix2Text

Open-Source Python3 tool for recognizing layouts, tables, and math

An Open-Source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported. Pix2Text (P2T) aims to be a free and open-source Python alternative to Mathpix, and it can already accomplish Mathpix's core functionality. Pix2Text (P2T) can recognize layouts, tables, images, text, and mathematical...

Downloads: 9 This Week

Last Update: 2026-02-07
See Project
2

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files

OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.

Downloads: 109 This Week

Last Update: 1 day ago
See Project
3
$Rapid LaTeX OCR$

Rapid LaTeX OCR

Formula recognition based on LaTeX-OCR and ONNXRuntime

Formula recognition based on LaTeX-OCR and ONNXRuntime. rapid_latex_ocr is a tool to convert formula images to latex format. The reasoning code in the repo is modified from LaTeX-OCR, the model has all been converted to ONNX format, and the reasoning code has been simplified, Inference is faster and easier to deploy. The repo only has codes based on ONNXRuntime or OpenVINO inference in onnx format and does not contain training model codes.

Downloads: 1 This Week

Last Update: 2024-11-03
See Project
4

pdfly

CLI tool to extract (meta)data from PDF and manipulate PDF files

A Python library designed for manipulating PDF files with functionalities for extraction, transformation, and document generation.

Downloads: 0 This Week

Last Update: 2025-10-13
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

JSON Hero

JSON Hero is an open-source, beautiful JSON explorer for the web

JSON Hero is a beautiful and powerful JSON viewer designed for developers who work with large and complex JSON files. It runs as a web-based interface (and as a standalone app) that provides semantic, interactive rendering of JSON content, helping users understand the structure and meaning of data at a glance. JSON Hero automatically detects data types such as URLs, dates, colors, and base64 images, and presents them in meaningful ways. It’s designed for productivity and readability, with...

Downloads: 6 This Week

Last Update: 2025-07-17
See Project
6

Unredact

A simple tool for reading in poorly redacted documents

Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and linguistic patterns to produce candidate reconstructions. It accepts a variety of input formats, automatically identifies redacted regions, and then generates text suggestions that are presented alongside visual overlays so users can choose or refine outputs.

Downloads: 5 This Week

Last Update: 2026-02-03
See Project
7

Google2SRT

Download, save and convert multiple subtitles from YouTube videos

Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type video's URL, Google2SRT will do the rest.

33 Reviews

Downloads: 26 This Week

Last Update: 2025-01-11
See Project
8

realwatermark

A Python application to add watermarks (text or image) to PDF files

A Python application to add watermarks (text or image) to PDF files, converts them into image and back to PDF with options for OCR and compression.

Downloads: 0 This Week

Last Update: 2025-01-27
See Project
9

SimpleXlsxWriter

C++ library for creating XLSX files for MS Excel 2007 and above.

This library represents XLSX files writer for Microsoft Excel 2007 and above. The main feature of this library is that it uses C++ standard file streams. On the one hand it results in almost unnoticeable memory and CPU resources consumption while processing (that may be very useful at saving a large data arrays), but on the other hand it makes unfeasible to edit data that were written. Hence, if using this library the structure of the future report should be known enough. The library...

9 Reviews

Downloads: 10 This Week

Last Update: 2023-04-24
See Project
Secure File Transfer for Windows with Cerberus by Redwood
Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.

Try for Free
10

Mozilla JPEG Encoder Project

Improved JPEG encoder

MozJPEG improves JPEG compression efficiency achieving higher visual quality and smaller file sizes at the same time. It is compatible with the JPEG standard, and the vast majority of the world's deployed JPEG decoders. MozJPEG is compatible with the libjpeg API and ABI. It is intended to be a drop-in replacement for libjpeg. MozJPEG is a strict superset of libjpeg-turbo's functionality. All MozJPEG's improvements can be disabled at run time, and in that case it behaves exactly like...

1 Review

Downloads: 4 This Week

Last Update: 2022-08-15
See Project
11

html2canvas

A JavaScript HTML screenshot renderer

html2canvas is a JavaScript HTML renderer. The script provides you with the tools to take screenshots of webpages directly on the browser. The screenshot is based on the DOM and therefore, it may not be 100% accurate to the real representation, given that it is not an actual screenshot, but a type of screenshot built based on the available data and information of the page. The script renders such page as a canvas image, by reading the DOM and the different styles of the featured elements. It...

Downloads: 10 This Week

Last Update: 2023-09-07
See Project
12

addcsv2ods

Adds csv files to a ods spreadsheet

This bash script adds several csv files into an existing .ods spreadsheet. Each csv file creates a sheet. If there is no input spreadsheet provided, it creates one and inserts all csv files in it.

Downloads: 1 This Week

Last Update: 2021-11-03
See Project
13

Budou

Budou is an auto organizer tool for beautiful line breaking in CJK

...The tool supports multiple segmentation backends, including Google Cloud Natural Language API, MeCab, and TinySegmenter, enabling flexibility for both cloud-based and offline processing. Budou can be used via command line, in Python scripts, or integrated into web applications, and it provides advanced options such as caching and entity recognition for improved segmentation accuracy.

Downloads: 0 This Week

Last Update: 2025-10-11
See Project
14

Highlight

Source code to formatted text converter

Highlight converts source code to HTML, XHTML, RTF, ODT, LaTeX, TeX, SVG, BBCode, Pango markup, and terminal escape sequences with colored syntax highlighting. Language definitions and color themes are customizable. Highlight was designed to offer a flexible but easy-to-use syntax highlighter for several output formats. No syntax or coloring information is hardcoded, instead all relevant data is stored in configuration scripts. These Lua scripts may be altered and enhanced with plug-in...

Downloads: 1 This Week

Last Update: 2024-05-15
See Project
15

DataLogChanger

an CSV / ASCII Data Log file converter.

... * in- and output file size is virtually unlimited * very fast (>100k lines per second processing speed) * Easy selection of source file name * Selectable destination filename suffix * Automatic header recognition * Automatic time offset recognition (e.g. SPICE only accepts poitive time values) * Selection of a single or all Columns * Selection of a time or line range to be converted * Reduction of time resolution (by skipping lines) * Separate Offset and Gain calculation for time and values

Downloads: 0 This Week

Last Update: 2017-08-09
See Project
16

MathOCR

A scientific document recognition system

MathOCR is a printed scientific document recognition system. MathOCR is still in the pre-alpha stage, recognition result may not be good enough for practical purposes. MathOCR is a printed scientific document recognition system written in pure Java. MathOCR has the functionality of image preprocessing, layout analysis and character recognition, especially the ability to recognize mathematical expression.

Downloads: 0 This Week

Last Update: 2024-05-16
See Project
17

Table Structure Recognition Library

A library to extract the contents of a table, from a textual input formatted in a tabular way.

Downloads: 0 This Week

Last Update: 2014-06-27
See Project

Previous
You're on page 1
Next

Related Searches

ocr

pdf

cjpeg

recognition

pdf to text

pdf ocr

umi-ocr

json viewer

srt to speech

formula

Related Categories

Formats and Protocols

Artificial Intelligence

Multimedia

Scientific/Engineering

Software Development

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise