Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "text batch processing tools"

x

Sort By:

Relevance

Clear All Filters

OS

Windows 400
Linux 333
Mac 295
More...
BSD 182
ChromeOS 152
Desktop Operating Systems 21
Server Operating Systems 7
Mobile Operating Systems 5
Embedded Operating Systems 2
Game Consoles 1

Category

Artificial Intelligence 127
Text Editors 123
Software Development 112
Multimedia 63
Business 40
Scientific/Engineering 32
System 28
Formats and Protocols 25
Internet 24
Education 20
Printing 7
Database 6
Communications 4
Desktop Environment 4
Games 3
Religion and Philosophy 2
Security 2
Terminals 2
Mobile 1

License

OSI-Approved Open Source 324
Creative Commons Attribution License 11
Other License 9
Public Domain 9
More...
GNU Free Documentation License 2

Translations

Programming Language

Status

Production/Stable 100
Beta 60
Alpha 27
Pre-Alpha 16
More...
Planning 11
Mature 10

Showing 400 open source projects for "text batch processing tools"

View related business solutions

Windows Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Stop Storing Third-Party Tokens in Your Database
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.

Try Auth0 for Free
1

FFmpeg Batch AV Converter

FFmpeg Batch AV Converter

FFmpeg Batch AV Converter is a graphical front-end for FFmpeg designed to simplify advanced multimedia processing through an intuitive interface while preserving full access to FFmpeg’s capabilities. It allows users to perform complex encoding, conversion, and editing operations using drag-and-drop workflows instead of command-line input. The application supports both single and batch processing, enabling users to handle large volumes of media files efficiently. ...

Downloads: 3 This Week

Last Update: 2026-04-24
See Project
2

Chandra

OCR model for complex documents with layout-aware structured outputs

...Chandra can be run locally using transformer-based inference or deployed with a high-performance server setup for large-scale processing. It also includes command-line tools and optional web-based interfaces to simplify interaction and batch processing workflows.

Downloads: 3 This Week

Last Update: 2026-03-18
See Project
3

text-extract-api

Document (PDF, Word, PPTX ...) extraction and parse API

...Instead of requiring developers to integrate multiple document parsing libraries individually, the system centralizes text extraction capabilities into a unified API that standardizes the output. The platform supports automated processing pipelines that detect file types and apply the appropriate extraction method to obtain the most accurate text representation possible. It can be integrated into document analysis systems, knowledge retrieval tools, and AI pipelines that rely on clean textual data. ...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
4

abogen

Generate audiobooks from EPUBs, PDFs and text with captions

abogen is a tool designed to generate audiobooks (or speech narrations) from textual sources such as EPUBs, PDFs, or plain text, with synchronized captions. In other words, it automates the pipeline of reading a digital book (or document), converting its text into speech via a TTS engine, and packaging the result into an audiobook format — likely along with timestamped captions or subtitles that align with the spoken audio. This can be very useful for accessibility, content consumption on...

Downloads: 4 This Week

Last Update: 2026-02-06
See Project
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
5

LLM-Aided OCR Project

Enhances Tesseract OCR output using LLMs (local or API)

...This AI-assisted correction process helps reconstruct missing characters, fix formatting mistakes, and produce more coherent text outputs. The project is particularly useful for digitizing historical documents, research papers, and scanned materials where traditional OCR often struggles. It also includes tools for processing batches of images or documents, enabling automated document digitization workflows.

Downloads: 0 This Week

Last Update: 2026-03-22
See Project
6

VSET

Based onVapoursynthGraphic video batch pressing processing tool

VSET is a multimedia toolkit focused on simplifying video processing tasks through structured workflows built on top of FFmpeg. It provides tools for tasks such as transcoding, clipping, and format conversion while abstracting complex command-line operations. The project is designed to help users automate repetitive media processing tasks with minimal configuration. It supports batch processing, enabling efficient handling of large numbers of media files. ...

Downloads: 0 This Week

Last Update: 2026-04-27
See Project
7

OpenMed

Open source healthcare AI

...OpenMed can be used in three main ways: as a simple Python API for scripts and notebooks, as a Docker-friendly FastAPI service for backend integration, and as a batch-processing system for multi-document workflows.

Downloads: 12 This Week

Last Update: 3 days ago
See Project
8

deepdoctection

A Repo For Document AI

DeepDoctection is a document AI framework that applies deep learning techniques to analyze and extract structured data from scanned documents, PDFs, and images. deepdoctection is a Python library that orchestrates document extraction and document layout analysis tasks using deep learning models. It does not implement models but enables you to build pipelines using highly acknowledged libraries for object detection, OCR and selected NLP tasks and provides an integrated frameworks for...

Downloads: 0 This Week

Last Update: 2026-05-15
See Project
9

AionUi

Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex

...It enhances productivity by offering smart file management features like batch renaming, automatic organization, and intelligent file classification, thereby reducing manual overhead when working with large datasets or complex document structures. AionUi also supports a remote WebUI mode, allowing users to access their local AI tools securely over a network from other devices while keeping all processing and data on their own hardware.

Downloads: 52 This Week

Last Update: 17 hours ago
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

StaxRip

Video encoding GUI for Windows

...Because StaxRip automates the invocation of complex command-line tools via a GUI, it lowers the barrier for less technical users while offering advanced configuration for experts. The tool supports batch processing, hardware acceleration (where supported), scripting, and automation through PowerShell or built-in project templates, making it versatile for single-file conversions.

62 Reviews

Downloads: 39 This Week

Last Update: 2026-04-09
See Project
11

Zerox OCR

PDF to Markdown with vision models

A dead simple way of OCR-ing a document for AI ingestion. Documents are meant to be a visual representation after all. With weird layouts, tables, charts, etc. The vision models just make sense. ZeroX is an open-source machine learning framework designed for fast experimentation and production deployment, optimized for speed and ease of use.

Downloads: 0 This Week

Last Update: 2024-12-18
See Project
12

Shutter Encoder

A professional video compression tool accessible to all

Shutter Encoder is a cross-platform video and audio processing application designed to provide professional-grade encoding and conversion tools through an accessible graphical interface. Built primarily on FFmpeg, it offers a wide range of media operations including transcoding, compression, format conversion, and editing. The software supports numerous codecs and formats, enabling users to prepare media for broadcasting, streaming, or archiving.

Downloads: 11 This Week

Last Update: 2026-05-04
See Project
13

FFBox

A multimedia transcoded treasure chest / a FFmpeg case

FFBox is a graphical multimedia processing application that provides an accessible interface for working with FFmpeg operations such as encoding, conversion, and editing. It allows users to perform tasks like trimming, merging, and compressing media files without using command-line tools. The software supports a wide range of audio and video formats, making it suitable for diverse media workflows.

Downloads: 0 This Week

Last Update: 2026-04-30
See Project
14

AUTOMATIC1111 Stable Diffusion web UI

Stable Diffusion web UI

...The interface also supports prompt editing, batch processing, custom scripts, and many community extensions, making it a highly customizable and continually evolving platform for creative AI art generation.

1 Review

Downloads: 171 This Week

Last Update: 2025-06-02
See Project
15

CompressO

Convert any video/image into a tiny size. 100% free & open-source

compressO is a cross-platform, open-source multimedia compression application designed to reduce the size of videos and images while preserving visual quality. Built using modern frameworks such as Rust and Tauri, it runs locally on the user’s machine, ensuring fast performance and complete privacy without requiring cloud processing. The application supports a variety of media formats and provides controls for adjusting compression levels, resolution, and output quality. In addition to...

Downloads: 14 This Week

Last Update: 2026-04-24
See Project
16

Lesan

New way to create web server and NoSQL data model

Lesan is a multilingual text processing and translation library designed for natural language processing (NLP) applications. It provides tools for text normalization, tokenization, and translation across multiple languages.

Downloads: 0 This Week

Last Update: 2026-04-18
See Project
17

Umi-OCR

OCR software, free and offline

Umi-OCR is a free and open-source optical character recognition (OCR) tool designed to provide fast, offline text extraction from images, screenshots, PDFs, and more without requiring a network connection. It includes a highly efficient offline OCR engine with built-in multilingual recognition libraries, so users can extract text across multiple languages with high accuracy directly on their machines. The software supports flexible usage patterns including screenshot capture OCR, batch processing of large sets of images or documents, PDF parsing, QR code detection, and layout-aware paragraph output. ...

Downloads: 43 This Week

Last Update: 2026-01-15
See Project
18

Pathway

Python ETL framework for stream processing, real-time analytics, LLM

...Pathway is especially well-suited for scenarios like financial analytics, IoT, fraud detection, and logistics, where high-velocity and continuously changing data is the norm. Unlike traditional batch processing frameworks, Pathway continuously updates the results of your data logic as new events arrive, functioning more like a database that reacts in real-time. It supports Python, integrates with modern data tools, and offers a deterministic dataflow model to ensure reproducibility and correctness.

Downloads: 0 This Week

Last Update: 2026-04-23
See Project
19

tidytext

Text mining using tidy tools

tidytext brings tidy data principles to text mining by converting text into a tidy data frame format. It provides tools for tokenization, sentiment analysis, n‑gram creation, and term‑document matrices, enabling interoperability with dplyr, ggplot2, and other tidyverse workflows.

Downloads: 1 This Week

Last Update: 2025-07-30
See Project
20

nunif

Misc; latest version of waifu2x; 2D video to stereo 3D video

nunif is a deep learning–based image processing framework focused on image upscaling, restoration, denoising, and enhancement tasks using neural network models. The project provides a collection of AI-powered utilities designed primarily for anime-style artwork, illustrations, and high-quality image restoration workflows. It includes command-line tools and graphical interfaces for applying trained neural models to improve image resolution and visual clarity while minimizing artifacts. nunif supports GPU acceleration and batch processing, making it suitable for creators, archivists, and enthusiasts handling large image collections. ...

Downloads: 1 This Week

Last Update: 2026-05-06
See Project
21

Hugging Face - Speech To Speech

Open speech-to-speech models and pipelines by Hugging Face toolkit AI

This project from Hugging Face focuses on enabling direct speech-to-speech processing using modern machine learning models. It provides tools and reference implementations that allow audio input to be transformed into audio output without requiring an intermediate text representation. Hugging Face - Speech To Speech builds on recent advances in speech modeling, combining components such as speech recognition, translation, and synthesis into unified pipelines. ...

Downloads: 3 This Week

Last Update: 2026-03-18
See Project
22

FFmpegFreeUI

3FUI is ffmpeg's light professional interactive shell on Windows

...It supports dozens of video, audio, and image encoders, including hardware-accelerated options, and allows custom parameter input for advanced use cases. The software also includes batch processing capabilities, real-time progress tracking, plugin extensibility, and integration with FFmpeg utilities like ffprobe and ffplay.

Downloads: 7 This Week

Last Update: 2026-04-24
See Project
23

Faster Whisper

Faster Whisper transcription with CTranslate2

Faster Whisper is an optimized implementation of the Whisper speech recognition model designed to deliver significantly faster inference while maintaining comparable accuracy. It leverages efficient inference engines and optimized computation strategies to reduce latency and resource consumption. The system is particularly useful for real-time or large-scale transcription tasks where performance is critical. It supports multiple model sizes, allowing users to balance speed and accuracy based...

Downloads: 35 This Week

Last Update: 2026-04-06
See Project
24

DeepSeek-OCR 2

Visual Causal Flow

DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents...

Downloads: 6 This Week

Last Update: 2026-02-03
See Project
25

Voice-Pro

Comprehensive Gradio WebUI for audio processing

Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.

1 Review

Downloads: 28 This Week

Last Update: 2025-12-05
See Project

Previous
You're on page 1
2
3
4
5
Next

Related Searches

ocr

umi-ocr

mkvmerge gui

automatic1111

aionui

umi

umi-ocr_paddle_v2.1.5.7z.exe

portable stable diffusion

whisper-windows-x64.exe

voice cloning

Related Categories

Artificial Intelligence

Text Editors

Software Development

Multimedia

Business

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise