Search Results for "text batch processing tools"

Sort By:

Showing 455 open source projects for "text batch processing tools"

View related business solutions

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

FFmpeg Batch AV Converter

FFmpeg Batch AV Converter

FFmpeg Batch AV Converter is a graphical front-end for FFmpeg designed to simplify advanced multimedia processing through an intuitive interface while preserving full access to FFmpeg’s capabilities. It allows users to perform complex encoding, conversion, and editing operations using drag-and-drop workflows instead of command-line input. The application supports both single and batch processing, enabling users to handle large volumes of media files efficiently. ...

Downloads: 2 This Week

Last Update: 2026-05-27
See Project
2

text-extract-api

Document (PDF, Word, PPTX ...) extraction and parse API

...Instead of requiring developers to integrate multiple document parsing libraries individually, the system centralizes text extraction capabilities into a unified API that standardizes the output. The platform supports automated processing pipelines that detect file types and apply the appropriate extraction method to obtain the most accurate text representation possible. It can be integrated into document analysis systems, knowledge retrieval tools, and AI pipelines that rely on clean textual data. ...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
3

abogen

Generate audiobooks from EPUBs, PDFs and text with captions

abogen is a tool designed to generate audiobooks (or speech narrations) from textual sources such as EPUBs, PDFs, or plain text, with synchronized captions. In other words, it automates the pipeline of reading a digital book (or document), converting its text into speech via a TTS engine, and packaging the result into an audiobook format — likely along with timestamped captions or subtitles that align with the spoken audio. This can be very useful for accessibility, content consumption on...

Downloads: 6 This Week

Last Update: 2026-02-06
See Project
4

Chandra

OCR model for complex documents with layout-aware structured outputs

...Chandra can be run locally using transformer-based inference or deployed with a high-performance server setup for large-scale processing. It also includes command-line tools and optional web-based interfaces to simplify interaction and batch processing workflows.

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
Earn up to 16% annual interest with Nexo.
More flexibility. More control.

Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.

Get started with Nexo.
5

LLM-Aided OCR Project

Enhances Tesseract OCR output using LLMs (local or API)

...This AI-assisted correction process helps reconstruct missing characters, fix formatting mistakes, and produce more coherent text outputs. The project is particularly useful for digitizing historical documents, research papers, and scanned materials where traditional OCR often struggles. It also includes tools for processing batches of images or documents, enabling automated document digitization workflows.

Downloads: 1 This Week

Last Update: 2026-03-22
See Project
6

VSET

Based onVapoursynthGraphic video batch pressing processing tool

VSET is a multimedia toolkit focused on simplifying video processing tasks through structured workflows built on top of FFmpeg. It provides tools for tasks such as transcoding, clipping, and format conversion while abstracting complex command-line operations. The project is designed to help users automate repetitive media processing tasks with minimal configuration. It supports batch processing, enabling efficient handling of large numbers of media files. ...

Downloads: 0 This Week

Last Update: 2026-04-27
See Project
7

deepdoctection

A Repo For Document AI

DeepDoctection is a document AI framework that applies deep learning techniques to analyze and extract structured data from scanned documents, PDFs, and images. deepdoctection is a Python library that orchestrates document extraction and document layout analysis tasks using deep learning models. It does not implement models but enables you to build pipelines using highly acknowledged libraries for object detection, OCR and selected NLP tasks and provides an integrated frameworks for...

Downloads: 0 This Week

Last Update: 2026-06-12
See Project
8

FFBox

A multimedia transcoded treasure chest / a FFmpeg case

FFBox is a graphical multimedia processing application that provides an accessible interface for working with FFmpeg operations such as encoding, conversion, and editing. It allows users to perform tasks like trimming, merging, and compressing media files without using command-line tools. The software supports a wide range of audio and video formats, making it suitable for diverse media workflows.

Downloads: 2 This Week

Last Update: 2026-06-07
See Project
9

ComfyUI Essentials

Essential nodes that are weirdly missing from ComfyUI core

ComfyUI_essentials is a ComfyUI custom node collection that adds practical nodes the author considers missing from the ComfyUI core. The project focuses on useful workflow building blocks rather than generic duplicates, with nodes for image handling, mask processing, sampling, segmentation, conditioning, text, and miscellaneous operations. Its image tools include functions for batching, cropping, flipping, resizing, compositing, background removal, color matching, LUT application, sharpening, tiling, and latent previewing. Its mask tools include blur, smoothing, fixing, flipping, color-based masks, segmentation masks, bounding boxes, transition masks, and batch utilities. ...

Downloads: 0 This Week

Last Update: 2026-06-12
See Project
99.99% Uptime for MySQL and PostgreSQL Databases
Sub-second maintenance. 2x read/write performance. Built-in vector search for AI apps.

Cloud SQL Enterprise Plus delivers near-zero downtime with 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server.

Try Free
10

StaxRip

Video encoding GUI for Windows

...Because StaxRip automates the invocation of complex command-line tools via a GUI, it lowers the barrier for less technical users while offering advanced configuration for experts. The tool supports batch processing, hardware acceleration (where supported), scripting, and automation through PowerShell or built-in project templates, making it versatile for single-file conversions.

62 Reviews

Downloads: 32 This Week

Last Update: 2026-06-06
See Project
11

Zerox OCR

PDF to Markdown with vision models

A dead simple way of OCR-ing a document for AI ingestion. Documents are meant to be a visual representation after all. With weird layouts, tables, charts, etc. The vision models just make sense. ZeroX is an open-source machine learning framework designed for fast experimentation and production deployment, optimized for speed and ease of use.

Downloads: 1 This Week

Last Update: 2024-12-18
See Project
12

Shutter Encoder

A professional video compression tool accessible to all

Shutter Encoder is a cross-platform video and audio processing application designed to provide professional-grade encoding and conversion tools through an accessible graphical interface. Built primarily on FFmpeg, it offers a wide range of media operations including transcoding, compression, format conversion, and editing. The software supports numerous codecs and formats, enabling users to prepare media for broadcasting, streaming, or archiving.

Downloads: 20 This Week

Last Update: 2026-06-28
See Project
13

AionUi

Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex

...It enhances productivity by offering smart file management features like batch renaming, automatic organization, and intelligent file classification, thereby reducing manual overhead when working with large datasets or complex document structures. AionUi also supports a remote WebUI mode, allowing users to access their local AI tools securely over a network from other devices while keeping all processing and data on their own hardware.

Downloads: 23 This Week

Last Update: 2 days ago
See Project
14

Pathway

Python ETL framework for stream processing, real-time analytics, LLM

...Pathway is especially well-suited for scenarios like financial analytics, IoT, fraud detection, and logistics, where high-velocity and continuously changing data is the norm. Unlike traditional batch processing frameworks, Pathway continuously updates the results of your data logic as new events arrive, functioning more like a database that reacts in real-time. It supports Python, integrates with modern data tools, and offers a deterministic dataflow model to ensure reproducibility and correctness.

Downloads: 1 This Week

Last Update: 2026-06-12
See Project
15

OpenMed

Open source healthcare AI

...OpenMed can be used in three main ways: as a simple Python API for scripts and notebooks, as a Docker-friendly FastAPI service for backend integration, and as a batch-processing system for multi-document workflows.

Downloads: 1 This Week

Last Update: 2026-07-01
See Project
16

FFmpegFreeUI

3FUI is ffmpeg's light professional interactive shell on Windows

...It supports dozens of video, audio, and image encoders, including hardware-accelerated options, and allows custom parameter input for advanced use cases. The software also includes batch processing capabilities, real-time progress tracking, plugin extensibility, and integration with FFmpeg utilities like ffprobe and ffplay.

Downloads: 22 This Week

Last Update: 22 hours ago
See Project
17

nunif

Misc; latest version of waifu2x; 2D video to stereo 3D video

nunif is a deep learning–based image processing framework focused on image upscaling, restoration, denoising, and enhancement tasks using neural network models. The project provides a collection of AI-powered utilities designed primarily for anime-style artwork, illustrations, and high-quality image restoration workflows. It includes command-line tools and graphical interfaces for applying trained neural models to improve image resolution and visual clarity while minimizing artifacts. nunif supports GPU acceleration and batch processing, making it suitable for creators, archivists, and enthusiasts handling large image collections. ...

Downloads: 3 This Week

Last Update: 2026-05-06
See Project
18

AUTOMATIC1111 Stable Diffusion web UI

Stable Diffusion web UI

...The interface also supports prompt editing, batch processing, custom scripts, and many community extensions, making it a highly customizable and continually evolving platform for creative AI art generation.

1 Review

Downloads: 192 This Week

Last Update: 2025-06-02
See Project
19

Faster Whisper

Faster Whisper transcription with CTranslate2

Faster Whisper is an optimized implementation of the Whisper speech recognition model designed to deliver significantly faster inference while maintaining comparable accuracy. It leverages efficient inference engines and optimized computation strategies to reduce latency and resource consumption. The system is particularly useful for real-time or large-scale transcription tasks where performance is critical. It supports multiple model sizes, allowing users to balance speed and accuracy based...

Downloads: 85 This Week

Last Update: 2026-04-06
See Project
20

CompressO

Convert any video/image into a tiny size. 100% free & open-source

compressO is a cross-platform, open-source multimedia compression application designed to reduce the size of videos and images while preserving visual quality. Built using modern frameworks such as Rust and Tauri, it runs locally on the user’s machine, ensuring fast performance and complete privacy without requiring cloud processing. The application supports a variety of media formats and provides controls for adjusting compression levels, resolution, and output quality. In addition to...

Downloads: 13 This Week

Last Update: 2026-04-24
See Project
21

Lesan

New way to create web server and NoSQL data model

Lesan is a multilingual text processing and translation library designed for natural language processing (NLP) applications. It provides tools for text normalization, tokenization, and translation across multiple languages.

Downloads: 0 This Week

Last Update: 2026-04-19
See Project
22

Osenpa PDF Tools

Local-first PDF tools for Windows.

Osenpa PDF Tools is a local-first Windows desktop app for common PDF workflows. It supports merging, organizing, splitting, compressing, converting, OCR text export, signing, watermarking, protecting, cleaning, and batch queue management. The app is designed to keep files on the user's device, without accounts or cloud upload for PDF processing.

Downloads: 5 This Week

Last Update: 2026-06-30
See Project
23

Umi-OCR

OCR software, free and offline

Umi-OCR is a free and open-source optical character recognition (OCR) tool designed to provide fast, offline text extraction from images, screenshots, PDFs, and more without requiring a network connection. It includes a highly efficient offline OCR engine with built-in multilingual recognition libraries, so users can extract text across multiple languages with high accuracy directly on their machines. The software supports flexible usage patterns including screenshot capture OCR, batch processing of large sets of images or documents, PDF parsing, QR code detection, and layout-aware paragraph output. ...

Downloads: 45 This Week

Last Update: 2026-01-15
See Project
24

tidytext

Text mining using tidy tools

tidytext brings tidy data principles to text mining by converting text into a tidy data frame format. It provides tools for tokenization, sentiment analysis, n‑gram creation, and term‑document matrices, enabling interoperability with dplyr, ggplot2, and other tidyverse workflows.

Downloads: 0 This Week

Last Update: 2025-07-30
See Project
25

SciSpaCy

A full spaCy pipeline and models for scientific/biomedical documents

ScispaCy is a spaCy extension optimized for processing biomedical and scientific text, providing domain-specific NLP models for tasks like named entity recognition (NER) and dependency parsing.

Downloads: 2 This Week

Last Update: 2025-10-01
See Project