Search Results for "text batch processing tools" - Page 2

Showing 304 open source projects for "text batch processing tools"

  • 1
    pyVideoTrans

    Translate the video from one language to another and embed dubbing

    ...The tool supports both command-line and GUI modes, making it accessible to developers and creatives needing batch or automated processing.
    Downloads: 17 This Week
  • 2
    PDFCraft

    PDFCraft is a free, privacy-focused PDF toolkit

    PDFCraft is an extensible toolkit for creating, editing, and transforming PDF documents with both a graphical interface and a scripting API, making it useful for users ranging from casual editors to automated document processors. At its core, the project provides a clean, modern UI where you can rearrange pages, annotate text, insert images, fill forms, and export to multiple formats, all without needing a heavyweight commercial PDF suite. But beyond manual editing, it also offers a...
    Downloads: 39 This Week
  • 3
    SciSpaCy

    A full spaCy pipeline and models for scientific/biomedical documents

    ScispaCy is a spaCy extension optimized for processing biomedical and scientific text, providing domain-specific NLP models for tasks like named entity recognition (NER) and dependency parsing.
    Downloads: 0 This Week
  • 4
    Readest

    Readest is a modern, feature-rich ebook reader

    Readest is a project meant to facilitate reading, studying, or consuming content by integrating reading tools with AI-powered assistance. Although the repository is not as widely documented or popular as some, the idea is that Readest supports features to help with reading comprehension — likely combining OCR / text retrieval, translation, note-taking, or summarization for reading materials (eBooks, articles, PDFs). The goal appears to be to let users feed in arbitrary reading material and...
    Downloads: 25 This Week
  • 5
    AutoCut

    Cut videos with a text editor

    ...AutoCut supports multiple transcription backends, including Whisper and faster-whisper modes, allowing users to choose based on speed or accuracy needs. After editing the transcript text, the corresponding video clips are merged into the final output, and the tool also produces matching subtitle files. Its command-line interface can be integrated into scripts, making it suitable for automated workflows or batch processing.
    Downloads: 4 This Week
  • 6
    edge-tts

    Use Microsoft Edge's online text-to-speech service from Python

    edge-tts is a Python module and command-line tool that gives you direct access to Microsoft Edge’s online text-to-speech service without needing the Edge browser, Windows, or any API key. It wraps the same cloud voices used by Edge, exposing them through a simple CLI (edge-tts, edge-playback) and a Python API, so you can script high-quality speech generation in your own applications. The tool lets you list available voices, specify locale and voice name, and generate audio files in common...
    Downloads: 35 This Week
  • 7
    MCScanX

    MCScanX: Multiple Collinearity Scan toolkit X version

    MCScanX is a toolkit for detecting gene synteny and collinearity, aiding in the evolutionary analysis of gene arrangements across multiple genomes. It extends the original MCScan algorithm by incorporating additional utilities for visualization and downstream analyses.
    Downloads: 7 This Week
  • 8
    Short Video Factory

    AI tool for automatic batch short video creation and editing

    Short Video Factory is an open source desktop application designed to simplify the creation of short-form videos using AI-driven automation. It enables users to generate product marketing clips and general content videos by combining simple prompt-based input with pre-prepared media assets. Short Video Factory integrates multiple stages of video production, including script generation, voice synthesis, video editing, and subtitle effects, into a single streamlined workflow. By leveraging AI...
    Downloads: 1 This Week
  • 9
    Verticals v3

    Automated YouTube Shorts pipeline

    Verticals v3 is an automated content generation workflow designed to create and process YouTube Shorts videos programmatically. It combines multiple tools and scripts to handle tasks such as downloading source material, editing clips, adding subtitles, and formatting output for vertical video platforms. The pipeline emphasizes automation, allowing users to produce short-form content at scale with minimal manual intervention. It integrates FFmpeg and other media processing tools to handle...
    Downloads: 1 This Week
  • 10
    TTime

    Screenshots, word marking, OCR, AI, translation software

    ...It also supports clipboard monitoring and silent OCR processing, enabling seamless workflows where extracted text can be translated automatically without interrupting the user. The interface is designed to be lightweight and responsive, with customizable shortcuts and floating tools that enhance usability during multitasking.
    Downloads: 2 This Week
  • 11
    Youwee

    A beautiful, cross-platform downloader for YouTube, TikTok, Instagram

    Youwee is a modern cross-platform media downloader built with Tauri and React that supports downloading content from over 1800 websites including YouTube, TikTok, and Instagram. It provides a polished graphical interface that simplifies media downloading while leveraging powerful tools like yt-dlp and FFmpeg under the hood. The application supports advanced features such as batch downloads, playlist management, and extraction of audio or subtitles. It also integrates AI capabilities,...
    Downloads: 25 This Week
  • 12
    Scanopy

    Clean network diagrams, One-time setup, zero upkeep

    Scanopy is a powerful multi-modal data capture and analysis toolkit that enables users to collect, process, and visualize structured and unstructured information from a variety of sources in a flexible pipeline. It is built to handle complex scanning tasks — such as OCR, document analysis, audio transcription, network data capture, and image extraction — while providing unified APIs and workflows that make managing heterogeneous data sources seamless. Developers can compose custom pipelines...
    Downloads: 3 This Week
  • 13
    Pocket TTS

    A TTS that fits in your CPU (and pocket)

    ...Because it is CPU-oriented, it fits well in server environments where GPU access is limited, in desktop apps, or in edge deployments where simplicity matters more than maximum throughput. It also emphasizes developer ergonomics, providing a straightforward API surface that can be integrated into pipelines, assistants, accessibility tools, or batch generation scripts.
    Downloads: 12 This Week
  • 14
    OmniTools

    Self-hosted collection of powerful web-based tools for everyday tasks

    OmniTools is a self-hosted web application that bundles a large collection of everyday utilities into a single clean interface you can run on your own infrastructure. It’s designed to replace the random assortment of “free online tools” people use for quick tasks, while avoiding ads, tracking, and the need to upload sensitive files to unknown servers. A key design choice is that file processing happens entirely on the client side, meaning your data stays in your browser instead of being sent to the backend. The tool catalog spans both technical and non-technical needs, including image, video, audio, PDF, text, date/time, math, and data format utilities like JSON/CSV/XML helpers. ...
    Downloads: 10 This Week
  • 15
    EEGLAB

    EEGLAB is an open source signal processing environment

    EEGLAB is an open source, MATLAB-based interactive environment for analyzing electrophysiological signals such as EEG and MEG. It incorporates powerful tools for data import, preprocessing, independent component analysis (ICA), time-frequency analysis, artifact rejection, and visualization—all within a GUI framework that also supports scripting and plugin extensions. EEGLAB is an open source signal processing environment for electrophysiological signals running on Matlab and Octave (command...
    Downloads: 12 This Week
  • 16
    Chess Tools for Windows

    Batch tools for creating Polyglot books and PGN-based statistics.

    Open source scripts and tools for Windows. Every zip archive contains a readme file with further information. PGNStat and PGNXtract are batch processing tools for Windows. Simply unpack the zip archive into a directory of your choice. The batch files require free and open source third-party tools. These are not included in the zip archives and must be downloaded separately.
    Downloads: 2 This Week
  • 17
    MindNLP

    Easy-to-use and high-performance NLP and LLM framework

    MindNLP is a natural language processing library built on the MindSpore framework, providing tools and models for various NLP tasks.
    Downloads: 0 This Week
  • 18
    Apache OpenNLP

    Apache OpenNLP

    Apache OpenNLP is a machine learning-based NLP library that provides tools for text-processing tasks such as tokenization, sentence segmentation, and named entity recognition.
    Downloads: 1 This Week
  • 19
    DeerFlow

    Deep Research framework, combining language models with tools

    DeerFlow is an open-source, community-driven “deep research” framework / multi-agent orchestration platform developed by ByteDance. It aims to combine the reasoning power of large language models (LLMs) with automated tool-use — such as web search, web crawling, Python execution, and data processing — to enable complex, end-to-end research workflows. Instead of a monolithic AI assistant, DeerFlow defines multiple specialized agents (e.g. “planner,” “searcher,” “coder,” “report generator”)...
    Downloads: 114 This Week
  • 20
    FastFlix

    FastFlix is a free GUI for H.264, HEVC and AV1 hardware and software encoders

    FastFlix is a graphical user interface for AV1 video encoding that simplifies the process of converting video files into the modern AV1 format. It acts as a frontend for encoding tools such as SVT-AV1 and AOM, allowing users to configure advanced encoding parameters without using the command line. The application focuses on balancing quality and encoding speed, offering presets and tuning options for different use cases. It supports batch encoding, enabling users to process multiple files...
    Downloads: 3 This Week
  • 21
    SwarmUI

    Modular AI image and video generation web UI with extensible tools

    SwarmUI is a modular web-based user interface designed for AI-driven image generation, with a strong focus on usability, performance, and extensibility. It serves as a unified environment for working with multiple AI models, including Stable Diffusion and newer image and video generation systems, allowing users to create and manage outputs through a browser interface. SwarmUI is built to accommodate both beginners and advanced users by offering a simple “Generate” interface alongside more...
    Downloads: 8 This Week
  • 22
    IronCalc

    Main engine of the IronCalc ecosystem

    IronCalc is a new, modern, work-in-progress spreadsheet engine and set of tools to work with spreadsheets in diverse settings. IronCalc is a lightweight, open-source computational engine designed for performing mathematical operations, formula calculations, and data-driven tasks.
    Downloads: 0 This Week
  • 23
    baoyu-skills

    Skills shared by Baoyu for improving daily work efficiency with Claude

    ...The system supports marketplace-style installation, where users can selectively install or update individual skills rather than a monolithic package. It integrates with various external services, including AI APIs and browser automation tools, to expand its capabilities beyond basic text processing.
    Downloads: 2 This Week
  • 24
    julep

    A new DSL and server for AI agents and multi-step tasks

    Julep is a platform for creating AI agents that remember past interactions and can perform complex tasks. It offers long-term memory and manages multi-step processes. Julep enables the creation of multi-step tasks incorporating decision-making, loops, parallel processing, and integration with numerous external tools and APIs. While many AI applications are limited to simple, linear chains of prompts and API calls with minimal branching, Julep is built to handle more complex scenarios.
    Downloads: 0 This Week
  • 25
    Cookbook (Google Gemini)

    Examples and guides for using the Gemini API

    ...It provides a structured learning path with quick-start tutorials for beginners and practical examples for advanced users. The repository covers a wide range of Gemini capabilities, including text, images, video, speech, robotics, and multimodal interactions. It highlights newly introduced features such as Gemini 2.5 models (Flash and Pro), Gemini’s native image generation, Veo for video generation, robotics-focused reasoning models, and Lyria for TTS and music generation. The Cookbook also includes tutorials on advanced API workflows such as grounding answers with external tools, batch-mode request handling, and live multimodal interactivity with LiveAPI. ...
    Downloads: 1 This Week