Showing 20 open source projects for "text batch processing tools"

  • 1
    Trurl

    A command line tool for URL parsing and manipulation

    trurl is a command-line tool developed by the curl project for parsing and manipulating URLs. It allows users to modify URL components easily, aiding in tasks like scripting and testing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    ReductStore

    The fastest time series object store for Edge AI

    History storage and management of images, vibration data, text, labels, and more - all in one place with the highest performance. Merge blob and time series functionalities, reducing the need for multiple databases. Customize real-time data retention policies and replication strategies. Store billions of time-stamped blobs with AI labels and access them with low latency. Outperform other databases with a customized solution for time-series object data. Capture and access blob data as time...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    sed-pcre2

    GNU sed with PCRE2 regexp

    GNU sed with PCRE2 regexp support
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    ezthumb
    A video thumbnail generator based on FFMPEG.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 5

    Simple Java Image Tool

    Easy-to-use command line tool for images

    SJIT is an easy-to-use command line program to analyze, manipulate and convert image files.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    The Art of Command Line

    Master the command line, in one page

    The Art of Command Line is a single, highly curated page of tips that distills years of Unix command-line experience into practical, memorable guidance. It emphasizes fluency: small habits and commands that compound into faster debugging, data wrangling, and system navigation. The content spans basic shell usage, text processing with tools like grep/sed/awk, networking and performance inspection, and advice for working safely with root and destructive commands. Many entries highlight lesser-known flags or idioms that save keystrokes or avoid pitfalls, and the list aims to be dense but scannable. It is written for Linux first while acknowledging macOS and Windows differences where relevant. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Flat file extractor (ffe) can be used for reading and parsing different flat file structures and printing them in different formats. ffe is a command line tool developed in a GNU/Linux environment and distributed under the GPL. The project has moved to https://github.com/igitur/ffe
    Downloads: 9 This Week
    Last Update:
    See Project
  • 9
    CSVfix

    Command-line tool specifically designed to deal with CSV data

    ...Unfortunately, the CSV files you are given, or are required to produce, never seem to be in quite the right format for your particular business application. And because of the structure of CSV records, using standard text processing tools like sed, awk and perl is not as simple as it might be. Usage: http://csvfix.byethost5.com/csvfix15/csvfix.html?csvfix.html?Usage.html?i=1&i=2 CSVfix aims to provide a solution to these problems. It is a command-line stream editor specifically designed to deal with CSV data. With it you can, among other things:
    Downloads: 74 This Week
    Last Update:
    See Project
  • 10
    pdfsandwich generates "sandwich" OCR PDF files: PDF files which contain only images (but no editable text) are processed by optical character recognition (OCR), and the recognized text is added to each page invisibly "behind" the images. pdfsandwich is a command line tool intended for OCR of scanned books or journals. It can recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries:...
    Downloads: 299 This Week
    Last Update:
    See Project
  • 11
    The tlve program is a command-line tool for parsing different TLV (tag-length-value) structures and printing them in various text-based formats. tlve is developed in a GNU/Linux environment and distributed under the GPL.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12

    isbntools

    A command line tool to extract, transform and get metadata for ISBNs

    As of 2015-06-02, this project is no longer under active development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    grepp

    An ultimate text-analysing tool

    A command line tool for text file analysis, filtering, splitting and reporting. Runs under Java (1.5+) and supports plugins written in Groovy. Distributions include *nix and Windows batch files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    SplitPDF (SplitPDF.jar) is a command-line-driven Java program that splits a PDF file by bookmarks into separate PDFs. Each bookmark is used as the title of the newly created PDF. Extremely useful and fast in a batch processing environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Lets you synchronize related scheduled jobs that execute across many different machines. Instead of adding time padding between jobs (which can fail), use this utility. Please feel free to add more functions!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    binreplace is a command line tool to search and replace binary sequences in a file. It can be very powerful when used together with grep and sed. It uses a prefix tree internally to search, and it's efficient even with millions of search terms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Random name generator library with rule files. Fast, lightweight and easy. It can generate rule files from custom text; you can easily use it in your application, or use the command line tool to generate the most suitable name for your needs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Chordeus is yet another Chord Pro to PDF converter that creates nice looking guitar chord sheets. You can create single sheets or a whole songbook using the command line tool or a simple wizard-like GUI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    xmlsnipe is a command line tool to display, remove, and modify the content of nodes within an XML document.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Atox is a fully customizable Python library and command-line tool for converting plain text into XML.
    Downloads: 0 This Week
    Last Update:
    See Project