Showing 109 open source projects for "batch text processing"

View related business solutions
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery turns your data warehouse into an AI platform. No new languages required.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • Host LLMs in Production With On-Demand GPUs Icon
    Host LLMs in Production With On-Demand GPUs

    NVIDIA L4 GPUs. 5-second cold starts. Scale to zero when idle.

    Deploy your model, get an endpoint, pay only for compute time. No GPU provisioning or infrastructure management required.
    Try Free
  • 1
    ExifTool

    ExifTool

    ExifTool meta information reader/writer

    ExifTool is a battle-tested Perl application for reading, writing, and batch-editing metadata in thousands of file types—images, videos, audio, documents, and more. It understands major standards like EXIF, IPTC, and XMP as well as an enormous range of camera maker notes and container formats (for example, QuickTime/MP4, PDF, TIFF). Typical workflows include extracting metadata to JSON/CSV/XML, renaming files from timestamps or tags, shifting capture times, copying tags between files, and...
    Downloads: 75 This Week
    Last Update:
    See Project
  • 2
    Perl 5

    Perl 5

    The Perl programming language

    This repository contains the reference implementation of the Perl 5 programming language, including the interpreter, core modules, build system, and an extensive test suite. Perl 5 is a multi-paradigm language renowned for powerful text processing, rich regular expressions, and pragmatic glue code across systems. The core distribution is highly portable, building on Unix, Linux, Windows, and many other platforms, with stable release cycles and careful back-compatibility. A C API (XS) and embedding APIs allow tight integration with native libraries and host applications, while the CPAN ecosystem supplies hundreds of thousands of reusable modules. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    vRain

    vRain

    Chinese Ancient eBooks Generator

    vRain is a small, focused automation tool aimed at simplifying routine tasks on education-oriented web platforms, particularly those that require periodic attendance or check-ins. It encapsulates login, session management, and request sequencing so repetitive actions can be executed reliably on a schedule. A configuration-driven approach lets users store multiple accounts, adjust intervals, and toggle features without modifying code. The program surfaces clear logs and result summaries so...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    biber
    Biber is a sophisticated bibliography processing backend for the LaTeX biblatex package. It supports a unsurpassed feature set for automated conformance to complex bibliography style requirements such as labelling, sorting and name handling. It has comprehensive Unicode support.
    Leader badge
    Downloads: 334 This Week
    Last Update:
    See Project
  • Catch Bugs Before Your Customers Do Icon
    Catch Bugs Before Your Customers Do

    Real-time error alerts, performance insights, and anomaly detection across your full stack. Free 30-day trial.

    Move from alert to fix before users notice. AppSignal monitors errors, performance bottlenecks, host health, and uptime—all from one dashboard. Instant notifications on deployments, anomaly triggers for memory spikes or error surges, and seamless log management. Works out of the box with Rails, Django, Express, Phoenix, Next.js, and dozens more. Starts at $23/month with no hidden fees.
    Try AppSignal Free
  • 5

    namefix.pl, 1st and best batch renamer

    Cross-platform batch file renamer with GUI and CLI since 2000

    namefix.pl is a cross-platform batch file renaming utility first released in 2006. It automates the cleanup of large sets of filenames using regular expressions and configurable rules. Features include handling of whitespace, punctuation, extensions, directory processing, safe overwrite checks, and dry-run mode. Both command-line and Tk GUI interfaces are provided.
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6

    Bulgarian language support

    Spell check, grammar check and hyphenation for Bulgarian language

    The goal of this project is to provide spell check, grammar check and hyphenation for Bulgarian language for Open Source products such as OpenOffice.org, LibreOffice, TeX, aspell, ispell, hunspell etc.
    Leader badge
    Downloads: 42 This Week
    Last Update:
    See Project
  • 7
    Unihanconver

    Unihanconver

    Traditional/Simplified Chinese conversion with CLI or GUI

    Tool to convert between Traditional/Simplified Chinese directly in Unicode (not GB/Big5 conversion). It is written in Perl and does not use any external libraries. It provides a command-line utility as well as a GTK+ interface for X Window.
    Downloads: 31 This Week
    Last Update:
    See Project
  • 8
    RefDB is a reference database and bibliography tool for SGML, XML, and LaTeX documents, sort of a Reference Manager or BibTeX for markup languages. It is portable and known to run on Linux, Free/NetBSD, OSX, Solaris, and Windows/Cygwin.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 9
    Cclite

    Cclite

    Cclite Alternative Currency Software

    ...Multi-registry (group), multi-currency, with inter-registry transactions using web services (SOAP,REST), with rough templates for 17 languages. Various payment interfaces email, SMS, jabber, batch. User manual. Note Cclite is NOT crypto, it's mutual social credit! https://github.com/hbarnard/cclite-android-app this is now also here as: https://sourceforge.net/projects/cclite-android-app
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    ExifCleaner

    ExifCleaner

    Desktop app to sweep away exif metadata from images and other files

    ExifCleaner is a cross-platform desktop app for cleaning metadata from images, videos, PDFs and other files. With ExifCleaner you can keep file sizes to a minimum and keep potential attackers from being able to use your GPS and device metadata.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 11
    Perl Audio Converter

    Perl Audio Converter

    Linux Audio Converter / Tagger / CD Ripper

    ...It can also extract audio from the following video extensions: RM, RV, ASF, DivX, MPG, MKV, MPEG, AVI, MOV, OGM, OGV, QT, VCD, SVCD, M4V, NSV, NUV, PSP, SMK, VOB, FLV, WEBM and WMV. Parallel Processing, a CD ripping function with CDDB support, batch conversion, tag preservation for most supported formats, independent tag reading & writing, service menus for KDE Dolphin/Konqueror, Gnome Nautilus script, and action scripts for Nemo/Thunar are also provided.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12

    VecText

    Converting text to a structured representation

    ...This way of non-interactive communication enables incorporating the application into a more complicated data mining process integrating several software packages or performing multiple conversions in a batch.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Café is an yagte (*) multipurpose, tabbed, scriptable, lightweight, fast, UTF-8 ready and full-featured text editor with IDE aspirations. Cafe' uses Perl as macro language. (*) yet another GTK+ text editor
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    XSH is a powerfull command-line XML editing tool/programming language in the manner of Unix shell interpreters and line-oriented text editors like ed which can be used either interactively or for batch-mode XML processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    SmGen

    Verilog Finite State Machine (FSM) Code Generator

    SmGen is a finite state machine (FSM) generator for Verilog. On the other hand, it is not an FSM entry tool. The input is behavioral Verilog with clock boundaries specifically set by the designer. SmGen unrolls this behavioral code and generates an FSM from it in synthesizable Verilog. Clock boundaries are explicitly provided by the designer so there is good control on the expected timing
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Betty

    Betty

    Holberton-style C code checker written in Perl

    Betty is a Perl-based coding style checker that enforces the Holberton School coding style (inspired by the Linux kernel style) for C code and documentation. It identifies inconsistencies, style violations, and formatting issues in C source files. You should be aware that by default, some text editors are using spaces instead of tabs. For instance, when you press tab key on emacs, by default, leading spaces will be put, and that will cause Betty to raise a lot of warnings. Please find some...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Capsim(r) C Text Mode Kernel(TMK),DSP and communication blocks, topologies, libraries and tools for the development of high performance block diagram digital signal processing and communications systems,built in interpreter for scripting.SystemC support.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Perl Text Editor

    Perl Text Editor

    Source code for perl simple text editor.

    Simple text editor. Run: $perl 01text0.pl
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    360-FAAR  Firewall Analysis Audit Repair

    360-FAAR Firewall Analysis Audit Repair

    360-FAAR Analyze FW1 Cisco Netscreen Policy Offline Using Config/Logs

    360-FAAR (Firewall Analysis Audit and Repair) is an offline, command line, firewall policy manipulation tool to filter, compare to logs, merge, translate and output firewall commands for new policies, in Checkpoint dbedit, Cisco ASA or ScreenOS commands, and its one file! Read Policy and Logs for: Checkpoint FW1 (in odumper.csv / logexport format), Netscreen ScreenOS (in get config / syslog format), Cisco ASA (show run / syslog format), 360-FAAR compares firewall policies and uses...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Expose

    Expose

    A simple static site generator for photoessays

    ...Implemented as a Bash script, it converts directories of media files into cleanly structured static websites with built-in themes. By default, it includes both a blog-style layout and a Medium-inspired theme, but users can also build their own templates. Expose reads associated text files, YAML metadata, and folder structures to automatically generate navigation menus, captions, and styling for each gallery. It supports image and video customization through ImageMagick and FFmpeg, enabling batch effects, filters, watermarks, and even video stabilization. With minimal setup, users can transform raw media collections into polished photoessays suitable for personal portfolios, storytelling, or lightweight publishing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. ...
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23

    PLP

    Powerfull pre-processor

    Powerful Verilog Preprocessor. PLP stands for Perl Pre-processor. Perl is used as "control language" that is embedded in the Verilog code (or any other code) to generate code on the fly. It is used commonly as a Verilog pre-processor but can be used with any target/output language (C, C++, Java, VHDL, plain text etc)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Filekit

    File Processing Tools - string search and DOS command execution

    Filekit is a set of file processing GUI tools that can be put into an USB flash, so that no installation is required. Aim of the project is to include many file processing tools such as file searching tool, batch rename file, find duplicate files, split and join files and so on ... Release 0.1.0 consist of the multi-threaded file search tool that allow you to select multiple folders and files, then search those files that contains particular strings using regular expression. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB