Showing 54 open source projects for "batch text processing"

View related business solutions
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI Studio. Switch between models without switching platforms.
    Start Free
  • 1
    Memobase

    Memobase

    Fast backend for long-term AI user memory via structured profiles

    ...Instead of relying purely on traditional embedding-based retrieval or RAG systems, Memobase uses profile and timeline structures to deliver memory that reflects user context efficiently and meaningfully. The system focuses on three principal performance metrics: high search performance, reduced large language model (LLM) costs through batch processing techniques, and low latency with minimal SQL operations. Memobase supports integration with existing LLM workflows via APIs and SDKs (including Python, Node, and Go), making it easy to adopt within diverse application stacks.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    RefDB is a reference database and bibliography tool for SGML, XML, and LaTeX documents, sort of a Reference Manager or BibTeX for markup languages. It is portable and known to run on Linux, Free/NetBSD, OSX, Solaris, and Windows/Cygwin.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 4
    DocScript is an approach to document preparation. It presents tools and utilities to edit and publish documents. The philosophy behind the DocScript project is to utilize the programming tools you're working with anyway in your daily work.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Catch Bugs Before Your Customers Do Icon
    Catch Bugs Before Your Customers Do

    Real-time error alerts, performance insights, and anomaly detection across your full stack. Free 30-day trial.

    Move from alert to fix before users notice. AppSignal monitors errors, performance bottlenecks, host health, and uptime—all from one dashboard. Instant notifications on deployments, anomaly triggers for memory spikes or error surges, and seamless log management. Works out of the box with Rails, Django, Express, Phoenix, Next.js, and dozens more. Starts at $23/month with no hidden fees.
    Try AppSignal Free
  • 5
    Script Echo Color

    Script Echo Color

    Terminal text colorizing and simplifies script coding.

    ScriptEchoColor simplifies Linux terminal text colorizing, formatting and several steps of script coding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Snowmix

    Snowmix

    Video mixer for mixing live and recorded video and audio feeds

    New version 0.5.2.1 Released December 29th 2025. Snowmix is a Swiss army knife tool for mixing live and recorded video and audio feeds. It supports 2D and 3D clipping, scaling and transparent overlay of video, png graphics and text. It supports animation of video, images and texts through native commands changing scale, placement, transparency and rotation. Animation and actions can also be controlled through native scripting and an embedded Tcl and/or Python interpreter. Snowmix is...
    Leader badge
    Downloads: 32 This Week
    Last Update:
    See Project
  • 7

    Esegui SB

    Flexible video encoding script supporting multiple formats and codecs.

    Video Encoding and Processing Script This bash script streamlines video encoding and processing by leveraging FFmpeg. It supports a wide range of formats and codecs through FFmpeg, including MKV, MP4, x264, x265, AV1 and VP9. Key features include: Flexible Video and Audio Encoding: Utilizes FFmpeg's libraries for various codecs and settings. Subtitle Handling: Include or exclude subtitles as needed. Customizable Settings: Adjust CRF values and aspect ratios to suit your...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    AI File Sorter

    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    AI File Sorter is a cross-platform desktop application that uses AI to organize files and suggest meaningful file names based on real content, not just filenames or extensions. The app can analyze image files locally and propose human-readable rename suggestions (for example, IMG_2048.jpg → clouds_over_lake.jpg). It can also analyze the text content of documents to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common text files....
    Downloads: 261 This Week
    Last Update:
    See Project
  • 9
    BmuS

    BmuS

    BmuS - Powerful linux backup program deduplication, encryption & more

    BmuS is a powerful free backup program for the automated backup of files, directories, and MySQL databases from a Linux / Raspberry Pi system to a NAS or network drive. You can also sync two NAS (i.e. Synology to Synology or UGREEN etc. and vice versa). BmuS can be installed directly on the source system or via a Docker repository, which is also available. With Docker, you can easily use BmuS on different operating systems, such as a Mac or Windows. Visit the Quick Start Guide and FAQ on...
    Downloads: 5 This Week
    Last Update:
    See Project
  • Push Code. Get a Production URL. Done. Icon
    Push Code. Get a Production URL. Done.

    Cloud Run deploys any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try Cloud Run Free
  • 10
    LPub3D

    LPub3D

    LDraw™ editor for LEGO® style digital building instructions.

    LPub3D is an Open Source WYSIWYG editing application for creating LEGO® style digital building instructions. LPub3D is developed and maintained by Trevor SANDY. It uses the LDraw™ parts library, the most comprehensive library of digital Open Source LEGO® bricks available (www.ldraw.org/ ) and reads the LDraw LDR and MPD model file formats. LPub3D is available for free under the GNU Public License v3 and runs on Windows, Linux and macOS Operating Systems. LPub3D is also...
    Leader badge
    Downloads: 34 This Week
    Last Update:
    See Project
  • 11

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 12
    EBookGenTools

    EBookGenTools

    EBook Generation Tools - scripts to create ebook formats EPUB, DOC

    EBookGenTools is a set of GNU/Linux shell scripts to process plain text for a book into HTML and electronic book formats. It was developed to create EPUB and DOC files from book text exported from novel writing software such as Manuskript, StoryBook, or your favourite text editor. EBookGenTools builds on the power of other software to create the following ebook formats: - EPUB: Calibre - ebook management - DOC: LibreOffice - free office suite These tools can be used directly to...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Terminals Are Sexy

    Terminals Are Sexy

    A curated list of Terminal frameworks, plugins & resources

    ...The list favors practical, well-maintained projects and is organized so you can quickly discover alternatives for common tasks (navigation, search, networking, monitoring, text processing, and more). It also points to customization ideas—prompts, themes, and fonts—that help you craft a clear, legible, and pleasant environment. Because it’s a living list, contributions are encouraged, keeping entries current as tools evolve. If you spend time in a terminal—whether writing code, administering systems, or just automating chores—this is a single, convenient place to find better defaults and new favorites.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Pure Bash Bible

    Pure Bash Bible

    A collection of pure bash alternatives to external processes

    ...Its goal is to reduce reliance on external tools like sed, awk, or grep, which can slow down scripts and add unnecessary dependencies. The project is organized as a reference book of function-based code snippets, each showcasing practical solutions for string manipulation, text processing, file operations, and more. By relying exclusively on Bash built-ins, these methods can make scripts faster, more portable, and easier to maintain. Every snippet is linted with shellcheck, and many have accompanying tests, ensuring correctness and usability. The project is not only a coding resource but also an educational tool for Bash users who want to better understand the full power of the language.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    LaTeX Reference Card Creator

    LaTeX Reference Card Creator

    A Makefile based build system for creating LaTeX reference cards

    LaTeX Reference Card Creator is a Makefile based build system for creating reference cards. LaTeX Reference Card Creator compiles content into PDF, DjVu, TEX DVI, HTML and PostScript output formats. A three column reference card will be created. Features include batch image format conversions, spell checking, broken link checking, automatic backups and .zip and .tar.gz distribution building. LaTeX Reference Card Creator provides many LaTeX examples which can be used to make a reference card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Expose

    Expose

    A simple static site generator for photoessays

    ...Implemented as a Bash script, it converts directories of media files into cleanly structured static websites with built-in themes. By default, it includes both a blog-style layout and a Medium-inspired theme, but users can also build their own templates. Expose reads associated text files, YAML metadata, and folder structures to automatically generate navigation menus, captions, and styling for each gallery. It supports image and video customization through ImageMagick and FFmpeg, enabling batch effects, filters, watermarks, and even video stabilization. With minimal setup, users can transform raw media collections into polished photoessays suitable for personal portfolios, storytelling, or lightweight publishing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Texinfo Web Publisher

    Texinfo Web Publisher

    Multi-format web publishing system based on Texinfo

    Texinfo Web Publisher is a Makefile based publishing system featuring simultaneous con- tent creation into HTML, non-split HTML, Framed HTML, HTML Zip, XML, DocBook, PDF, DjVu, PostScript, DVI, Plain text, Info and EPUB book formats. All Texinfo Web Publisher output formats are from a single source. Texinfo Web Publisher can be used for website creation has FTP deployment capabilities and supports Cascading Style Sheets (CSS). Texinfo Web Publisher is a low maintenance solution for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Yet Another xdelta-based Patch Creator

    A tool to help you create xdelta-based patches quickly and easily

    YAXBPC is a tool designed to assist the users of xdelta3 to create patches quickly and easily. It's designed by a fagsubber for fagsubbing, but generic usuage also is fine. Requirements: .Net Framework 2.0 (Windows), or Mono (Linux/Mac). xdelta3 package is also required if you want to run YAXBPC in Linux/Mac. YAXBPC is licensed under Apache License Version 2.0. All wrongs are reserved to me; you can use the right parts. Latest dev source code can be accessed here:...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    LaTeX Web Publisher

    LaTeX Web Publisher

    LaTeX Web Publisher is a Makefile based Web publishing system

    LaTeX Web Publisher is a Makefile based Web publishing system featuring content creation into HTML, non-split HTML, HTML Zip, PDF, DjVu, PostScript, DVI and Plain text formats. All LaTeX Web Publisher output formats are from a single LaTeX source and have indices. LaTeX Web Publisher can be used for website creation and has FTP deployment capabilities. A website created with LaTeX Web Publisher will have HTML, non-split HTML and PDF content formats. The website will have complete HTML...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    TEA is a text editor that provides a wide range of text-processing functions (over 100) and the syntax highlighting. There are two branches of TEA: Qt-based and GTK-based.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22

    unpack with password list

    Helper script to unpack archive files using a password list

    Simple shell script that unpacks every archive in a folder using a password list. You don't need to call the right packer and all the passwords in the list will be tested.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    docx2txt

    docx2txt

    Perl based utility to extract formatted text content from MS Docx file

    Docx2txt is a Perl based command-line utility to convert (even corrupted) Microsoft docx documents to reasonably formatted text files, along with appropriate character conversions. Apart from Perl it also requires a command line unzipping program like unzip/7z/pkzipc/wzunzip.
    Leader badge
    Downloads: 75 This Week
    Last Update:
    See Project
  • 24
    Antiexcel is a Microsoft-Excel file reader. It shows the excel file on a text console or an x-console. It can be used as an attachment viewer for any text-based email client ( ex. mutt)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    DPRK pull is a script that pulls the English language North Korean news articles from the KCNA website and puts them into one file for reading by a Text to Speech program.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB