Showing 1000 open source projects for "batch text processing"

  • 1
    LaTeX Reference Card Creator

    A Makefile-based build system for creating LaTeX reference cards

    LaTeX Reference Card Creator is a Makefile-based build system for creating three-column reference cards. It compiles content into PDF, DjVu, TeX DVI, HTML and PostScript output formats. Features include batch image format conversions, spell checking, broken link checking, automatic backups and .zip and .tar.gz distribution building. It also provides many LaTeX examples which can be used to make a reference card.
    Downloads: 0 This Week
  • 2
    Anaphraseus
    Anaphraseus is a CAT (Computer-Aided Translation) tool: an OpenOffice.org 2-3 macro set for OpenOffice/LibreOffice Writer, similar to the well-known Wordfast. It works with the Wordfast Translation Memory format (*.TXT) and supports text segmentation.
    Downloads: 6 This Week
  • 3
    XSLT syntax highlighting

    Java-based XSLT processor extension for syntax highlighting

    Please note that the project has moved to GitHub: https://github.com/xmlark/xslthl This is an implementation of syntax highlighting as an extension module for XSLT processors (Xalan, Saxon). If you have, for example, an article about programming written in DocBook, code examples can be automatically syntax highlighted during the XSLT processing phase.
    Downloads: 110 This Week
  • 4
    Jed Modes Repository
    A collection of S-Lang extension scripts (modes) for the Jed text editor, contributed by Jed users. Browse the repository at http://jedmodes.sf.net/
    Downloads: 1 This Week
  • 5
    bsed

    Simple SQL-like syntax on top of Perl text processing

    bsed is a stream editor that offers a simple SQL-like syntax for text processing tasks. Designed to replace basic uses of tools like sed, grep, AWK, and Perl, it allows users to perform complex text manipulations with intuitive commands.
    Downloads: 0 This Week
  • 6
    Pgn2ltx is a simple converter that reads chess games in PGN format and outputs them to a LaTeX input file.
    Downloads: 0 This Week
  • 7
    ChessTask is a frontend for easily creating chess tasks with LaTeX. The tasks are stored in a list which can be exported, either to a LaTeX input file, or to HTML files.
    Downloads: 0 This Week
  • 8
    Fen2eps is a small console program that converts FEN (Forsyth-Edwards Notation) strings to EPS (Encapsulated PostScript) files containing the chess board diagram.
    Downloads: 0 This Week
  • 9
    Café is a yagte (*): a multipurpose, tabbed, scriptable, lightweight, fast, UTF-8 ready and full-featured text editor with IDE aspirations. Café uses Perl as its macro language. (*) yet another GTK+ text editor
    Downloads: 0 This Week
  • 10
    Web Book Downloader

    Download websites as e-books: PDF, TXT, EPUB.

    This application lets the user download chapters from a website in three ways: from a table of contents; from a range (first chapter address to last chapter address); or by crawling from the first chapter to chapter n. In the settings you can customize the language and the input (website) encoding; for simplicity, the output uses the same encoding. To add your language, add a new class to the strings package, and new fields to the Settings class and the GUI menu (initialize method).
    Downloads: 3 This Week
  • 11
    Common Litt

    Simple JavaScript library for automatic transliteration; an input tool.

    This project focuses on automatic conversion between language alphabets. Using the 'lit.js' library, you can currently convert between English, Tamil, and Sinhala scripts in either direction. This is useful when you need to know how to write something in another language. It is still at the development stage but works perfectly and is easy to customize. Live demo available at: http://commonlitt.42web.io/ For the UI, I used Bootstrap and jQuery. For easy array...
    Downloads: 0 This Week
  • 12

    SmGen

    Verilog Finite State Machine (FSM) Code Generator

    SmGen is a finite state machine (FSM) generator for Verilog. It is not, however, an FSM entry tool. The input is behavioral Verilog with clock boundaries specifically set by the designer. SmGen unrolls this behavioral code and generates an FSM from it in synthesizable Verilog. Because clock boundaries are explicitly provided by the designer, there is good control over the expected timing.
    Downloads: 0 This Week
  • 13
    Diffuse
    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
    Downloads: 95 This Week
  • 14

    Ghawwas_V4

    An open source system for Arabic corpora processing

    Ghawwas (previously known as Khawas) is an open source system for Arabic corpora processing. Ghawwas V4.0 provides the following main functions: a. Frequency lists for single words and N-grams b. Concordance c. Collocation (MI, CHI Squared, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient) d. Lexical pattern search e. Frequency profile comparison of two corpora based on MI, CHI, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient f. Accept Windows and UTF-8 character...
    Downloads: 11 This Week
  • 15
    mbFXWords

    Analyze text. Diagonal read subject, predicate, obj. Search other pdf.

    Version 1.04. Applies and builds upon Apache OpenNLP. For English, French and German files. JavaFX application; runs on Oracle Java Runtime Environment version 8, which includes JavaFX. NLP extensions: - Divide sentences into subclauses: segmentation. - Divide plain text into subject, predicate, object. - Count words: stemming. - Search for similar content: PDFs. Outputs the subject, predicate and object of sentences in PDF and plain text files. Provides a comfortable GUI. Automatic...
    Downloads: 0 This Week
  • 16
    Vrapper

    Vim-like editing in Eclipse

    Vrapper is an Eclipse plugin which acts as a wrapper for existing Eclipse text editors to provide a Vim-like input scheme for moving around and editing text. Eclipse Update Site: http://vrapper.sourceforge.net/update-site/stable
    Downloads: 12 This Week
  • 17
    Mavscript

    Calculations in a text document

    Mavscript allows the user to do calculations in a text document. Plain text, LaTeX and OpenOffice Writer files (.odt) are supported. The calculation is done by the algebra system Yacas (default), Jasymca or by the Java interpreter BeanShell.
    Downloads: 0 This Week
  • 18
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Downloads: 153 This Week
  • 19
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application that helps the user merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Downloads: 55 This Week
  • 20
    TeXML is an XML vocabulary for TeX. The processor transforms the TeXML markup into the TeX markup, escaping special and out-of-encoding characters. The intended audience is developers who automatically generate [La]TeX or ConTeXt files.
    Downloads: 1 This Week
  • 21
    Command-line/Ant-task/embeddable text file preprocessor. Macros, flow control, expressions. Recursive directory processing. Extensible in Java to display data from any data source (such as a database). Can generate complete homepages (trees of HTML files, images, etc.)
    Downloads: 99 This Week
  • 22
    Create beautiful song books for your church or fellowship using this LaTeX package and related tools.
    Downloads: 10 This Week
  • 23
    JSesh is an ancient Egyptian hieroglyphic text processor, currently used by professionals and amateurs alike. It runs on all platforms supporting Java (Mac, Windows, Linux). It can also be used as a library for your own software. SOURCES HAVE MOVED TO GITHUB: https://github.com/rosmord/jsesh
    Downloads: 9 This Week
  • 24
    JUSH is a syntax highlighting component written in JavaScript. It highlights HTML, CSS, JS, PHP and SQL code embedded into each other. Besides syntax highlighting, it provides links to the documentation for all supported languages.
    Downloads: 0 This Week
  • 25
    cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techniques. A command-line executable is shipped that allows sorting documents by codepage.
    Downloads: 6 This Week