Open Source Linux Text Processing Software - Page 6

Text Processing Software for Linux

  • 1
    Utility to create a text table from delimited text input.
    Downloads: 3 This Week
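As a rough illustration of what a utility like this does (not the project's own code, which the listing does not show), the following Python sketch pads delimited fields into an aligned text table. The function name `format_table`, the comma default delimiter, and the border style are all assumptions made for the example.

```python
def format_table(text: str, delimiter: str = ",") -> str:
    """Render delimited text as a bordered, column-aligned text table."""
    # Split each non-empty line into fields (hypothetical input convention).
    rows = [line.split(delimiter) for line in text.splitlines() if line.strip()]
    if not rows:
        return ""
    # Pad short rows so every row has the same number of columns.
    ncols = max(len(row) for row in rows)
    rows = [[cell.strip() for cell in row] + [""] * (ncols - len(row)) for row in rows]
    # Each column is as wide as its longest cell.
    widths = [max(len(row[i]) for row in rows) for i in range(ncols)]
    rule = "+" + "+".join("-" * (w + 2) for w in widths) + "+"
    body = [
        "|" + "|".join(f" {cell.ljust(w)} " for cell, w in zip(row, widths)) + "|"
        for row in rows
    ]
    return "\n".join([rule, *body, rule])


if __name__ == "__main__":
    print(format_table("name,size\nlout,12\nxvnkb,3"))
```

Computing the column widths in a separate pass keeps the formatting step simple, at the cost of holding the whole input in memory.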
  • 2
    tgen generates a Web site from a collection of input files of a variety of types, using a set of registered HTML autogenerators. Cvs-Brancher allows scheduling of web deployments. vwebedit provides web-based editing of cvs repositories.
    Downloads: 3 This Week
  • 3
    xvnkb is a Vietnamese keyboard input program for X Window. It provides a useful way of editing Vietnamese in X Window environments with popular input methods and charsets. It runs as a small X application in the background without any interference with o
    Downloads: 3 This Week
  • 4

    DWDS/Dialing Concordance

    a collection of indexing and search tools for corpus linguists

    DWDS/Dialing Concordance (DDC) - a collection of index and search tools for corpus linguists
    Downloads: 1 This Week
  • 5
    Lout is a batch document formatter. It reads a high-level description of a document similar in style to LaTeX and produces a PostScript file which can be printed on most laser printers. Plain text and PDF output are also available.
    Downloads: 1 This Week
  • 6
    RTextDoc

    An editor for structured documents

    RTextDoc is an editor for structured text documents such as LaTeX, AsciiDoc and DocBook. It has proofreading capabilities: on-the-fly spelling, instant grammar checking and built-in free dictionaries. It also offers syntax highlighting, bracket matching, folding, a document structure browser for sections and labels, bookmarks, a manager for LaTeX symbols, an editor for mathematical equations, an integrated BibTeX database manager and several tools to convert LaTeX to HTML and back. AsciiDoc files can be converted to DocBook, HTML and PDF files.
    Downloads: 1 This Week
  • 7

    MindRaider

    MindRaider is a personal notebook and outliner.

    MindRaider is a personal notebook and outliner. Where do you keep private remarks like ideas, plans, gift tips and howtos? Loads of documents and remarks spread around the file system? Can you find a remark when you need it? No? Try MindRaider!
    Downloads: 1 This Week
  • 8
    AESTextCrypt

    Encrypt and decrypt text using 256-bit AES encryption

    AESTextCrypt is an easy-to-use open source tool for text encryption and decryption. It is primarily intended for use with email, but you can use it wherever you need to protect text from prying eyes. The encrypted text can be copied and pasted into any text-handling application (e.g. email) in place of plain text. Convenience buttons are provided for clipboard operations. AESTextCrypt uses AES with a 256-bit key, the largest key size the AES standard defines. It also employs bcrypt, which implements key stretching and an adaptive key setup phase whose complexity (number of rounds) is set automatically to match the processing power of the encrypting computer. This makes it highly resistant to dictionary attacks. AESTextCrypt is written in Java, so it runs on all desktop platforms: Windows, Mac and Linux.
    Downloads: 2 This Week
  • 9
    Babeldoc is an integration tool that can plumb together data flows. It is completely configurable and scriptable. It is heavily XML-biased, but not exclusively so.
    Downloads: 2 This Week
  • 10

    Change File Encoding

    Change encoding of text files.

    Change File Encoding is a utility that allows you to change the encoding of text files. For example, files saved in US-ASCII can be converted to UTF-8. Over 170 encodings are supported. Requires Java 1.8 or higher.
    Downloads: 2 This Week
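The US-ASCII to UTF-8 conversion mentioned above comes down to decoding bytes with the source encoding and re-encoding them with the target encoding. The Python sketch below shows that idea only; it is not taken from Change File Encoding (a Java tool), and the names `transcode` and `convert_file` are hypothetical.

```python
from pathlib import Path


def transcode(data: bytes, source: str, target: str) -> bytes:
    """Decode bytes from the source encoding and re-encode them in the target."""
    # Raises UnicodeDecodeError / UnicodeEncodeError if the data does not fit,
    # which is safer than silently replacing characters.
    return data.decode(source).encode(target)


def convert_file(src: Path, dst: Path, source: str, target: str) -> None:
    """Write a copy of src to dst with its text re-encoded (hypothetical helper)."""
    dst.write_bytes(transcode(src.read_bytes(), source, target))


if __name__ == "__main__":
    # "é" is one byte in Latin-1 and two bytes in UTF-8.
    print(transcode("café".encode("latin-1"), "latin-1", "utf-8"))
```

Working on bytes rather than text-mode file handles avoids any newline translation, so only the character encoding changes.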
  • 11
    XSLT stylesheets and utilities for transforming DocBook into OpenDocument (text, presentation, ...), with support for corporate identity templates.
    Downloads: 2 This Week
  • 12
    A small Objective-C library which provides a SAX-like object-oriented interface to the Expat XML parser library.
    Downloads: 2 This Week
  • 13
    General information, and a pack of tools for manipulating the Persian (Farsi) language and script, on different platforms and operating systems.
    Downloads: 2 This Week
  • 14
    Mathematics formula renderer for Microsoft Word. Easy to use and fast. Also includes a mathematics drawing toolbar for Microsoft Word and a math exercise storage database for Microsoft Access. Available in French and English.
    Downloads: 2 This Week
  • 15
    FreeDOS Edlin is the standard line editor in the FreeDOS operating system.
    Downloads: 2 This Week
  • 16
    GitSync is a shell script designed to simplify use of the Git version control system (see www.git-scm.com for more information) by providing a "do everything to sync my repository" command.
    Downloads: 2 This Week
  • 17
    This project has moved to https://github.com/workinghard/GuitarTeX2. GuitarTeX2 is based on the idea of Chord. It takes a Chord file containing Chordpro directives and produces good-looking, easy-to-play song sheets for guitarists in PostScript or PDF format. GuitarTeX2 is a further development of GuitarTeX.
    Downloads: 2 This Week
  • 18
    Downloadable and open source Chinese-Spanish vocabulary inspired by the CEDICT and EDICT dictionaries. It is distributed in a plain Unicode text file that can be easily ported to other formats or used by different applications.
    Downloads: 2 This Week
  • 19
    J2ME Memopad is a simple MIDP application designed to allow storage and retrieval of notes. It will have the ability to search and generate a list of results, as well as categorize your memos. The basic design of the memopad is similar to the Palm.
    Downloads: 2 This Week
  • 20

    Khawas

    An Arabic Corpora Processing Tool

    The new version is available at https://sourceforge.net/projects/ghawwasv4/
    Downloads: 2 This Week
  • 21
    A collection of tools and resources for using Bangla on computers.
    Downloads: 2 This Week
  • 22
    A run-time configurable character stream tokenizer that allows the user to define token classes via regular expressions. The developer is not limited to predefined notions of whitespace, commenting, or word modalities.
    Downloads: 2 This Week
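To make the idea of run-time token classes concrete, here is a minimal Python sketch of a tokenizer whose classes are supplied by the caller as (name, regular expression) pairs. It illustrates the general technique only and is not this project's API; `make_tokenizer` and the class names in the usage example are hypothetical.

```python
import re
from typing import Callable, Iterator, List, Tuple

Token = Tuple[str, str]  # (class name, matched text)


def make_tokenizer(token_classes: List[Tuple[str, str]]) -> Callable[[str], Iterator[Token]]:
    """Build a tokenizer from (name, regex) pairs supplied at run time.

    Assumes each name is a valid Python identifier and that the supplied
    regexes contain no named groups of their own. Earlier classes win ties.
    """
    pattern = re.compile("|".join(f"(?P<{name}>{regex})" for name, regex in token_classes))

    def tokenize(text: str) -> Iterator[Token]:
        pos = 0
        while pos < len(text):
            match = pattern.match(text, pos)
            # Reject unmatched input and empty matches (which would loop forever).
            if match is None or match.end() == pos:
                raise ValueError(f"no token class matches at offset {pos}")
            yield match.lastgroup, match.group()
            pos = match.end()

    return tokenize


if __name__ == "__main__":
    # Whitespace is just another class here, not a built-in notion.
    tokenize = make_tokenizer([("NUM", r"\d+"), ("WORD", r"[A-Za-z]+"), ("WS", r"\s+")])
    print(list(tokenize("abc 42")))
```

Because whitespace and comments are ordinary classes in the list, the caller decides what counts as each, which is the flexibility the description emphasizes.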
  • 23
    An XHTML to PDF converter: with this library, you can transform simple XHTML pages into nice, printable PDF files. This project is based on the excellent webzine article "Pdfizer, a dumb HTML to PDF converter, in C#" written by Jonathan de Halleux.
    Downloads: 2 This Week
  • 24
    A universal suite of utilities for large corpora processing.
    Downloads: 2 This Week
  • 25
    Queequeg is an English grammar checker for non-native English speakers.
    Downloads: 2 This Week