Open Source Linux Text Processing Software - Page 2

Text Processing Software for Linux

  • 1
    FAR - Find And Replace
    Search and replace operations on file content across multiple files, including recursive operations within entire directory trees. FAR supports multi-line regular expressions (regex), automatic backup, and various character encodings. Run grep-like extractions to condense or rearrange sources, or perform bulk file renaming.
    Downloads: 37 This Week
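The combination FAR describes (a multi-line regex applied recursively over a directory tree, with a backup of each changed file) can be sketched in a few lines of Python. This is only an illustration of the technique, not FAR's implementation: the function name `replace_in_tree`, the `.bak` suffix, and the default `*.txt` filter are all assumptions made for the example.

```python
import re
import shutil
from pathlib import Path


def replace_in_tree(root, pattern, replacement, glob="*.txt", encoding="utf-8"):
    """Apply a multi-line regex replacement to every matching file under root.

    Hypothetical helper for illustration; returns the list of changed files.
    """
    # DOTALL lets "." cross line breaks; MULTILINE makes ^ and $ match per line.
    regex = re.compile(pattern, re.MULTILINE | re.DOTALL)
    changed = []
    for path in sorted(Path(root).rglob(glob)):
        if not path.is_file():
            continue
        text = path.read_text(encoding=encoding)
        new_text, count = regex.subn(replacement, text)
        if count:
            # Keep a copy of the original next to the file before overwriting it.
            shutil.copy2(path, path.with_name(path.name + ".bak"))
            path.write_text(new_text, encoding=encoding)
            changed.append(path)
    return changed
```

Calling `replace_in_tree("docs", r"begin\n.*?\nend", "begin\nnew\nend")` would rewrite every `begin ... end` block in the `.txt` files below `docs` and leave a `.bak` copy beside each modified file.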
  • 2
    Ada Class Library

    Ada Class Library - an object-oriented library for Ada.

    Text search and replace. Scripting (small tool programs). CGI scripts. Execution of external programs (incl. I/O redirection). Garbage collection. Extended Booch Components. CD-Recorder.
    Downloads: 165 This Week
  • 3
    AutoText

    Text expander (auto-typer) application

    AutoText is a text expander (auto-typer) application that reduces the time you spend typing phrases that you often use. You can bind an abbreviation (autotext) to a text snippet, phrase, or sentence, and AutoText will type that text automatically each time you type the abbreviation. Pressing, holding, and toggling keyboard keys is supported through macros. AutoText works in any Windows program.
    Downloads: 51 This Week
  • 4
    OOoFBTools

    Open/Libre Office extension for converting eBooks in fb2 format

    Open/Libre Office extension for converting and processing eBooks in FictionBook2 format, with a validator. Apache OpenOffice Extensions page: http://extensions.openoffice.org/en/project/ooo-fbtools Libre Office Extensions page: http://extensions.libreoffice.org/extension-center/fbtools Attention! Developers are invited! In recent years dikbsd has found it hard to keep up even with routine maintenance, and would like to hand the project over to good hands, with a transfer of experience, before it reaches a crisis.
    Downloads: 34 This Week
  • 5
    The goal of this tool is to simplify and accelerate the process of creating bookmarks for DjVu and PDF documents. You can see additional information on the project page.
    Downloads: 27 This Week
  • 6
    The DITA Open Toolkit is an implementation of the OASIS DITA XML Specification. The Toolkit transforms DITA content into many deliverable formats. See https://www.dita-ot.org/ for documentation and links to downloads. The source code and issue trackers have been moved to https://github.com/dita-ot/dita-ot
    Downloads: 24 This Week
  • 7
    Vim provides a rich set of tools that make generating LaTeX easy, pain-free, and quite pleasurable. This website aims to bring together the tools the Vim community has produced over the years into a central repository.
    Downloads: 25 This Week
  • 8
    Pdftohtml is a tool based on the Xpdf package that translates PDF documents into HTML format.
    Downloads: 23 This Week
  • 9
    FCKeditor

    FCKeditor (retired)

    FCKeditor is the previous version of CKEditor and has been discontinued after version 2. The new CKEditor is redesigned from the ground up, offering more WYSIWYG text editing features, enhanced security, and better integration. There is no need to stay with the retro FCKeditor: switch to the new, cool CKEditor at ckeditor.com
    Downloads: 16 This Week
  • 10
    HarfBuzz

    Open source text shaping engine

    HarfBuzz is an open source text-shaping engine with a C API that turns fonts and strings of character codes into a form that is correctly arranged for the corresponding language and writing system. This is essentially the process of text shaping: translating a string of character codes into a properly arranged sequence of glyphs that can be rendered onto a screen or into final output form for inclusion in a document. This shaping depends on a number of factors: the input string, the active font, the script (or writing system) of the string, and the string's language. Various font formats have their own set of standard text-shaping rules. With HarfBuzz, you can properly shape all the major writing systems. HarfBuzz is cross-platform and supports all major software platforms and font formats.
    Downloads: 3 This Week
  • 11
    This program makes PNG and/or GIF graphics from DVI files. It produces high-quality images while its internals are tuned for speed. It supports PK, VF, PostScript and TrueType fonts, color, PostScript inclusion, bitmap inclusion, and has CJK support
    Downloads: 93 This Week
  • 12
    GATE
    Note that the source code and issue tracker have now moved to GitHub (https://github.com/GateNLP/). GATE (General Architecture for Text Engineering) is an architecture, framework, and development environment for developing, evaluating, and embedding Human Language Technology. See http://gate.ac.uk for full details.
    Downloads: 14 This Week
  • 13
    ChordSmith

    Chordpro editor that can display, transpose and print song sheets.

    ChordSmith is a chordpro editor that can display, transpose, and print song sheets containing chords and lyrics. It can also edit and convert song sheets (including harmonica tabs) between chordpro format (chords in square brackets in line with the lyrics) and two-line format (chords above the lyrics). You can find many free sources of song sheets in both formats on the Internet. More information at https://chordsmith.sourceforge.io/
    Downloads: 67 This Week
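The two song sheet formats mentioned in the ChordSmith entry differ only in where the chords sit, so the conversion from chordpro to two-line format can be sketched briefly. The following is a minimal illustration of that idea, not ChordSmith's own code; the function name `chordpro_to_two_line` is hypothetical, and the sketch handles a single line and ignores chordpro directives such as `{title: ...}`.

```python
import re
from typing import Tuple

# A chord is any text enclosed in square brackets, e.g. [C] or [F#m7].
CHORD = re.compile(r"\[([^\]]+)\]")


def chordpro_to_two_line(line: str) -> Tuple[str, str]:
    """Split one chordpro line into a (chords, lyrics) pair of text lines."""
    chords = ""
    lyrics = ""
    pos = 0
    for match in CHORD.finditer(line):
        # Copy the lyric text that precedes this chord.
        lyrics += line[pos:match.start()]
        # If earlier chords already reach this column, separate with one space.
        if chords and len(chords) >= len(lyrics):
            chords += " "
        # Otherwise pad so the chord starts above the next lyric character.
        chords = chords.ljust(len(lyrics)) + match.group(1)
        pos = match.end()
    lyrics += line[pos:]
    return chords, lyrics
```

For the input `[C]Twinkle twinkle [F]little [C]star`, the function returns a chord line with `C`, `F`, and `C` positioned above the first letters of "Twinkle", "little", and "star", plus the lyric line `Twinkle twinkle little star`.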
  • 14
    Madedit-Mod

    MadEdit-Mod is a cross platform Text/Hex editor based on MadEdit

    Madedit-Mod is a cross-platform text/hex editor based on MadEdit, with a lot of critical bug fixes from me and other developers. Many new features have been added, such as drag-and-drop editing (cross-platform), word highlighting, etc. I maintain this project because the author of MadEdit had not worked on it for a long time, and I really like it and need more features. Find more information on the Wiki pages. Currently supported languages: English, Chinese Simplified (zh_CN), Chinese Traditional (zh_TW), German (de_DE) (not done), Greek (el), Italian (it_IT) (not done), Japanese (ja_JP), Polish (pl_PL) (not done), Russian (ru_RU) (not done), Spanish (es) (not done). You can help add more translations for MadEdit-Mod using Poedit (https://poedit.net).
    Downloads: 31 This Week
  • 15
    tika-python

    Python binding to the Apache Tika™ REST services

    A Python port of the Apache Tika library that makes Tika available using the Tika REST Server. This makes Apache Tika available as a Python library, installable via Setuptools, Pip, and Easy Install. To use this library, you need Java 7+ installed on your system, because tika-python starts the Tika REST server in the background. To get this working in a disconnected environment, download a tika server file (both tika-server.jar and tika-server.jar.md5, which can be found here) and set the TIKA_SERVER_JAR environment variable to TIKA_SERVER_JAR="file:////tika-server.jar", which tells python-tika to "download" this file, move it to /tmp/tika-server.jar, and run it as a background process. This is the only way to run python-tika without internet access. Without this set, the default is to check the Tika version and pull the latest from Apache every time.
    Downloads: 2 This Week
  • 16
    JSesh is an ancient Egyptian hieroglyphic text processor, currently used by professionals and amateurs alike. It runs on all platforms supporting Java (Mac, Windows, Linux). It can also be used as a library in your own software. Sources have moved to GitHub: https://github.com/rosmord/jsesh
    Downloads: 13 This Week
  • 17
    Vrapper

    Vim-like editing in Eclipse

    Vrapper is an Eclipse plugin that acts as a wrapper for existing Eclipse text editors to provide a Vim-like input scheme for moving around and editing text. Eclipse Update Site: http://vrapper.sourceforge.net/update-site/stable
    Downloads: 8 This Week
  • 18
    pgfplots - a TeX package to draw normal and/or logarithmic plots directly in TeX in two and three dimensions with a user-friendly interface, and pgfplotstable - a TeX package to round and format numerical tables. Examples are in the manuals and/or on the website. Attention: as of February 2020, development has moved to GitHub (https://github.com/pgf-tikz/pgfplots). Please refer to the new page for downloads and change requests. This page is kept in read-only mode.
    Downloads: 9 This Week
  • 19
    Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 7 This Week
  • 20
    Regular Expression Editor (RegExpEditor)

    regex as a tool, not as a problem

    Regular expressions (aka regex, regexp) made easy. This simple tool manipulates text with regular expressions and highlights the matches. See the real power of regex! Use Scala to manipulate your search results even further.
    Downloads: 9 This Week
  • 21
    LaTeX-Mk is a collection of makefile fragments for managing small to large LaTeX-based documentation projects. The idea is that, especially for large documents, many steps may be required to typeset the document (export modified figures to postscr
    Downloads: 21 This Week
  • 22
    Downloads: 37 This Week
  • 23
    Colorer Library
    Colorer provides syntax highlighting services for source text. It colorizes source code in editor systems (more than 200 syntaxes). It uses the powerful HRC format (XML, regular expressions, context-free grammars), which allows it to support any language. Available as an Eclipse plugin.
    Downloads: 7 This Week
  • 24
    xmltoman and xmlmantohtml are two small scripts that convert XML to man pages in groff format or to HTML. They support the usual man page items such as "description", "options", "see also", etc.
    Downloads: 32 This Week
  • 25
    ANTLR

    Parser generator to read, process, or translate structured text

    ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files. From a grammar, ANTLR generates a parser that can build and walk parse trees. It is widely used in academia and industry to build all sorts of languages, tools, and frameworks. Twitter search uses ANTLR for query parsing, with over 2 billion queries a day. The languages for Hive and Pig, the data warehouse and analysis systems for Hadoop, both use ANTLR. Lex Machina uses ANTLR for information extraction from legal texts. Oracle uses ANTLR within the SQL Developer IDE and its migration tools. NetBeans IDE parses C++ with ANTLR. The HQL language in the Hibernate object-relational mapping framework is built with ANTLR.
    Downloads: 1 This Week