Open Source Linux Text Processing Software - Page 2

Text Processing Software for Linux

View 9 business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 1
    AutoText

    AutoText

    Text expander(auto-type) application

    AutoText is a text expander(auto-typer) application that can minimize time that you spent typing phrases that your often use. You can bind an abbreviation(autotext) to some text snippet, phrase or sentence and that text will be automatically typed by AutoText each time you type the abbreviation. Keyboard keys press/hold/toggle is supported by using macros. AutoText works in any Windows program.
    Leader badge
    Downloads: 53 This Week
    Last Update:
    See Project
  • 2
    The goal of this tool is to simplify and accelerate the process of creating bookmarks for DjVu and PDF documents. You can see additional information on the project page.
    Leader badge
    Downloads: 27 This Week
    Last Update:
    See Project
  • 3
    OOoFBTools

    OOoFBTools

    Open/Libre Office extension for converting eBooks in fb2 format

    Open/Libre Office extension for converting and processing eBooks in FictionBook2 format with validator. Apache OpenOffice Extensions page: http://extensions.openoffice.org/en/project/ooo-fbtools Libre Office Extensions page: http://extensions.libreoffice.org/extension-center/fbtools Внимание! Приглашаются разработчики! dikbsd последние годы тяжело тянуть даже текущее сопровождение. И есть интерес передать проект в хорошие руки, не доводя до кризиса, а с передачей опыта.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 4
    Stanford CoreNLP

    Stanford CoreNLP

    Stanford CoreNLP, a Java suite of core NLP tools

    CoreNLP is your one stop shop for natural language processing in Java! CoreNLP enables users to derive linguistic annotations for text, including token and sentence boundaries, parts of speech, named entities, numeric and time values, dependency and constituency parses, coreference, sentiment, quote attributions, and relations. CoreNLP currently supports 6 languages, Arabic, Chinese, English, French, German, and Spanish. The centerpiece of CoreNLP is the pipeline. Pipelines take in raw text, run a series of NLP annotators on the text, and produce a final set of annotations. Pipelines produce CoreDocuments, data objects that contain all of the annotation information, accessible with a simple API, and serializable to a Google Protocol Buffer. CoreNLP generates a variety of linguistic annotations, including parts of speech, named entities, dependency parses, and coreference.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    Pdftohtml is a tool based on the Xpdf package which translates pdf documents into html format.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 6
    FCKeditor

    FCKeditor

    FCKeditor (retired)

    FCKeditor is the previous version of CKEditor and has been discontinued after version 2. The new CKEditor is redesigned from the ground up, offering more WYSIWYG text editing features, enhanced security and better integration. Don’t force yourself with retro FCKeditor. Switch to the new, cool CKEditor at ckeditor.com
    Downloads: 15 This Week
    Last Update:
    See Project
  • 7
    FAR - Find And Replace
    Search and replace operations on file content accross multiple files. Recursive operations within entire directory trees. FAR comes with support for regular expressions (regex) over multiple lines, automatic backup and various character encodings. Run grep like extractions to condense or rearrange sources, or perform bulk file renaming.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 8
    HarfBuzz

    HarfBuzz

    Open source text shaping engine

    HarfBuzz is an open source text-shaping engine with a C API that turns fonts and strings of character codes into a form that is correctly arranged for the corresponding language and writing system. This is essentially the process of text shaping: translating a string of character codes into a properly arranged sequence of glyphs that can be rendered onto a screen or into final output form for inclusion in a document. This shaping depends on a number of factors: the input string, the active font, the script (or writing system) of the string, and the string's language. Various font formats have their own set of standard text-shaping rules. With Harfbuzz, you can properly shape all the major writing systems. HarfBuzz is cross-platform and supports all major software platforms and font formats.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    JSesh is an ancient Egyptian hieroglyphic text processor, currently used by professionnals and amateurs alike. It runs on all platforms supporting java (Mac, Windows, Linux). It can be used as a library for your own softwares too. SOURCES HAVE MOVED TO GITHUB: https://github.com/rosmord/jsesh
    Downloads: 18 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Ada Class Library

    Ada Class Library

    Ada Class Library - an object orientated library for Ada.

    Text search and replace. Scripting (small tool programs). CGI scripts. Execution of external programs (incl. I/O redirection). Garbage Collection. Extendended Booch Components. CD-Recorder
    Leader badge
    Downloads: 71 This Week
    Last Update:
    See Project
  • 11
    Unihanconver

    Unihanconver

    Traditional/Simplified Chinese conversion with CLI or GUI

    Tool to convert between Traditional/Simplified Chinese directly in Unicode (not GB/Big5 conversion). It is written in Perl and does not use any external libraries. It provides a command-line utility as well as a GTK+ interface for X Window.
    Leader badge
    Downloads: 21 This Week
    Last Update:
    See Project
  • 12
    Anaphraseus
    Anaphraseus is a CAT (Computer Aided Translation) tool, OpenOffice.org 2-3 macro set for OpenOffice/LibreOffice Writer similar to famous Wordfast. It works with the Wordfast Translation Memory format (*.TXT), and supports text segmentation.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 13
    Madedit-Mod

    Madedit-Mod

    MadEdit-Mod is a cross platform Text/Hex editor based on MadEdit

    Madedit-Mod is a cross platform text/hex editor base on MadEdit with a log of critical bug fix from me or other developers. A lot of new features were added, such as Drag-Drop Edit(cross platform), Highlight word, etc. The reason that I maintained this project is that the author of MadEdit had not worked on it for for a long time and I really like it and need more features. Find more information on Wiki pages. Currently supported Languages: English Chinese Simplified (zh_CN) Chinese Traditional(zh_TW) German(de_DE) (Not done) Greek(el) Italian(it_IT) (Not done) Japanese(ja_JP) Polish(pl_PL) (Not done) Russian(ru_RU) (Not done) Spanish(es) (Not done) You may help to add more translation for MadEdit-Mod by Poedit https://poedit.net
    Leader badge
    Downloads: 30 This Week
    Last Update:
    See Project
  • 14
    An Objective-C Framework for Regular Expressions using the PCRE Library for Mac OS X Cocoa and GNUstep.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 15

    Bulgarian language support

    Spell check, grammar check and hyphenation for Bulgarian language

    The goal of this project is to provide spell check, grammar check and hyphenation for Bulgarian language for Open Source products such as OpenOffice.org, LibreOffice, TeX, aspell, ispell, hunspell etc.
    Leader badge
    Downloads: 46 This Week
    Last Update:
    See Project
  • 16
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 17
    Vrapper

    Vrapper

    Vim-like editing in Eclipse

    Vrapper is an eclipse plugin which acts as a wrapper for existing eclipse text editors to provide a Vim-like input scheme for moving around and editing text. Eclipse Update Site: http://vrapper.sourceforge.net/update-site/stable
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 19
    LaTeX-Mk is a collection of makefile fragments for managing small to large LaTeX based documentation projects. The idea is that especially large documents, there may be many many steps required to typeset the document (export modified figures to postscr
    Leader badge
    Downloads: 24 This Week
    Last Update:
    See Project
  • 20
    ChordSmith

    ChordSmith

    Chordpro editor that can display, transpose and print song sheets.

    ChordSmith is a chordpro editor that can display, transpose and print song sheets containing chords and lyrics. It can also edit and convert song sheet formats (including Harmonica tabs) between chordpro format (chords in square brackets in line with lyrics) and two-line format (chords above lyrics). You can find many free sources of song sheets in both formats on the Internet. More information at https://chordsmith.sourceforge.io/ Here are just a few of ChordSmith's many features:
    Downloads: 41 This Week
    Last Update:
    See Project
  • 21
    xmltoman and xmlmantohtml are two small scripts to convert xml to man pages in groff format or html. It features the usual man page items such a "description", "options", "see also" etc.
    Leader badge
    Downloads: 40 This Week
    Last Update:
    See Project
  • 22
    Vim provides a rich set of tools which makes generating latex easy, pain-free and quite pleasurable. This web-site aims at bringing together the rich set of tools the vim community has produced over the years into a central repository
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    pgfplots - A TeX package to draw normal and/or logarithmic plots directly in TeX in two and three dimensions with a user-friendly interface and pgfplotstable - a TeX package to round and format numerical tables. Examples in manuals and/or on web site. ATTENTION: As of February 2020, the development has been moved to https://github.com/pgf-tikz/pgfplots ! Please refer to the new page for downloads and change requests! This page is kept in read-only mode.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 24
    This program makes PNG and/or GIF graphics from DVI files. It produces high-quality images while its internals are tuned for speed. It supports PK, VF, PostScript and TrueType fonts, color, PostScript inclusion, bitmap inclusion, and has CJK support
    Leader badge
    Downloads: 42 This Week
    Last Update:
    See Project
  • 25
    Calc2LaTeX is an OpenOffice.org Calc (Spreadsheet) macro for converting tables. It makes making tables on LaTeX very easy.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB