Open Source Linux Text Processing Software - Page 7

Text Processing Software for Linux

  • 1
    HTML template library written in C, inspired by Perl's HTML::Template. The template language has HTML-like tags (tmpl_var, tmpl_if, tmpl_loop, etc.). Use the library to build a variable list and pass it to a template.
    Downloads: 1 This Week
  • 2

    DWDS/Dialing Concordance

    A collection of indexing and search tools for corpus linguists

    DWDS/Dialing Concordance (DDC) - a collection of indexing and search tools for corpus linguists
    Downloads: 1 This Week
  • 3
    WYSIWYG .NET

    WYSIWYG html editor for .NET (C#, VB.NET)

    WYSIWYG .NET editor is an HTML editor that attempts to display the web page as it will appear in the browser. It's a visual editor, so you don't manipulate the code directly.
    Downloads: 1 This Week
  • 4
    EBookGenTools

    EBook Generation Tools - scripts to create ebook formats EPUB, DOC

    EBookGenTools is a set of GNU/Linux shell scripts to process plain text for a book into HTML and electronic book formats. It was developed to create EPUB and DOC files from book text exported from novel writing software such as Manuskript, StoryBook, or your favourite text editor. EBookGenTools builds on the power of other software to create the following ebook formats:
    - EPUB: Calibre - ebook management
    - DOC: LibreOffice - free office suite
    These tools can be used directly to create ebooks. The advantage provided by EBookGenTools is to automate the process, thus saving an author time when creating and recreating ebook formats. EBookGenTools is a shortened form for Electronic Book Generation Tools. For more details see: https://sourceforge.net/p/ebookgentools/code/ci/master/tree/README.md
    Downloads: 2 This Week
  • 5
    This Java program counts the number of words in a LaTeX file. All LaTeX commands are supported. The table of contents and other tables, page numbers, and page headers are not counted, because lwc counts from the source file.
    Downloads: 2 This Week
  • 6
    Mateusz's Saucy Editor

    A simple console-mode text editor

    Mateusz's Saucy Editor (MSEDIT) is a simple editor working in the DOS environment. I decided to write it because I had been using Microsoft's EDIT editor for years without finding any free alternative. Of course, there are plenty of free DOS editors out there, but none has ever matched my expectations (or should I say, my taste). I guess that's because I had already been "formatted" by the MS editor :-) You will probably notice that MSEDIT is very similar to Microsoft's EDIT. That's a pure coincidence, and there is no connection at all between the two.
    Downloads: 2 This Week
  • 7
    A simple application to transform XML into a CSV-like file.
    Downloads: 2 This Week
  • 8
    OOoLatex is no longer maintained. Please consider using TexMaths (http://roland65.free.fr/texmaths/). OOoLatex is a set of macros designed to provide LaTeX support in OpenOffice. Complex equations can be inserted as images, with the LaTeX code saved in the image attribute, while simpler equations are expanded into symbol characters to be inserted as text.
    Downloads: 2 This Week
  • 9
    PDF::API2 is 'The Next Generation' of Text::PDF::API, a Perl module-chain that facilitates the creation and modification of PDF files. It features support for the 14 base PDF Core Fonts, TrueType fonts, and Adobe-Type1, with Unicode mappings, embedding o
    Downloads: 2 This Week
  • 10
    An XHTML to PDF converter: with this library, you can transform simple XHTML pages into nice, printable PDF files. This project is based on the excellent webzine article "Pdfizer, a dumb HTML to PDF converter, in C#" written by Jonathan de Halleux.
    Downloads: 2 This Week
  • 11
    Site map generator compatible with Google, with a graphical user interface. Build a map of your site in five minutes and register the generated file with Google so that your whole site can be indexed.
    Downloads: 2 This Week
  • 12
    A Perl script that splits a long HTML file into separate inter-linked pages, according to the headings in the original file. Useful for maintaining both a print version and a browsable version of a site.
    Downloads: 2 This Week
  • 13
    This project is now part of Gnome Subtitles (http://gnomesubtitles.org). SubLib was a library that eases the development of subtitling applications. It supports the most common text-based subtitle formats and allows for subtitle editing, conversion and synchronization.
    Downloads: 2 This Week
  • 14
    TeXML is an XML vocabulary for TeX. The processor transforms the TeXML markup into the TeX markup, escaping special and out-of-encoding characters. The intended audience is developers who automatically generate [La]TeX or ConTeXt files.
    Downloads: 2 This Week
  • 15
    TextPaint

    Text editing and painting software

    This is text editing and painting software. You can edit text files in our application.
    Downloads: 2 This Week
  • 16
    Thunderpad (formerly Textpad)

    Simple and lightweight text editor

    Thunderpad is a simple, general-purpose, cross-platform text editor written in C++ using the Qt libraries. Thunderpad aims to be faster and more lightweight than most text editors while remaining just as useful. Additionally, Thunderpad supports syntax highlighting, word count, and line count, and comes with various color schemes.
    Downloads: 2 This Week
  • 17
    WikiPDF is a MediaWiki extension based on Wiki2PDF that adds PDF/LaTeX features to MediaWiki. Wiki2PDF is a Python script that converts multiple articles of a MediaWiki-based wiki (pre-configured for use with www.wikipedia.org) into a single LaTeX or PDF file.
    Downloads: 2 This Week
  • 18
    Ncurses based hex editor with vi/vim-like interface. Features include large file support, search highlight, multiple undo/redo, visual select, cut/paste, blob coloring, file tabs, and much more.
    Downloads: 2 This Week
  • 19
    mbFXWords

    Analyze text. Diagonal read subject, predicate, obj. Search other pdf.

    Version 1.04. Applies and builds upon Apache OpenNLP. For English, French and German files. JavaFX application; runs with Oracle Java Runtime Environment version 8, which includes JavaFX. NLP extensions:
    - Divide sentences into subclauses: segmentation.
    - Divide plain text: subject, predicate, object.
    - Count words: stemming.
    - Search for similar content: PDFs.
    Outputs the subject, predicate and object of sentences in PDF and plain text files. Provides a comfortable GUI. Automatic language detection.
    Downloads: 2 This Week
  • 20
    pyfiglet is a full port of the FIGlet specification (http://www.figlet.org/) into pure Python. It takes ASCII text and renders it in ASCII art fonts. It can be used on the command line or as an object-oriented driver library in your own programs.
    Downloads: 2 This Week
  • 21
    This is a text editor with the option to save content as a PDF document. It can also read existing .rtf documents and render them in the editor. These can then be saved as PDF, thereby providing a converter from RTF to PDF format.
    Downloads: 2 This Week
  • 22
    A freely-available Markdown text-to-HTML translator, written in C++, intended for integration into C++ programs rather than for use in web applications.
    Downloads: 1 This Week
  • 23
    For the latest version, please download from: http://www.splashportal.net/Editor
    Downloads: 1 This Week
  • 24
    Emerald Text Editor (jEditor)

    Emerald Text Editor is a tabbed text editor with heavy customizability

    Emerald Text Editor (Emerald Editor, or Emerald as I call it), formerly called jEditor, is a text editor that is similar to Notepad in that it lets you edit text, but it uses tabbed panes, so you can have multiple tabs open and edit several files at once. Emerald Text Editor also comes with a toolbar that tells you how quickly you are typing and how many characters are in your current document. The program is also customizable, meaning you can edit some of its main features. The name was changed to fit a future naming scheme I'm going to have.
    Downloads: 1 This Week
  • 25
    Simple Java delimited and fixed width file parser. Handles CSV, Excel CSV, Tab, Pipe delimiters, just to name a few. Maps column positions in the file to user friendly names via XML. See "FlatPack Feature List" under News for complete feature list.
    Downloads: 1 This Week
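
The TeXML entry above (item 14) mentions escaping special characters when generating TeX markup. As a rough illustration of that idea only, here is a minimal Python sketch. The function name `escape_tex` and the mapping table are hypothetical and are not taken from TeXML's code or API; the sketch covers only the ten characters TeX reserves and does not attempt the out-of-encoding handling the entry also mentions.

```python
# Hypothetical helper for illustration; NOT TeXML's actual implementation.
# Maps each character that TeX treats as markup to a safe replacement.
TEX_SPECIALS = {
    "\\": r"\textbackslash{}",
    "{": r"\{",
    "}": r"\}",
    "$": r"\$",
    "&": r"\&",
    "#": r"\#",
    "%": r"\%",
    "_": r"\_",
    "^": r"\textasciicircum{}",
    "~": r"\textasciitilde{}",
}


def escape_tex(text: str) -> str:
    """Return text with TeX's reserved characters escaped."""
    # Work character by character so that backslashes and braces
    # introduced by one replacement are never escaped a second time.
    return "".join(TEX_SPECIALS.get(ch, ch) for ch in text)


if __name__ == "__main__":
    print(escape_tex("50% of $10 & more_"))
```

A single pass over the characters is used instead of chained `str.replace` calls, because chained replacements would re-escape the backslashes inserted by earlier steps.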