Open Source Python Text Processing Software

Python Text Processing Software

View 91 business solutions

Browse free open source Python Text Processing Software and projects below. Use the toggles on the left to filter open source Python Text Processing Software by OS, license, language, programming language, and project status.

  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Scribus

    Scribus

    Powerful desktop publishing software

    Scribus is an Open Source program that brings professional page layout to Linux, BSD UNIX, Solaris, OpenIndiana, GNU/Hurd, Mac OS X, OS/2 Warp 4, eComStation, and Windows desktops with a combination of press-ready output and new approaches to page design. Underneath a modern and user-friendly interface, Scribus supports professional publishing features, such as color separations, CMYK and spot colors, ICC color management, and versatile PDF creation.
    Leader badge
    Downloads: 13,774 This Week
    Last Update:
    See Project
  • 2
    Notepad++ Python Script

    Notepad++ Python Script

    A Python Scripting plugin for Notepad++

    A Python Scripting plugin for Notepad++. Complete easy script access to all of the editor's features (including absolutely everything in Scintilla). Configurable menus and toolbar options, assign shortcuts to scripts.
    Leader badge
    Downloads: 450 This Week
    Last Update:
    See Project
  • 3
    Utilities for general- and special-purpose documentation. Includes reStructuredText, the easy to read, easy to use, what-you-see-is-what-you-get plaintext markup language.
    Leader badge
    Downloads: 130 This Week
    Last Update:
    See Project
  • 4
    Diffuse
    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
    Leader badge
    Downloads: 124 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Leader badge
    Downloads: 41 This Week
    Last Update:
    See Project
  • 6
    meld-installer

    meld-installer

    Meld Installer for Windows

    Bundles Portable Python (with PyGTK) and Meld together in an easy to use installer. This allows you to not have to worry about setting up Python or PyGTK and you can keep Meld's Python separate from other Python installations on your machine. ** NOTE ** Meld 3.11 and later now have official installers, hence this project is no longer supported. You can download the new installer here: https://download.gnome.org/binaries/win32/meld/. You should uninstall the old 1.8 version before upgrading.
    Downloads: 28 This Week
    Last Update:
    See Project
  • 7
    Tomoe is a handwriting character recognition engine.
    Downloads: 34 This Week
    Last Update:
    See Project
  • 8
    TeXML is an XML vocabulary for TeX. The processor transforms the TeXML markup into the TeX markup, escaping special and out-of-encoding characters. The intended audience is developers who automatically generate [La]TeX or ConTeXt files.
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 9
    DrPython is a highly customizable cross-platform ide to aid programming in Python. It was developed with teaching in mind, and has a clean, simple interface. It is written in Python, using wxPython as the gui.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    EpiDoc: Epigraphic Documents in TEI XML

    EpiDoc: Epigraphic Documents in TEI XML

    XML text markup for ancient documents

    The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri. The link below will take you to the EpiDoc home page on this site.
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    PyRtfLib is a python library that provides a parser and few translators like rtf to html and to simple text.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    Tarjamento de Dados Pessoais e Sigilosos

    Tarjamento de Dados Pessoais e Sigilosos

    Ferramenta de Tarjamento de Dados Pessoais e Sigilosos

    TarjaPDF v2.0 Beta — Ferramenta de Tarjamento de Dados Pessoais e Sigilosos Proteja dados sensíveis em PDFs com segurança irreversível. Interface moderna com dark mode, marcação manual (texto, linha e área livre), detecção automática de CPF, RG, e-mail, telefone, nomes próprios e endereços. Escaneamento inteligente com análise preditiva: destaca dados pessoais para revisão antes de tarjar. Detecção de nomes via heurística e base oficial, com dicionário customizável. Relatório de conformidade LGPD após cada operação. Security by Design: salva exclusivamente como PDF-imagem, impossibilitando recuperação dos dados tarjados. Novo na versão 2.0: marcação por área, buscar e tarjar, scan preditivo com clique para tarjar blocos, menu de contexto e gerenciador de nomes. ⚠️ Alguns antivírus podem gerar falso positivo devido ao empacotamento com PyInstaller. O software é seguro. Ao instalar, adicione exceção no antivírus ou desabilite temporariamente a proteção em tempo real.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13
    PyRTF is a pure python module for the efficient creation of RTF documents.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    DocLite is a simple documentation authoring system, it produces multi-page HTML output (like this document) in a style similar to that found in the Linux HOTWTOs or other DocBook created documents.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    pyfiglet is a full port of the FIGlet specification (http://www.figlet.org/) into pure python. It takes ASCII text and renders it in ASCII art fonts. It can be used on the commandline or as an Object Oriented driver library in your own programs.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    Alphabetizer

    Alphabetizer

    Take a list of words or sentences and arrange them alphabetically.

    Alphabetizer lets anyone take a list of words or sentences and arranged them in alphabetical order easily. Alphabetizer is a tool that takes a list of words or phrases and arranged them in alphabetical order. This tool is useful for organizing information, creating glossaries, sorting names, or any task where the items in a list need to be in alphabetical order. Overall, Alphabetizer can save time and effort by quickly organizing information and making it easier to read and comprehend.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    Buzz is a fast graphical editor for XML files with special support for OPML. Using the OPML convergence tools it will edit about any outline and many forms of indented text, including Python. In fact, Buzz was written with Buzz! It is written in P
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    VeilClip

    VeilClip

    Offline clipboard manager for Windows with history, search, and locked

    VeilClip is an open-source, offline clipboard manager for Windows 10 and Windows 11. It stores copied text, links, images, and file paths locally on your PC so you can search, pin, edit, reuse, and protect them later without a cloud account. Main features: - Clipboard history for text, links, images, and file paths - Fast search by content and source application - Favorites and pinned reusable items - Locked Notes protected by a PIN - Built-in text and image editing - Local backup, export, and import - Multi-language interface - Offline-first and privacy-focused design VeilClip uses local storage and does not require a cloud account for normal use.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    A Python-based template and view-controller framework derived from HTML::Mason. Supports the full featureset of Mason, allowing component-based web development with Python-embedded HTML, and includes many new concepts and features not found in Mason.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Queequeg is an English grammar checker for non-native English speakers.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    TextLite is meant to become a text editor, quick, fast, and simple, dedicated to the generation of plain text ASCII files.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    xml2txt is a text formatter for XMl in the same way the FO is a PDF formatter. It uses python to convert an XML document to well-formatted text, wtih borders, indents, and tables.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    wordaxe (formerly deco-cow): A hyphenation library for Python. Several hyphenation algorithms: - the pattern-based from TeX/OOO, - by decomposition of compound words for German language. Includes support for paragraph line-breaking with ReportLab.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Project is dead.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A simple Gedit Plugin which enables a markup syntax to produce extended characters. 59 defaults but extra character support can be added easily making this plugin highly configurable.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB