Open Source Python Text Processing Software

Python Text Processing Software

View 91 business solutions

Browse free open source Python Text Processing Software and projects below. Use the toggles on the left to filter open source Python Text Processing Software by OS, license, language, programming language, and project status.

  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    Scribus

    Scribus

    Powerful desktop publishing software

    Scribus is an Open Source program that brings professional page layout to Linux, BSD UNIX, Solaris, OpenIndiana, GNU/Hurd, Mac OS X, OS/2 Warp 4, eComStation, and Windows desktops with a combination of press-ready output and new approaches to page design. Underneath a modern and user-friendly interface, Scribus supports professional publishing features, such as color separations, CMYK and spot colors, ICC color management, and versatile PDF creation.
    Leader badge
    Downloads: 15,386 This Week
    Last Update:
    See Project
  • 2
    Notepad++ Python Script

    Notepad++ Python Script

    A Python Scripting plugin for Notepad++

    A Python Scripting plugin for Notepad++. Complete easy script access to all of the editor's features (including absolutely everything in Scintilla). Configurable menus and toolbar options, assign shortcuts to scripts.
    Leader badge
    Downloads: 437 This Week
    Last Update:
    See Project
  • 3
    Diffuse
    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
    Leader badge
    Downloads: 164 This Week
    Last Update:
    See Project
  • 4
    Utilities for general- and special-purpose documentation. Includes reStructuredText, the easy to read, easy to use, what-you-see-is-what-you-get plaintext markup language.
    Leader badge
    Downloads: 110 This Week
    Last Update:
    See Project
  • No-Nonsense Code-to-Cloud Security for Devs | Aikido Icon
    No-Nonsense Code-to-Cloud Security for Devs | Aikido

    Connect your GitHub, GitLab, Bitbucket, or Azure DevOps account to start scanning your repos for free.

    Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.
    Start for Free
  • 5
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Downloads: 42 This Week
    Last Update:
    See Project
  • 6
    meld-installer

    meld-installer

    Meld Installer for Windows

    Bundles Portable Python (with PyGTK) and Meld together in an easy to use installer. This allows you to not have to worry about setting up Python or PyGTK and you can keep Meld's Python separate from other Python installations on your machine. ** NOTE ** Meld 3.11 and later now have official installers, hence this project is no longer supported. You can download the new installer here: https://download.gnome.org/binaries/win32/meld/. You should uninstall the old 1.8 version before upgrading.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 7
    DrPython is a highly customizable cross-platform ide to aid programming in Python. It was developed with teaching in mind, and has a clean, simple interface. It is written in Python, using wxPython as the gui.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 8
    bsed

    bsed

    Simple SQL-like syntax on top of Perl text processing

    bsed is a stream editor that offers a simple SQL-like syntax for text processing tasks. Designed to replace basic uses of tools like sed, grep, AWK, and Perl, it allows users to perform complex text manipulations with intuitive commands.​
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    PyRtfLib is a python library that provides a parser and few translators like rtf to html and to simple text.
    Downloads: 26 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Tomoe is a handwriting character recognition engine.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 11
    EpiDoc: Epigraphic Documents in TEI XML

    EpiDoc: Epigraphic Documents in TEI XML

    XML text markup for ancient documents

    The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri. The link below will take you to the EpiDoc home page on this site.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    DocLite is a simple documentation authoring system, it produces multi-page HTML output (like this document) in a style similar to that found in the Linux HOTWTOs or other DocBook created documents.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    PyRTF is a pure python module for the efficient creation of RTF documents.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    rpl is a text replacement utility. It will replace strings with new strings in multiple text files. It can work recursively over directories and supports limiting the search by file suffixes. This version was originally written by Göran Weinholt.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    Alphabetizer

    Alphabetizer

    Take a list of words or sentences and arrange them alphabetically.

    Alphabetizer lets anyone take a list of words or sentences and arranged them in alphabetical order easily. Alphabetizer is a tool that takes a list of words or phrases and arranged them in alphabetical order. This tool is useful for organizing information, creating glossaries, sorting names, or any task where the items in a list need to be in alphabetical order. Overall, Alphabetizer can save time and effort by quickly organizing information and making it easier to read and comprehend.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    The Python scripts for the conversion from the Chinese Pinyin transcription(ISO 7098) to International Phonetic Alphabet(IPA), comprised of a core module for developers and a flexible GUI application for the common end-users on Modern Chinese phonetics.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    Queequeg is an English grammar checker for non-native English speakers.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    SE|PY is an ActionScript editor written in python, wxPython and using scintilla for text highlight, code collapsing. some features: snippets panel, functions panel and much more. Contain also Flush
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    WikiPDF is a mediawiki extension based on Wiki2PDF that adds PDF/LaTeX features to mediawiki. Wiki2PDF is a python script to convert multiple articles of a mediawiki based wiki (pre-configured to use with www.wikipedia.org) to a single LaTeX or PDF file.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    PyWord is a powerful and flexible text editor written in Python. It aims to be similar to other, existing editors (including emacs), but has several unique features as well.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Buzz is a fast graphical editor for XML files with special support for OPML. Using the OPML convergence tools it will edit about any outline and many forms of indented text, including Python. In fact, Buzz was written with Buzz! It is written in P
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    A framework for creating freeware Truetype fonts based on Metafont language. Glyph outlines and bitmaps are converted, using a number of utilities, into TTF and BDF format.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Pyana is a extension module that allows Python programs to interface with the Apache Software Foundation's Xalan XSLT transformation engine.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    SilverCity is a lexing package, based on Scintilla, that can provide lexical analysis for over 20 programming and markup langauges. Scripting language bindings currently exist for Python.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25

    SimpleTextFormatter

    STF automatically generates documentation

    STF is a system of automatically generating documentation under control of a program or a script. It is frequently used to automatically generate test reports. STF is also used to clean up the output of a process and turn it into a nice looking report.
    Downloads: 1 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.