Open Source Python Text Processing Software

Python Text Processing Software

View 91 business solutions

Browse free open source Python Text Processing Software and projects below. Use the toggles on the left to filter open source Python Text Processing Software by OS, license, language, programming language, and project status.

  • All-in-One Payroll and HR Platform Icon
    All-in-One Payroll and HR Platform

    For small and mid-sized businesses that need a comprehensive payroll and HR solution with personalized support

    We design our technology to make workforce management easier. APS offers core HR, payroll, benefits administration, attendance, recruiting, employee onboarding, and more.
  • Gain insights and build data-powered applications Icon
    Gain insights and build data-powered applications

    Your unified business intelligence platform. Self-service. Governed. Embedded.

    Chat with your business data with Looker. More than just a modern business intelligence platform, you can turn to Looker for self-service or governed BI, build your own custom applications with trusted metrics, or even bring Looker modeling to your existing BI environment.
  • 1
    Scribus

    Scribus

    Powerful desktop publishing software

    Scribus is an Open Source program that brings professional page layout to Linux, BSD UNIX, Solaris, OpenIndiana, GNU/Hurd, Mac OS X, OS/2 Warp 4, eComStation, and Windows desktops with a combination of press-ready output and new approaches to page design. Underneath a modern and user-friendly interface, Scribus supports professional publishing features, such as color separations, CMYK and spot colors, ICC color management, and versatile PDF creation.
    Leader badge
    Downloads: 10,885 This Week
    Last Update:
    See Project
  • 2
    Diffuse
    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
    Leader badge
    Downloads: 369 This Week
    Last Update:
    See Project
  • 3
    Notepad++ Python Script

    Notepad++ Python Script

    A Python Scripting plugin for Notepad++

    A Python Scripting plugin for Notepad++. Complete easy script access to all of the editor's features (including absolutely everything in Scintilla). Configurable menus and toolbar options, assign shortcuts to scripts.
    Leader badge
    Downloads: 249 This Week
    Last Update:
    See Project
  • 4
    Utilities for general- and special-purpose documentation. Includes reStructuredText, the easy to read, easy to use, what-you-see-is-what-you-get plaintext markup language.
    Leader badge
    Downloads: 143 This Week
    Last Update:
    See Project
  • Finance Automation that puts you in charge Icon
    Finance Automation that puts you in charge

    Tipalti delivers smart payables that elevate modern business.

    Our robust pre-built connectors and our no-code, drag-and-drop interface makes it easy and fast to automatically sync vendors, invoices, and invoice payment data between Tipalti and your ERP or accounting software.
  • 5
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Leader badge
    Downloads: 111 This Week
    Last Update:
    See Project
  • 6
    meld-installer

    meld-installer

    Meld Installer for Windows

    Bundles Portable Python (with PyGTK) and Meld together in an easy to use installer. This allows you to not have to worry about setting up Python or PyGTK and you can keep Meld's Python separate from other Python installations on your machine. ** NOTE ** Meld 3.11 and later now have official installers, hence this project is no longer supported. You can download the new installer here: https://download.gnome.org/binaries/win32/meld/. You should uninstall the old 1.8 version before upgrading.
    Leader badge
    Downloads: 79 This Week
    Last Update:
    See Project
  • 7
    tika-python

    tika-python

    Python binding to the Apache Tika™ REST services

    A Python port of the Apache Tika library that makes Tika available using the Tika REST Server. This makes Apache Tika available as a Python library, installable via Setuptools, Pip and easy to install. To use this library, you need to have Java 7+ installed on your system as tika-python starts up the Tika REST server in the background. To get this working in a disconnected environment, download a tika server file (both tika-server.jar and tika-server.jar.md5, which can be found here) and set the TIKA_SERVER_JAR environment variable to TIKA_SERVER_JAR="file:////tika-server.jar" which successfully tells python-tika to "download" this file and move it to /tmp/tika-server.jar and run as a background process. This is the only way to run python-tika without internet access. Without this set, the default is to check the tika version and pull latest every time from Apache.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Tomoe is a handwriting character recognition engine.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 9
    DrPython is a highly customizable cross-platform ide to aid programming in Python. It was developed with teaching in mind, and has a clean, simple interface. It is written in Python, using wxPython as the gui.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Manage your IT department more effectively Icon
    Manage your IT department more effectively

    Streamline your business from end to end with ConnectWise PSA

    ConnectWise PSA (formerly Manage) allows you to stop working in separate systems, and helps you build a more profitable business. No more duplicate data entries, inefficient employees, manual invoices, and the inability to accurately track client service issues. Get a behind the scenes look into the award-winning PSA that automates processes for each area of business: sales, help desk, support, finance, and HR.
  • 10
    EpiDoc: Epigraphic Documents in TEI XML

    EpiDoc: Epigraphic Documents in TEI XML

    XML text markup for ancient documents

    The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri. The link below will take you to the EpiDoc home page on this site.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 11
    PyRTF is a pure python module for the efficient creation of RTF documents.
    Leader badge
    Downloads: 9 This Week
    Last Update:
    See Project
  • 12
    TeXML is an XML vocabulary for TeX. The processor transforms the TeXML markup into the TeX markup, escaping special and out-of-encoding characters. The intended audience is developers who automatically generate [La]TeX or ConTeXt files.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 13
    The Python scripts for the conversion from the Chinese Pinyin transcription(ISO 7098) to International Phonetic Alphabet(IPA), comprised of a core module for developers and a flexible GUI application for the common end-users on Modern Chinese phonetics.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 14
    A module for python/pygame used for typesetting text to the screen. It provides specialized functions for scrolling text, pages of text, selectable text, and an on screen text editor.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    Pylero
    Pylero is an open-source Python-based text generator.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    DocLite is a simple documentation authoring system, it produces multi-page HTML output (like this document) in a style similar to that found in the Linux HOTWTOs or other DocBook created documents.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    MahaKurawa.My.ID URL Extractor

    MahaKurawa.My.ID URL Extractor

    MahaKurawa.My.ID URL Extractor is Simple Tool to extract unique URL

    MahaKurawa.My.ID URL Extractor is Simple Tool to extract unique URL from any text content in instant. It's useful when you lazy enough to identify and copy-paste URL from your content one by one by yourself.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    pyfiglet is a full port of the FIGlet specification (http://www.figlet.org/) into pure python. It takes ASCII text and renders it in ASCII art fonts. It can be used on the commandline or as an Object Oriented driver library in your own programs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    This project will entail Python modules which will offer an interface for Python programmers to produce TeX documents from within Python software.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Word segmentation utility for Thai language written in C
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    A framework for creating freeware Truetype fonts based on Metafont language. Glyph outlines and bitmaps are converted, using a number of utilities, into TTF and BDF format.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    A simple text editor. Nice Notepad replacement. This is a basic a GUI text editor, edits simple text files. Useful for making quick edits to files. A small text editor without many advanced features, however it may evolve into having specialized features.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    PyRtfLib is a python library that provides a parser and few translators like rtf to html and to simple text.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    SE|PY is an ActionScript editor written in python, wxPython and using scintilla for text highlight, code collapsing. some features: snippets panel, functions panel and much more. Contain also Flush
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25

    arCHMage

    A reader and decompiler for files in the CHM format

    arCHMage is a reader and decompiler for files in the CHM format. This is the format used by Microsoft HTML Help, and is also known as Compiled HTML.
    Downloads: 1 This Week
    Last Update:
    See Project