Open Source Python Text Processing Software - Page 2

Python Text Processing Software

View 91 business solutions

Browse free open source Python Text Processing Software and projects below. Use the toggles on the left to filter open source Python Text Processing Software by OS, license, language, programming language, and project status.

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Yet Another Python Interface to Expat. Object-oriented Python interface to Expat, designed for simplicity and ease of use.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Alphabetizer

    Alphabetizer

    Take a list of words or sentences and arrange them alphabetically.

    Alphabetizer lets anyone take a list of words or sentences and arranged them in alphabetical order easily. Alphabetizer is a tool that takes a list of words or phrases and arranged them in alphabetical order. This tool is useful for organizing information, creating glossaries, sorting names, or any task where the items in a list need to be in alphabetical order. Overall, Alphabetizer can save time and effort by quickly organizing information and making it easier to read and comprehend.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Contains a LaTeX style file and an associated GUI that allow for the annotation of LaTeX documents. Tracks changes made by multiple editors. This package provides a way for multiple authors to collaboratively edit a latex document.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    The converter performs automatically the full process of converting the files of a C project into the equivalent C++ files. Classes are created, var and functions becomes attributes and methods and the changes are propagated into all files.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    DocScript is an approach to document preparation. It presents tools and utilities to edit and publish documents. The philosophy behind the DocScript project is to utilize the programming tools you're working with anyway in your daily work.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    A multiplatform visual implementation of the Unix utility grep
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Sed.py is a python module to provide a easy way to do text stream processing. Just like the name of module, it likes to do the work that sed can do. But not in sed's way, it's in Python's way. To use this module, the knowledge of regexp is necessary.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    SilverCity is a lexing package, based on Scintilla, that can provide lexical analysis for over 20 programming and markup langauges. Scripting language bindings currently exist for Python.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    wordaxe (formerly deco-cow): A hyphenation library for Python. Several hyphenation algorithms: - the pattern-based from TeX/OOO, - by decomposition of compound words for German language. Includes support for paragraph line-breaking with ReportLab.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Project is dead.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    A simple Gedit Plugin which enables a markup syntax to produce extended characters. 59 defaults but extra character support can be added easily making this plugin highly configurable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A python module that provides algorithms for advanced search - basically all you need to build a search engine.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    AmandaVCD splits large VCF files into smaller files with only one vCard in each file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Atox is a fully customizable Python library and command-line tool for converting plain text into XML.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Boarded - on-screen keyboard for X11
    Boarded is a virtual on-screen keyboard for the X11 window system. It's intended to be used on mobile devices without a hardware keyboard. It allows you to design your own keyboard layouts, specifying key positioning/grouping and setting own colors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    A python script that uses wxwidgets. View or edit delimited data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Have you ever encountered dynamic web page (php, asp.net...) problems as like being browsed full of question marks? This (currently) tiny utility is for you. It tries to correct the problem by changing the file encoding to unicode.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Chinese-Korean Bilingual Corpus

    Construction of Chinese-Korean Bilingual Corpus and Search Technology

    Construction of Chinese-Korean Bilingual Corpus and Search Technology. Some auto alignment programs and a search engine based on jung seong are provided.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Concurrence is a networked file editing program that enables multiple people to modify a document simultaneously. It is written entirely in Python, and uses the wxPython library for the GUI and the Twisted library for networking.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Software for improving typing speed. The user will type only the consonant of the words, and the software will work its way to find the desired word.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    The Contraction Converter is meant for people who write papers a lot and get deducted for having contractions. Run your file through this program and can't, etc. will be changed to the proper equivalent (ex. can not).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    xtopdf: Tools to convert other formats (x) to PDF; x as in math. - solve for x :-) Currently x == {.txt, .DBF}. Others to follow. Benefits: all those of PDF (better cross-platform viewing/printing, read-only, etc.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Latex editor with document structure tree view and project handling. Latex output allows direct jump to warnings/errors. Projects folders gives support for figures/graphs. Editor component includes usual features like search/replace and syntax highlight.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A modular system for extracting and converting Python docstrings into useful structured formats like HTML, XML, and TeX. Project inactive. Development taken over by Docutils, http://docutils.sourceforge.net/.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB