Showing 1641 open source projects for "python text"

  • 1

    pyMathSpeak

    A Python implementation of the MathSpeak Core Specification

    pyMathSpeak is a Python package that translates MathML equations into spoken text. It implements the MathSpeak Core Specification, publicly available at http://www.gh-mathspeak.com/examples/grammar-rules/. The default translation targets English. Localization for different languages is possible, and a Traditional Chinese MathML translator is included for reference.
    Downloads: 0 This Week
  • 2
    ELI5

    A library for debugging/inspecting machine learning classifiers

    ELI5 is a Python library designed to help developers interpret, debug, and explain the predictions of machine learning models. The project focuses on improving model transparency by providing tools that visualize feature importance and prediction reasoning. It supports several popular machine learning frameworks including scikit-learn, XGBoost, LightGBM, CatBoost, and Keras.
    Downloads: 0 This Week
  • 3
    Defox text to speech and downloader

    Read written or imported text aloud offline, or download it online.

    ...I'm worried about that. Note 2: When you double-click the software, it may take a few seconds to open. That's not my fault. I wrote this software in Python, and Python does not run quickly, even on modern computers.
    Downloads: 0 This Week
  • 4
    gscrabble

    A beautiful crossword game

    A crossword puzzle game that is fun and useful.
    Downloads: 10 This Week
  • 5
    nonechucks

    Deal with bad samples in your dataset dynamically

    nonechucks is a library that provides wrappers for PyTorch's datasets, samplers and transforms to allow for dropping unwanted or invalid samples dynamically. What if you have a dataset of 1000s of images, out of which a few dozen images are unreadable because the image files are corrupted? Or what if your dataset is a folder full of scanned PDFs that you have to OCRize, and then run a language detector on the resulting text, because you want only the ones that are in English? Or maybe you...
    Downloads: 0 This Week
  • 6
    bsed

    Simple SQL-like syntax on top of Perl text processing

    bsed is a stream editor that offers a simple SQL-like syntax for text processing tasks. Designed to replace basic uses of tools like sed, grep, AWK, and Perl, it allows users to perform complex text manipulations with intuitive commands.
    Downloads: 0 This Week
  • 7

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify. The Safe Harbor Deidentification mode of Phalanx is an abridged pipeline of NLP annotators, culminating in NER annotators that write out text offsets. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
  • 8
    IDEA (Text Data Visualizer)

    Text Data Visualizer with Django

    It is hard for non-developers to visualize data, but IDEA makes it easy. To test Project: IDEA locally in your environment, you need mecab-ko and mecab-ko-dic. If you have data you want to visualize, just put it in IDEA and click the Visualization button!
    Downloads: 0 This Week
  • 9
    easy12306

    Automatic recognition of 12306 verification code

    Automatic recognition of 12306 verification codes using a machine learning algorithm. It can identify never-before-seen pictures.
    Downloads: 0 This Week
  • 10
    Rasa Core

    Rasa Core is now part of the Rasa repo

    Rasa is an open source machine learning framework to automate text- and voice-based conversations. Rasa helps you build contextual assistants capable of having layered conversations with lots of back-and-forth. In order for a human to have a meaningful exchange with a contextual assistant, the assistant needs to be able to use context to build on things that were previously discussed. Rasa enables you to build assistants that can do this in a...
    Downloads: 1 This Week
  • 11
    lazynlp

    Library to scrape and clean web pages to create massive datasets

    LazyNLP is a lightweight tool for collecting and curating large-scale text datasets for machine learning and NLP applications with minimal manual effort.
    Downloads: 0 This Week
  • 12
    diffdmp

    Semantically compare two text files

    Compares two text files (typically two versions of a text) using the diff-match-patch library, with semantic cleanup. Produces an ordered list of additions, removals, and identical text sections, which allows for clear presentation of the differences as well as recreating each of the texts. Launch from the command prompt without any arguments for usage instructions. Library reference: https://github.com/google/diff-match-patch. This utility has a simple GUI wrapper: http://vsspdiff.sf.net
    Downloads: 0 This Week
  • 13
    Invenio

    Invenio digital library framework

    Invenio is a highly customizable open-source framework for building large-scale digital repositories and research data platforms. Developed by CERN, it is designed to manage, index, and provide access to metadata-rich content such as publications, datasets, and multimedia files. Invenio provides a modular architecture, making it suitable for libraries, archives, and research institutions.
    Downloads: 0 This Week
  • 14
    pangu.py

    Paranoid text spacing in Python

    pangu.py is a Python port of the Pangu spacing tool that automatically inserts proper whitespace between CJK characters and Latin letters, numbers, or symbols. Mixed-script text often becomes cramped or ambiguous, and this library applies simple but effective typography rules to make it instantly more readable. It works both as a Python library and a command-line utility, so you can process strings in code, tidy files in bulk, or wire it into documentation and build pipelines. ...
    Downloads: 2 This Week
  • 15
    NeuroNER

    Named-entity recognition using neural networks

    Named-entity recognition (NER) aims at identifying entities of interest in the text, such as locations, organizations and temporal expressions. Identified entities can be used in various downstream applications such as patient note de-identification and information extraction systems. They can also be used as features for machine learning systems for other natural language processing tasks. Leverages the state-of-the-art prediction capabilities of neural networks (a.k.a. "deep learning"). Is...
    Downloads: 0 This Week
  • 16
    oscp

    multiplatform, small and handy audio/video player with network remote

    oscp by ariel/KotCzarny @ irc://irc.freenode.com/h3droid
    tiny media player:
    - libav (mp3, wav, ogg, aac, mpc, wma, flac, ape, avi, mkv, flv, etc.)
    - wildmidi (mid)
    - xmp (mod, med, xm, s3m, it, dbm, psm, omx, okt, digi, 669, mtm, acid, umx)
    - gme (ay, gbs, gym, hes, kss, nsf, nsfe, sap, spc, vgm)
    - sidplay2 (sid, psid, info)
    - mdxplay (mdx)
    - fc14dec (fc, fc13, fc14)
    - sc68 (sc68, sndh)
    - asapconv (sap, cmc, cm3, cmr, cms, dmc, dlt, mpt, mpd, rmt, tmc, tm8, tm2)
    -...
    Downloads: 2 This Week
  • 17
    Codepad

    A free source code editor and Notepad replacement for Windows

    Source code on GitHub: https://github.com/altarawy/codepad
    Downloads: 4 This Week
  • 18
    CMD Plot Tool

    Calculates and plots Colour Magnitude Diagrams from Astronomical data

    CMD Plot Tool calculates and plots Colour Magnitude Diagrams (CMDs) from astronomical photometric data, e.g. of a star cluster observed in two filter bandpasses. It can handle multiple file formats (plain text, DAOPHOT .mag files, ACS Survey of Galactic Globular Clusters .zpt files) to generate professional and customised plots, without the usual steep learning curve. Development using Python, OOP and a formal software development lifecycle model (Agile) allowed for the creation of an application that can be deployed on multiple systems. ...
    Downloads: 1 This Week
  • 19
    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, MIDI playback of notes, chord markings, lyrics, and import/export filters for formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond.
    Downloads: 11 This Week
  • 20
    TextRank

    TextRank implementation for Python 3

    TextRank is an implementation of the TextRank algorithm for extractive text summarization and keyword extraction, inspired by Google’s PageRank.
    Downloads: 0 This Week
  • 21
    Tacotron-2

    DeepMind's Tacotron-2 TensorFlow implementation

    Tacotron-2 is a TensorFlow implementation of DeepMind’s Tacotron-2 end-to-end text-to-speech architecture, which predicts mel spectrograms from raw text and then feeds them to a neural vocoder such as WaveNet. It reproduces the original paper’s hyperparameters exactly via paper_hparams.py, while also offering a tuned hparams.py with extra improvements that often yield better audio quality in practice. The repository is structured as a full training pipeline: dataset preparation,...
    Downloads: 0 This Week
  • 22
    MyNotes

    Sticky notes/post-it application for Linux

    MyNotes is a sticky notes/post-it application. Notes are created using the system tray icon. They can be organized in categories and each category has a color. Images, checkboxes and a few predefined symbols can be inserted in the notes. The style of the text can be changed (alignment, style, color).
    Downloads: 0 This Week
  • 23

    Arabic Corpus

    Text categorization, Arabic language processing, language modeling

    The Arabic Corpus, compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ). The Khaleej-2004 corpus contains 5690 documents divided into 4 topics (categories). The Watan-2004 corpus contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora should cite the two main references: (1) For the Watan-2004 corpus: M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on...
    Downloads: 21 This Week
  • 24
    Diffuse
    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
    Downloads: 102 This Week
  • 25
    Deepvoice3_pytorch

    PyTorch implementation of convolutional neural networks

    An open source implementation of Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning.
    Downloads: 1 This Week