768 projects for "python text" with 1 filter applied:

  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 1
    cocoNLP

    cocoNLP

    A Chinese information extraction tool

    cocoNLP is a lightweight natural-language processing toolkit geared toward practical information extraction from raw text, especially for Chinese and mixed Chinese–English content. Instead of requiring a heavy pipeline, it focuses on quick wins such as extracting names, places, organizations, emails, phone numbers, and dates directly from unstructured sentences. The project blends pattern-based methods with NLP heuristics, giving developers dependable results for real-world texts like chats,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    ELI5

    ELI5

    A library for debugging/inspecting machine learning classifiers

    ELI5 is a Python library designed to help developers interpret, debug, and explain the predictions of machine learning models. The project focuses on improving model transparency by providing tools that visualize feature importance and prediction reasoning. It supports several popular machine learning frameworks including scikit-learn, XGBoost, LightGBM, CatBoost, and Keras.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Invenio

    Invenio

    Invenio digital library framework

    Invenio is a highly customizable open-source framework for building large-scale digital repositories and research data platforms. Developed by CERN, it is designed to manage, index, and provide access to metadata-rich content such as publications, datasets, and multimedia files. Invenio provides a modular architecture, making it suitable for libraries, archives, and research institutions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Canorus

    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, a MIDI playback of notes, chord markings, lyrics, import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond
    Downloads: 6 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Tacotron-2

    Tacotron-2

    DeepMind's Tacotron-2 Tensorflow implementation

    Tacotron-2 is a TensorFlow implementation of DeepMind’s Tacotron-2 end-to-end text-to-speech architecture, which predicts mel spectrograms from raw text and then feeds them to a neural vocoder such as WaveNet. It reproduces the original paper’s hyperparameters exactly via paper_hparams.py, while also offering a tuned hparams.py with extra improvements that often yield better audio quality in practice. The repository is structured as a full training pipeline: dataset preparation,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Diffuse
    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
    Leader badge
    Downloads: 105 This Week
    Last Update:
    See Project
  • 7
    WIP

    WIP

    Create Siri Shortcuts with Python

    python-shortcuts is a library to create Siri Shortcuts on your laptop with your favorite text editor. It uses toml to represent shortcuts. The library is in a very early development state (PR welcome!), so it does not support all actions from the Shortcuts app.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    SG2Im

    SG2Im

    Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201

    sg2im is a research codebase that learns to synthesize images from scene graphs—structured descriptions of objects and their relationships. Instead of conditioning on free-form text alone, it leverages graph structure to control layout and interactions, generating scenes that respect constraints like “person left of dog” or “cup on table.” The pipeline typically predicts object layouts (bounding boxes and masks) from the graph, then renders a realistic image conditioned on those layouts....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    OpenSeq2Seq

    OpenSeq2Seq

    Toolkit for efficient experimentation with Speech Recognition

    OpenSeq2Seq is a TensorFlow-based toolkit for efficient experimentation with sequence-to-sequence models across speech and NLP tasks. Its core goal is to give researchers a flexible, modular framework for building and training encoder–decoder architectures while fully leveraging distributed and mixed-precision training. The toolkit includes ready-made models for neural machine translation, automatic speech recognition, speech synthesis, language modeling, and additional NLP tasks such as...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    cnn-text-classification-tf

    cnn-text-classification-tf

    Convolutional Neural Network for Text Classification in Tensorflow

    The cnn-text-classification-tf repository by Denny Britz is a well-known educational implementation of convolutional neural networks for text classification using TensorFlow, aimed at helping developers and researchers understand how CNNs can be applied to natural language processing tasks. Based loosely on Kim’s influential paper on CNNs for sentence classification, this codebase demonstrates how to preprocess text data, convert words into learned embeddings, and apply multiple convolution...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Ponder

    Ponder

    C++ reflection library with Lua binding, and JSON and XML

    ...For example, Ponder can be used to expose and edit objects' attributes in a graphical user interface. It can also be used to do automatic binding of C++ classes to script languages such as Python or Lua. Another possible application would be the serialization of objects to XML, text or binary formats. Or you can even combine all these examples to provide a powerful and consistent interface for manipulating your objects outside C++ code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Downloads: 47 This Week
    Last Update:
    See Project
  • 13
    TeXML is an XML vocabulary for TeX. The processor transforms the TeXML markup into the TeX markup, escaping special and out-of-encoding characters. The intended audience is developers who automatically generate [La]TeX or ConTeXt files.
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14

    Presage

    the intelligent predictive text entry platform

    Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic...
    Leader badge
    Downloads: 228 This Week
    Last Update:
    See Project
  • 15
    mzitu

    mzitu

    Python crawler that downloads image galleries and analyzes titles

    ...Using text segmentation and frequency analysis, the project can create a word cloud representing common keywords found in the dataset. This makes the repository both a scraping example and a small data analysis experiment built around the collected content. Overall, mzitu serves as a learning-oriented implementation of Python web scraping, data processing, and visualization techniques.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Generative Models

    Generative Models

    Collection of generative models, e.g. GAN, VAE in Pytorch

    This project is a comprehensive open-source collection of implementations of various generative machine learning models designed to help researchers and developers experiment with deep generative techniques. The repository contains practical implementations of well-known architectures such as Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), Restricted Boltzmann Machines, and Helmholtz Machines, implemented primarily using modern deep learning frameworks like PyTorch...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Manifest Maker

    Manifest Maker

    Python app to create plain text manifest from files

    Manifest Maker is a graphical Python application which takes a file or group of files and creates a plain text manifest list of each item. The manifest includes the file name (including directory structure) as well as a checksum of the file. (No longer maintained)
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    PyEdit Editor

    PyEdit Editor

    Configurable Editor in Python

    PyEdit is a fully featured fast text editor written in Python. It can be used in any platform that has GTK. Because it is written in Python, it is easily configurable. Features code spell check and macro recording. Tested in Win/Lin/Mac --- Update: Using the editor full time V23 is mature enough for production --- Update: Had to downgrade to PyGTK 2.22.xx as 2,24,x had a double click message bug.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    tkFontChooser

    tkFontChooser

    Simple font chooser dialog for tkinter

    A simple font chooser for Tkinter that allow the user to select the font family among the fonts available on his/her system. The size and style (bold, italic, underline, strikethrough) of the text can be set too. This module contains a `FontChooser` class which implements the font chooser and an `askfont` function that displays the font chooser and returns the chosen font when the user closes the font chooser. The font is returned as a dictionary like the one returned by the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    DC-TTS

    DC-TTS

    TensorFlow Implementation of DC-TTS: yet another text-to-speech model

    DC-TTS is a TensorFlow implementation of the DC-TTS architecture, a fully convolutional text-to-speech system designed to be efficiently trainable while producing natural speech. It follows the “Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention” paper, but the author adapts and extends the design to make it practical for real experiments. The model is split into two networks: Text2Mel, which maps text to mel-spectrograms, and SSRN...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    TEACUP

    TEACUP

    TCP Experiment Automation Controlled Using Python

    TEACUP automates many aspects of running TCP performance experiments in a specially-constructed physical testbed. TEACUP enables repeatable testing of different TCP algorithms over a range of emulated network path conditions, bottleneck rate limits and bottleneck queuing disciplines. TEACUP utilises a text-based configuration file to define experiments as combinations of parameters specifying desired network path and end host conditions. When multiple values are provided (e.g. for TCP...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    TextTeaser

    TextTeaser

    TextTeaser is an automatic summarization algorithm

    textteaser is an automatic text summarization algorithm implemented in Python. It extracts the most important sentences from an article to generate concise summaries that retain the core meaning of the original text. The algorithm uses features such as sentence length, keyword frequency, and position within the document to determine which sentences are most relevant.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    uncaptcha

    uncaptcha

    Defeating Google's audio reCaptcha with 85% accuracy

    uncaptcha is an open-source proof-of-concept system designed to demonstrate vulnerabilities in Google’s audio reCAPTCHA challenges by automatically solving them using speech recognition techniques. The project uses browser automation to navigate to CAPTCHA challenges, extract audio files, and process them through multiple speech-to-text services. By combining outputs from several transcription engines, the system increases the likelihood of correctly identifying the spoken digits or phrases...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    Indexmeister

    automatic indexing for large LaTex documents

    Indexmeister reads a variety of formats (.tex, .docx, .epub, and others) and suggests keywords for indexing. The included program Imbrowse provides a semi-automatic interface to rapidly add index tags to multi-file latex documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    chemfiles

    chemfiles

    Modern library for chemistry file reading and writing

    Chemfiles is a modern and high-quality library for reading and writing trajectory files created by computational chemistry simulations program. These trajectories contains atomic positions, velocities, names, topology and sometimes more. Running simulations produce enormous amounts of data, which has to be post-processed to extract physical information about the simulated system. Chemfiles provides an interface to access this information which is - unified: the same code will work with...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB