Showing 1320 open source projects for "python text"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    HistogramsApp

    HistogramsApp

    Application that generates KDE-PDP plots from geochronological data

    HistogramsApp is a Python 3.6 application that generates (KDE and PDP) from geochronological data .HistogramsApp allows to interactively setup plot parameters such as the bandwidth and the peak detection sensibility. To cite the application please refer to: 1) https://www.tandfonline.com/doi/abs/10.1080/00206814.2021.1954556?journalCode=tigr20 Rodriguez-Corcho, A. F., Rojas-Agramonte, Y., Barrera-Gonzalez, J. A., Marroquin-Gomez, M. P., Bonilla-Correa, S., Izquierdo-Camacho, D.,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Rescatux

    Rescatux

    Easy to use Repair distribution for both Gnu/Linux and Windows systems

    Rescatux is a GNU/Linux repair cd (and eventually also Windows) but it is not like other rescue disks. Rescatux comes with Rescapp. Rescapp is a nice wizard that will guide you through your rescue and repair tasks. When the wizard is not able to solve your problem you can also enjoy of Rescatux unique support features: * Chat: Open the chat for asking help directly in Rescatux channel. * Share log: After running an option you can share its log (the action registry that it has done) so...
    Leader badge
    Downloads: 1,384 This Week
    Last Update:
    See Project
  • 3
    Paperless-ng

    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    CompareDir

    CompareDir

    Tool to Compare 2 directories and handle the files.

    Did you ever copy many files to a portable usb-harddisk and use another system to update or delete files ? And later have trouble to find the few files you changed among all the others ? CompareDir does just that: it can compare 2 directories + subdirectories and you decide what files you want to keep, copy or delete. Manual: point it at a MasterDir, a SlaveDir than click "Diff" = show only the files that are available in only 1 directory-tree. Or click "All" = show all files in both...
    Downloads: 0 This Week
    Last Update:
    See Project
  • The Original Buy Center Software. Icon
    The Original Buy Center Software.

    Never Go To The Auction Again.

    VAN sources private-party vehicles from over 20 platforms and provides all necessary tools to communicate with sellers and manage opportunities. Franchise and Independent dealers can boost their buy center strategies with our advanced tools and an experienced Acquisition Coaching™ team dedicated to your success.
    Learn More
  • 5
    aseryla

    aseryla

    Aseryla code repositories

    This project describes a model of how the semantic human memory represents the information relevant to the objects of the world in text format. It provides a system and a GUI application capable of extracting and managing concepts and relations from English texts. https://aseryla2.sourceforge.io/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Texthero

    Texthero

    Text preprocessing, representation and visualization from zero to hero

    Texthero is a python package to work with text data efficiently. It empowers NLP developers with a tool to quickly understand any text-based dataset and it provides a solid pipeline to clean and represent text data, from zero to hero.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Glazier

    Glazier

    A tool for automating the installation of Windows OS

    ...It streamlines the entire Windows imaging process by booting systems into the Windows Preinstallation Environment (WinPE), retrieving installation instructions from a web server, and automatically applying operating systems, software, and configurations. The tool is fully text-based and code-driven, with configurations written in YAML, allowing teams to leverage source control for versioning, collaboration, and quality assurance. By distributing installation data via HTTPS, Glazier ensures scalability and flexibility, supporting both simple local servers and large-scale cloud-based deployments. Its extensibility makes it easy for administrators to create custom actions using Python or PowerShell, enabling tailored automation for diverse enterprise environments. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    Marathi MTLD tool (for GNU/Linux)

    A tool for measuring MTLD score of Marathi-Devanagari text

    This tool has been developed as part of M. Phil. research work submitted to the Department of Linguistics, University of Mumbai in 2021. This is version 0.1 of the tool. The tool calculates the number of repeated words present in the Marathi text written in the Devnagari script by using the Measure of Textual Lexical Diversity (MTLD) approach. The program performs the lemmatization process on input text before calculating its MTLD score. The program accepts Devanagari Marathi text as an...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Pythopad

    Pythopad

    A free Python source code editor and Notepad replacement for Windows

    Downloads: 0 This Week
    Last Update:
    See Project
  • Smart Business Texting that Generates Pipeline Icon
    Smart Business Texting that Generates Pipeline

    Create and convert pipeline at scale through industry leading SMS campaigns, automation, and conversation management.

    TextUs is the leading text messaging service provider for businesses that want to engage in real-time conversations with customers, leads, employees and candidates. Text messaging is one of the most engaging ways to communicate with customers, candidates, employees and leads. 1:1, two-way messaging encourages response and engagement. Text messages help teams get 10x the response rate over phone and email. Business text messaging has become a more viable form of communication than traditional mediums. The TextUs user experience is intentionally designed to resemble the familiar SMS inbox, allowing users to easily manage contacts, conversations, and campaigns. Work right from your desktop with the TextUs web app or use the Chrome extension alongside your ATS or CRM. Leverage the mobile app for on-the-go sending and responding.
    Learn More
  • 10
    Kashgari

    Kashgari

    Kashgari is a production-level NLP Transfer learning framework

    Kashgari is a simple and powerful NLP Transfer learning framework, build a state-of-art model in 5 minutes for named entity recognition (NER), part-of-speech tagging (PoS), and text classification tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Text Gen

    Text Gen

    Almost state of art text generation library

    Almost state of art text generation library. Text gen is a python library that allow you build a custom text generation model with ease. Something sweet built with Tensorflow and Pytorch(coming soon). Load your data, your data must be in a text format. Download the example data from the example folder. Tune your model to know the best optimizer, activation method to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Footcontroller

    Footcontroller

    Control your Linux PC with a standard foot pedal

    This python utility allows users on Linux to control their PC using a HID compatible USB foot pedal. Note: footcontroller does not support the new VEC Infinity in-USB3 pedal, which is not fully Linux compatible The foot pedal becomes an extra mouse or mini keyboard but footcontroller allows you to define multiple pedal sets which you can activate at the click of a button. It uses xdotool to provide you with the ability to assign commands to each pedal. Many foot pedals come with...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    hebrew-gpt_neo

    hebrew-gpt_neo

    Hebrew text generation models based on EleutherAI's gpt-neo

    Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 which was made available to me via the TPU Research Cloud Program. The Open Super-large Crawled ALMAnaCH coRpus is a huge multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the goclassy architecture.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Ciphey

    Ciphey

    Decrypt encryptions without knowing the key or cipher

    Fully automated decryption/decoding/cracking tool using natural language processing & artificial intelligence, along with some common sense. You don't know, you just know it's possibly encrypted. Ciphey will figure it out for you. Ciphey can solve most things in 3 seconds or less. Ciphey aims to be a tool to automate a lot of decryptions & decodings such as multiple base encodings, classical ciphers, hashes or more advanced cryptography. If you don't know much about cryptography, or you want...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    FARM

    FARM

    Fast & easy transfer learning for NLP

    FARM makes Transfer Learning with BERT & Co simple, fast and enterprise-ready. It's built upon transformers and provides additional features to simplify the life of developers: Parallelized preprocessing, highly modular design, multi-task learning, experiment tracking, easy debugging and close integration with AWS SageMaker. With FARM you can build fast proofs-of-concept for tasks like text classification, NER or question answering and transfer them easily into production. Easy fine-tuning...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    AliceMind

    AliceMind

    ALIbaba's Collection of Encoder-decoders from MinD

    This repository provides pre-trained encoder-decoder models and its related optimization techniques developed by Alibaba's MinD (Machine IntelligeNce of Damo) Lab. Pre-trained models for natural language understanding (NLU). We extend BERT to a new model, StructBERT, by incorporating language structures into pre-training. Specifically, we pre-train StructBERT with two auxiliary tasks to make the most of the sequential order of words and sentences, which leverage language structures at the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 18
    DrQA

    DrQA

    Reading Wikipedia to Answer Open-Domain Questions

    DrQA is an open-domain question answering system that reads large text corpora—famously Wikipedia—to answer natural language questions with extractive spans. It follows a two-stage pipeline: a fast document retriever first narrows down candidate articles, and a neural machine reader then predicts the exact answer span from those passages. The retriever relies on classic IR features (like TF-IDF and n-gram statistics) to remain lightweight and scalable to millions of documents. The reader is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    pylatexenc

    pylatexenc

    Simple LaTeX parser providing latex-to-unicode and unicode-to-latex

    Simple LaTeX parser providing latex-to-unicode and unicode-to-latex conversion. Python 3.4 or 2.7. The library is designed to be as backward-compatible as reasonably possible and is able to run on old Python versions should it be necessary. (Use the setup.py script directly if you have Python 3.7, poetry doesn't seem to work with old Python versions.) The pylatexenc.latexencode module provides a function unicode_to_latex() which converts a Unicode string into LaTeX text and escape sequences. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    OpenNum

    OpenNum

    OpenNum lets you distribute solvers with a nice graphical interface

    Typically, to program a GUI is time consuming and requires experience with graphic libraries. OpenNum lets you create a graphical interface adapted to your solvers by simply editing an XML configuration file. More specifically, OpenNum lets you · to collect a hierarchical dataset, · to call any executable file and · to visualize scalar and vector fields, plot graphs or show simple plain text files. It also has other useful utilities specifically designed for numerical...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    kamiFaka

    kamiFaka

    kamiFaka

    Applicable to all kinds of e-commerce, coupons, forum invitation codes, recharge cards, activation codes, registration codes, Tencent iQiyi points CDK, etc., support manual and automatic delivery, and a tiered wholesale model similar to 1688. Stisla UI: The web interface is beautiful Front-end uses VUE3.0, millisecond-level response. Has integrated Alipay face-to-face payment, WeChat official, Payjs, Hupijiao, YunGouOS, Yipay, Mugglepay, V visa-free and other more than a dozen payment...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    CRSLab

    CRSLab

    CRSLab is an open-source toolkit

    CRSLab is an open-source toolkit for building Conversational Recommender System (CRS). It is developed based on Python and PyTorch. CRSLab has the following highlights. Comprehensive benchmark models and datasets: We have integrated commonly-used 6 datasets and 18 models, including graph neural network and pre-training models such as R-GCN, BERT and GPT-2. We have preprocessed these datasets to support these models, and release for downloading. Extensive and standard evaluation protocols: We...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Big List of Naughty Strings

    Big List of Naughty Strings

    List of strings which have a high probability of causing issues

    The Big List of Naughty Strings is a community-maintained catalog of “gotcha” inputs that commonly break software, from unusual Unicode to SQL and script injection payloads. It exists so developers and QA engineers can easily test edge cases that normal test data would miss, such as zero-width characters, right-to-left marks, emojis, foreign alphabets, and long or malformed strings. By throwing these strings at forms, APIs, databases, and UIs, teams can discover encoding bugs, sanitizer...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    instagram-spammer

    instagram spammer, spam, spammer, spambot, spammerbot, pythonspam

    instagram-spammer is a script, that allows you to spam text on instagram. If you DON'T KNOW how to use this script please read README.TXT file. WARNING This script contains packages that may NEED to be installed using PIP. Use this script only with basic programming/python knowledge.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    AS Arrow

    AS Arrow

    Add and Subtract game

    AS refer to Add and Sub. During the game process, the card present as Arrow. Game Rule: Delete all card to win. User can drag and drop card from cards library to original 6 cards. At the game start, original 6 cards look like: d -b e- a -c f User can connect d, e, f cards. Once d and e are deleted, user can connect to b. Cards are positive for a, b, c, d, e, f. And then connecting card is negative. In the similar way, the next card is positive (add/plus), and then the card is...
    Downloads: 0 This Week
    Last Update:
    See Project