Showing 75 open source projects for "python text parser"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    nanoGPT

    nanoGPT

    The simplest, fastest repository for training/finetuning models

    NanoGPT is a minimalistic yet powerful reimplementation of GPT-style transformers created by Andrej Karpathy for educational and research use. It distills the GPT architecture into a few hundred lines of Python code, making it far easier to understand than large, production-scale implementations. The repo is organized with a training pipeline (dataset preprocessing, model definition, optimizer, training loop) and inference script so you can train a small GPT on text datasets like Shakespeare or custom corpora. It emphasizes readability and clarity: the training loop is cleanly written, and the code avoids heavy abstractions, letting students follow the architecture step by step. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    Concordia

    Concordia

    Crowdsourcing platform for full text transcription and tagging

    Concordia is a platform for crowdsourcing transcription and tagging of text in digitized images. It was developed by the Library of Congress so that volunteers of all backgrounds could transcribe and tag digitized images of manuscripts and typed materials from the Library’s collections that could not otherwise be done by optical character recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Stanza

    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of speech and morphological features, to give a syntactic structure dependency parse, and to recognize named entities. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Book4_Power-of-Matrix

    Book4_Power-of-Matrix

    Book_4_Matrix Power | The Iris Book: From Addition, Subtraction

    ...The project is designed to help readers progress from basic arithmetic toward machine learning fundamentals by building a strong conceptual understanding of vectors, matrices, and their operations. It combines explanatory text, diagrams, and Python examples to bridge theory and practical computation. The material emphasizes geometric interpretation and visual reasoning, which makes abstract linear algebra topics more accessible to beginners and self-learners. The repository is continuously updated and intended to accompany the broader Visualize-ML learning ecosystem. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 5
    Think Python 2

    Think Python 2

    LaTeX source and supporting code for Think Python, 2nd edition

    ThinkPython2 is the repository for the second edition of Allen Downey’s Think Python textbook, which teaches programming fundamentals in Python to beginners. The code includes all of the example programs, exercises, and supplementary files referenced in the book, allowing learners to run the examples, experiment, and extend them. The repository contains clean, well-commented Python scripts that are easy to follow and map directly to chapters of the text, covering topics like variables, control flow, functions, recursion, data structures (lists, dictionaries), classes and objects, file I/O, and algorithmic thinking. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Google Open Source Project Style Guide

    Google Open Source Project Style Guide

    Chinese version of Google open source project style guide

    Each larger open source project has its own style guide, a series of conventions on how to write code for the project (sometimes more arbitrary). When all the code maintains a consistent style, it is more important when understanding large code bases. easy. The meaning of "style" covers a wide range, from "variables use camelCase" to "never use global variables" to "never use exceptions". The English version of the project maintains the programming style guidelines used in Google. If the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ktrain

    ktrain

    ktrain is a Python library that makes deep learning AI more accessible

    ktrain is a Python library that makes deep learning and AI more accessible and easier to apply. ktrain is a lightweight wrapper for the deep learning library TensorFlow Keras (and other libraries) to help build, train, and deploy neural networks and other machine learning models. Inspired by ML framework extensions like fastai and ludwig, ktrain is designed to make deep learning and AI more accessible and easier to apply for both newcomers and experienced practitioners. With only a few lines...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Little Book of Linear Algebra

    Little Book of Linear Algebra

    A concise, beginner-friendly introduction to the core ideas of linear

    This is a concise, beginner-friendly introduction to the fundamental concepts of linear algebra, intended to give readers intuition without overwhelming detail. The material is organized into chapters covering vectors, matrices, linear systems, vector spaces, eigenvalues/eigenvectors, and other central topics, each with worked examples and explanations. There is also a companion “LAB” section for hands-on exploration (e.g. using Python/NumPy) to help cement the connections between algebraic...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Libro

    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    translate-gui

    translate-gui

    GUI for translate-shell, the cli tool for quick translation

    GUI for the translate-shell, aims to be easy to use translator and a helpful tool for learning new languages. Most tools do a one way translation from source to target language, do to the reverse involves choosing the source and target languages again. This tool can do a 2 way translation accompanied by speech output of the target language text. Hence it can prove to be an indispensable aid when learning new languages
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    anglolintergo

    lightweight rulebased grammar error checker in Golang

    standalone offline grammar error checker (English, German and Dutch) programmed in Golang V 0.3.8.22 Usage: anglolintergo mytext.txt -> check English text anglolintergo -fix mytext.txt -> check and autofix English text anglolintergo -de mytext_ge.txt -> check German text anglolintergo -nl mytext_nl.txt -> check Dutch text anlgolintergo -help -> show helpfile anglolintergo mytext.tex -> check English...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Sharada-Braille-Writer

    Sharada-Braille-Writer

    Six key approach to producing print materials.

    Sharada-Braille-Writer (S.B.W) is a text editor for GNU/Linux which support braille input. In S.B.W, Simultaneous multiple press of braille combinations, will produce corresponding letters. So one can input text in more than 40 languages with braille knowledge in perkins like way. letters f, d, s, j, k, l represent 1 2 3 4 5 6 of the braille dots respectively. By pressing "f" and "s" together will produce letter k and like. Website : https://zendalona.com/sbw/ Github repository 1 :...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Programming Without Coding Technology

    Programming Without Coding Technology

    Create software using visual programming

    ...Many database, Multi-Media, Network, AI, Simulation & Math applications are developed using PWCT You can see/edit the generated source code. PWCT support Harbour, Supernova, C, Python, & C#.NET and you can extend PWCT to support code generation in any text based programming language. PWCT comes with many samples, tutorials and movies.
    Leader badge
    Downloads: 1,185 This Week
    Last Update:
    See Project
  • 14
    SpeeD ReaD ("Speedy Read-y")

    SpeeD ReaD ("Speedy Read-y")

    SpeeD ReaD is a little program to help you read faster.

    SpeeD ReaD helps you to read faster and more efficiently. By minimizing subvocalization and saccades, you can process and comprehend the text you read much faster than with normal reading. First, subvocalization is the natural tendency for all of us to "hear" the words in our brains as we read. Think of it as reading out loud inside your head. But our minds do not need us to sound out the words we read - even inside our heads - in order to understand them. The words can be processed in a...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    CiteFlow

    CiteFlow

    Desktop research workspace for PDFs, notes, citations, bibliographies.

    CiteFlow is a focused desktop research workspace for students, researchers, and academic writers who want to manage PDFs, notes, citations, and bibliographies in one place. Create project-based workspaces for essays, articles, reports, literature reviews, and long-form research. Import PDFs, read them inside the app, search within documents, compare files side by side, highlight key passages, and add page-based notes. CiteFlow can assist with DOI metadata detection, keeps citation history...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    TOMUSS

    TOMUSS: The Online Multi User Simple Spreadsheet

    TOMUSS is an interactive web application (groupware) allowing multiple concurrent users to edit data tables. Its primary goal is the management of students grades.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Zettel

    Zettel

    Zettel allows taking notes from several references and organizing them

    Zettel is a program for taking notes from bibliographic references. Instead of marking the text on paper and then going crazy looking for where the copy ended up, the notes are saved in a database, linked to the reference from where they were copied. Notes can be tagged and retrieved in several ways. Zettel é um programa para fichamento de referências bibliográficas. Ao invés de marcar o texto em papel e depois enlouquecer procurando onde a cópia foi parar, as notas são guardadas numa...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Advanced Trigonometry Calculator

    Advanced Trigonometry Calculator

    Precision Trigonometry: Advanced Calculator for Complex Math

    Advanced Trigonometry Calculator is equipped with a user-friendly interface that allows for easy input of problems and instant computation. Professionals such as engineers who need to perform advanced trigonometric calculations in their work will find this tool extremely useful. ATC Online Alpha: https://advantrigoncalc.sourceforge.io/atc/ More info by clicking below: https://advantrigoncalc.sourceforge.io/ Advanced Trigonometry Calculator was only and always only developed by...
    Leader badge
    Downloads: 12 This Week
    Last Update:
    See Project
  • 19
    clavirio

    clavirio

    Learn touch typing without leaving the terminal

    A free, open-source typing tutor for the terminal. Progressive lessons, practice modes, real-time stats, and a virtual keyboard with finger hints — for QWERTY, Dvorak, and Colemak. Methodology Research on typing skill suggests that skilled typing relies more on implicit procedural control than on explicit knowledge of key locations. The paper also suggests that the keyboard is represented in terms of its row-and-column structure, not as a memorized list of individual...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Python Data Science Handbook

    Python Data Science Handbook

    Python Data Science Handbook: full text in Jupyter Notebooks

    The Python Data Science Handbook is a comprehensive collection of Jupyter notebooks written by Jake VanderPlas covering fundamental Python libraries for data science, including IPython, NumPy, Pandas, Matplotlib, Scikit-Learn and more. The project is designed for data scientists, researchers, and anyone transitioning into Python-based data work; it assumes you already know basic Python and focuses more on how to use the ecosystem effectively. Each chapter is a standalone Jupyter notebook,...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 21
    kakagui

    kakagui

    Kakagui is a gui interface for kakasi, the Kanji Kana Simple Inverter

    Kakagui かかぐい, is a gui interface for kakasi, the Kanji Kana Simple Inverter. It can be used by Japanese learners who do not yet have a strong grasp on Kanji to convert text to hiragana/katakana or Romaji to help in reading Japanese.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Enf'encrier

    Enf'encrier

    Editeur de texte pour les enfants

    Enf'encrier est un éditeur de texte simplifié et optimisé pour une utilisation dédiée aux jeunes enfants. Cette application a été développé tout particulièrement pour les enfants en situation de handicap afin de limiter les distractions liées à l'environnement applicatif.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    PersonGen

    PersonGen

    A minor Project in Python which uses the RandomUser API .

    A Small Program in Python That Makes Use of RandomUser API To Generate Random Person Data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Provides a GUI interface to grammatical structure and relations (as parsed by the Stanford Parser) of any text.Contains grammatical relation editor to modify, import, export grammatical relation definitions (tregex patterns and features).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Tensorflow and deep learning

    Tensorflow and deep learning

    A crash course in six episodes for software developers

    Tensorflow and deep learning repository is an educational deep learning crash course designed to help software developers quickly understand and apply machine learning concepts without requiring advanced academic background. It is structured as a series of guided lessons that combine theoretical explanations, practical examples, and runnable code, allowing learners to build intuition while actively experimenting with models. The repository covers core neural network concepts such as weights,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB