Showing 59 open source projects for "natural language processing"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 1
    Stanza

    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of speech and morphological features, to give a syntactic structure dependency parse, and to recognize named entities. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Dict UK

    Dict UK

    Project to generate POS tag dictionary for Ukrainian language

    A Java-based tool for generating full morphological dictionaries for Ukrainian, applying affix rules to base lexemes to produce all inflected forms with part-of-speech tags—used for natural language processing and spell-checking.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    Companion notebooks for Deep Learning

    Companion notebooks for Deep Learning

    Jupyter notebooks for the code samples of the book

    Companion notebooks for Deep Learning is a collection of Jupyter notebooks that accompany François Chollet’s deep learning curriculum, providing hands-on implementations of key concepts using practical examples. The project covers a wide range of topics, including neural networks, computer vision, natural language processing, and sequence modeling. Each notebook is structured to combine theoretical explanations with executable code, allowing users to experiment and learn interactively. The material is designed to be accessible while still covering advanced topics, making it suitable for both beginners and intermediate practitioners. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Roadmap To Learn Generative AI In 2025

    Roadmap To Learn Generative AI In 2025

    Basic Machine Learning Natural Language Processing Roadmap

    Roadmap To Learn Generative AI In 2025 is a curated learning path focused on contemporary generative AI — covering large language models (LLMs), diffusion-based image generation, prompt engineering, multi-modal AI, fine-tuning techniques, and the practical considerations for deploying generative models. It’s aimed at learners and developers who already have some programming or ML basics and wish to specialize in generative AI, offering a modern, structured plan that reflects the state of the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    PocketFlow Tutorial Codebase Knowledge
    PocketFlow Tutorial Codebase Knowledge is a project that demonstrates how to build an AI agent capable of analyzing arbitrary codebases and generating beginner-friendly tutorials that explain how they work, turning complex source code into clear educational content. The repository builds on a lightweight 100-line LLM framework and uses natural language models to inspect repository structures, identify core abstractions, map dependencies, and articulate the reasoning behind code design and interactions. By crawling code files, extracting higher-level patterns, and using large language models to narrate explanations, the system aims to help developers — especially those new to a codebase — understand unfamiliar projects without manual deep reading. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    The Grand Complete Data Science Guide

    The Grand Complete Data Science Guide

    Data Science Guide With Videos And Materials

    The Grand Complete Data Science Materials is a repository curated by a data-science educator that aggregates a wide range of learning resources — from basic programming and math foundation to advanced topics in machine learning, deep learning, natural language processing, computer vision, and deployment practices — into a structured, centralized collection aimed at learners seeking a comprehensive path to data science mastery. The repository bundles tutorials, lecture notes, project outlines, course materials, and references across topics like Python, statistics, ML algorithms, deep learning, NLP, data preprocessing, model evaluation, and real-world problem solving. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ChordSmith

    ChordSmith

    Chordpro editor that can display, transpose and print song sheets.

    ChordSmith is a chordpro editor that can display, transpose and print song sheets containing chords and lyrics. It can also edit and convert song sheet formats (including Harmonica tabs) between chordpro format (chords in square brackets in line with lyrics) and two-line format (chords above lyrics). You can find many free sources of song sheets in both formats on the Internet. More information at https://chordsmith.sourceforge.io/ Here are just a few of ChordSmith's many features:
    Downloads: 53 This Week
    Last Update:
    See Project
  • 8
    Statistics101 - Resampling Statistics

    Statistics101 - Resampling Statistics

    Use simulation to perform statistical analyses.

    Statistics101 is an Integrated Development Environment (IDE) that uses a simple, powerful language called “Resampling Stats” to develop Monte Carlo programs to analyze and solve statistical problems. The original Resampling Stats language and computer program were developed by Dr. Julian Simon (https://www.juliansimon.com/) and Peter Bruce (https://www.scientificamerican.com/author/peter-bruce/) as a new way to teach Statistics to social science students. Of course, social science students...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    IK Analysis for Elasticsearch

    IK Analysis for Elasticsearch

    A plugin that integrates Lucene IK analyzer into elasticsearch

    IK Analyzer is an open source, lightweight Chinese word segmentation toolkit developed based on java language. Since the release of version 1.0 in December 2006, IKAnalyzer has launched 4 major versions. Initially, it was a Chinese word segmentation component based on the open source project Luence as the main application, combined with dictionary word segmentation and grammar analysis algorithms. Starting from version 3.0, IK has developed into a common word segmentation component for Java,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Fuzzy machine learning framework

    Fuzzy machine learning framework

    A library and a GUI front-end for fuzzy machine learning

    Fuzzy machine learning framework is a library and a GUI front-end for machine learning using intuitionistic fuzzy data. The approach is based on the intuitionistic fuzzy sets and the possibility theory. Further characteristics are fuzzy features and classes; numeric, enumeration features and features based on linguistic variables; user-defined features; derived and evaluated features; classifiers as features for building hierarchical systems; automatic refinement in case of dependent...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    AllenNLP

    AllenNLP

    An open-source NLP research library, built on PyTorch

    AllenNLP makes it easy to design and evaluate new deep learning models for nearly any NLP problem, along with the infrastructure to easily run them in the cloud or on your laptop. AllenNLP includes reference implementations of high quality models for both core NLP problems (e.g. semantic role labeling) and NLP applications (e.g. textual entailment). AllenNLP supports loading "plugins" dynamically. A plugin is just a Python package that provides custom registered classes or additional...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    paip-lisp

    paip-lisp

    Lisp code for the textbook "Paradigms of Artificial Intelligence"

    ...It contains implementations and examples from the book, covering classic AI techniques through Lisp programs. The repository includes systems for symbolic reasoning, search, rule-based inference, natural language processing, pattern matching, logic programming, and game-playing examples. It is valuable for readers who want to study the original code while working through the text or revisiting older AI ideas. The code also serves as a historical reference for how AI programming was taught before today’s deep-learning-centered ecosystem. paip-lisp is best suited for learners interested in Lisp, symbolic AI, and the foundations of practical AI programming.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Machine Learning PyTorch Scikit-Learn

    Machine Learning PyTorch Scikit-Learn

    Code Repository for Machine Learning with PyTorch and Scikit-Learn

    ...The first half of the book introduces readers to machine learning using scikit-learn, the defacto approach for working with tabular datasets. Then, the second half of this book focuses on deep learning, including applications to natural language processing and computer vision.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    DeepLearning

    DeepLearning

    Deep Learning (Flower Book) mathematical derivation

    ...At the same time, it also introduces deep learning techniques used by practitioners in the industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling and practical methods, and investigates topics such as natural language processing, Applications in speech recognition, computer vision, online recommender systems, bioinformatics, and video games. Finally, the Deep Learning book provides research directions covering theoretical topics including linear factor models, autoencoders, representation learning, structured probabilistic models, etc.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    PRMLT

    PRMLT

    Matlab code of machine learning algorithms in book PRML

    This Matlab package implements machine learning algorithms described in the great textbook: Pattern Recognition and Machine Learning by C. Bishop (PRML). It is written purely in Matlab language. It is self-contained. There is no external dependency. This package requires Matlab R2016b or latter, since it utilizes a new Matlab syntax called Implicit expansion (a.k.a. broadcasting). It also requires Statistics Toolbox (for some simple random number generator) and Image Processing Toolbox (for reading image data). The code is extremely compact. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    MACE Algebraic Calculator

    MACE Algebraic Calculator

    MACE is a Mathematical program for Algorithms, Calculations, Equations

    .... + A Signal Processing section including the (fast) (inverse) Fourier transform, several filters and fir / iir Systems + bug fixes
    Downloads: 8 This Week
    Last Update:
    See Project
  • 17
    mbFXWords

    mbFXWords

    Analyze text. Diagonal read subject, predicate, obj. Search other pdf.

    Version 1.04. Applies and builds upon Apache OpenNLP. For English, French and German files. JavaFX Application, runs with Oracle Java Runtime Environment version 8 that is including JavaFX. NLP extensions: - Divide sentences in subclauses: segmentation. - Divide plain text: subject, predicate, object. - Count words: stemming. - Search for similar content: pdf's. Gives out subject, predicate and object of sentences of pdf and plain text files. Provides comfortable GUI. Automatic...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18

    OpenRTDynamics

    A framework for implementing real-time control systems

    - - - T H I S P R O J E C T M O V E D T O G I T H U B - - - A framework for the implementation of advanced real-time control systems which uses signal/block-based schematics (like Simulink) extended by several features like state machines and online-exchangeable sub-simulations. Besides, this framework properly handles multiple threads, their communication, allows to synchronise control systems to external events (e.g. variable timers or incoming network packages) and provides...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    subs2srs

    subs2srs

    Convert movies and TV shows to flashcards

    subs2srs allows you to create import files for Anki or other Spaced Repetition Systems (SRS) based on your favorite foreign language movies and TV shows to aid in the language learning process. See http://subs2srs.sourceforge.net/ for more information.
    Leader badge
    Downloads: 39 This Week
    Last Update:
    See Project
  • 20
    Benkyou Studio

    Benkyou Studio

    Benkyou Studio is a Language study toolkit.

    Benkyou Studio is intended to be a One-stop integrated solution for working and learning with languages, For the Learner, it has flashcards,A multi choice Quiz which remembers and adjusts to the words you are struggling with, Speech Synthesis helps you hear the words as you study, you can even export the wordlist to sound files for your portable music player For the Professional, it has unicode lookup and converters, a character map viewer and exporter, a text file converter for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    LunaIDE is a powerfull IDE for Lua programming language and also have support for XML files. Languages: Spanish, English and Portuguese. Note: Lua is licenced under the MIT License and LunaIDE is licensed under the GNU/GPL v3. The creator of this software do not have any kind of relation with PUC-Rio.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Virastyar

    Virastyar

    Virastyar is an spell checker for low-resource languages

    Virastyar is a free and open-source (FOSS) spell checker. It stands upon the shoulders of many free/libre/open-source (FLOSS) libraries developed for processing low-resource languages, especially Persian and RTL languages Publications: Kashefi, O., Nasri, M., & Kanani, K. (2010). Towards Automatic Persian Spell Checking. SCICT. Kashefi, O., Sharifi, M., & Minaie, B. (2013). A novel string distance metric for ranking Persian respelling suggestions. Natural Language Engineering, 19(2), 259-284. ...
    Leader badge
    Downloads: 315 This Week
    Last Update:
    See Project
  • 23
    Yes, finally. The author decide to schedule some of his busy time to release the "Chinese Language Formula"- Step By Step. It will take a while, but eventually Chinese Language Processing is no longer a problem.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    AILOKS

    A polylingual dictionary/ontology system

    AILOKS is an acronym for Artificial Intelligence Linguistic Object Knowledge System. This dictionary is aimed to facilitate advanced NLP, translation engines and provide an effective dictionary interface to humans. This dictionary is Polylingual and should prove useful to language students, linguists and translators alike. It is part of a series of interrelated projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ...Subject Verb Plurality Agreement Example: "They walks into a classroom." This software utilizes parts-of-speech tagging software that was developed and published by the Natural Language Processing Group at Stanford University. Many thanks!!! (Full citation in README)
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next