Showing 554 open source projects for "language processing"

  • 1
    IceNLP is an open source Natural Language Processing (NLP) toolkit for analyzing and processing Icelandic text. The toolkit is implemented in Java.
    Downloads: 1 This Week
  • 2

    popt4jlib

    Parallel Optimization Library for Java

    popt4jlib is an open-source parallel optimization library for the Java programming language supporting both shared memory and distributed message passing models. Implements a number of meta-heuristic algorithms for Non-Linear Programming, including Genetic Algorithms, Differential Evolution, Evolutionary Algorithms, Simulated Annealing, Particle Swarm Optimization, Firefly Algorithm, Monte-Carlo Search, Local Search algorithms, Gradient-Descent-based algorithms, as well as some well-known...
    Downloads: 0 This Week
  • 3
    AI learning

    AiLearning, data analysis plus machine learning practice

    We actively respond to the Research Open Source Initiative (DOCX). Open source today means not just source code, but also datasets, models, tutorials, and experimental records. We are also exploring other categories of open source solutions and protocols. I hope you will understand this initiative, combine it with your own interests, and do what you can. Everyone's tiny contributions, together, make up the entire open source ecosystem. We are iBooker, a large open-source community,...
    Downloads: 0 This Week
  • 4

    Service Grid - Language Grid Base System

    SOA infrastructure initially developed by the NICT Language Grid Project

    Service Grid is an infrastructure for accumulating and sharing Web services. Resources with complicated intellectual property issues are wrapped as Web services and shared on the Service Grid. If you release your software by using the software of this project, please include the following description in the documents or on the website. * This software uses the [SOFTWARE] by the Language Grid project (http://langrid.org/). [SOFTWARE] is one of: * Service Grid Server Software...
    Downloads: 0 This Week
  • 5
    Seq2Seq Chatbot

    Chatbot in 200 lines of code using TensorLayer

    Seq2Seq Chatbot is an implementation of a sequence-to-sequence chatbot model using TensorLayer, demonstrating how to build conversational agents with minimal code.
    Downloads: 0 This Week
  • 6

    file_lemmater

    text file quick lemmater

    This executable reads a text file (named "in.txt", located in the same folder as the executable) and creates a file called "out.txt" with the same content, but with each noun, adjective, or verb lemmatized. It comes from the Aseryla system (https://memla.000webhostapp.com/index.html), which combines the Stanford Core NLP (https://stanfordnlp.github.io/CoreNLP/index.html) and the CSTlemmatiser (http://cst.dk/online/lemmatiser/uk/).
    Downloads: 0 This Week
  • 7

    Semantic Assistants

    Natural Language Processing (NLP) for the Masses

    Semantic Assistants support users in content retrieval, analysis, and development, by offering context-sensitive NLP services directly integrated in standard desktop clients, like a word processor, and web information systems, like a wiki.
    Downloads: 0 This Week
  • 8
    DMTK

    Microsoft Distributed Machine Learning Toolkit

    ...At its core is a parameter-server architecture called Multiverso, which manages model parameters and synchronizes updates across distributed training processes. This architecture allows developers to build machine learning systems capable of processing massive datasets and training complex models with reduced infrastructure requirements. DMTK also includes several specialized algorithms and systems, such as LightLDA for large-scale topic modeling and distributed implementations of word embedding techniques used in natural language processing.
    Downloads: 0 This Week
  • 9
    Seq2seq Chatbot for Keras

    This repository contains a new generative model of chatbot

    This repository contains a new generative chatbot model based on seq2seq modeling. The trained model available here was trained on a small dataset of ~8K pairs, each consisting of a context (the last two utterances of the dialogue up to the current point) and the respective response. The data were collected from dialogues in online English courses. The trained model can be fine-tuned on a closed-domain dataset for real-world applications. The canonical seq2seq model became popular in neural machine...
    Downloads: 0 This Week
  • 10
    Welsh Natural Language Toolkit
    The project supports the Welsh Language Technology domain with a set of NLP tools that drive innovation and advance the development of sophisticated textual analysis solutions. The WNLT project delivers four core NLP modules: a) Word Segmentation, for separating text into words; b) Sentence Boundary Disambiguation, for finding sentence boundaries; c) Part of Speech Tagger, for determining the part of speech of each word; d) Morphological Analyser, for identifying the root form (lemma) of words....
    Downloads: 0 This Week
  • 11

    TEES

    Turku Event Extraction System

    Turku Event Extraction System (TEES) is a free and open source natural language processing system developed for the extraction of events and relations from biomedical text. It is written mostly in Python, and should work in generic Unix/Linux environments. Currently, the TEES source code repository still remains on GitHub at http://jbjorne.github.com/TEES/ where there is also a wiki with more information.
    Downloads: 0 This Week
  • 12
    AIVA (A.I. Virtual Assistant)

    AIVA (A.I. Virtual Assistant): General-purpose virtual assistant

    AIVA is a general-purpose virtual assistant designed for developers, enabling the creation of customizable AI assistants for various applications.
    Downloads: 1 This Week
  • 13
    The Bracket Based Arabic Annotation (B2A2) scheme lets users manually tag Arabic text with Part-of-Speech (POS) markers. B2A2 introduces a new approach that enables tagging Arabic text using morphology-aware tag markers. Different types of tag markers can be incorporated, e.g. grammatical, functional, semantic, and linguistic markers. Tag-sets can be configured (modified/extended) by accessing the related table in the supporting database. The user can upload text...
    Downloads: 0 This Week
  • 14
    ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for Chinese and English. Currently supports word segmentation, POS tagging, and dependency and phrase-structure parsing.
    Downloads: 4 This Week
  • 15
    GT NLP Class

    Course materials for Georgia Tech CS 4650 and 7650

    This repository contains lecture notes, slides, assignments, and code for a university-level Natural Language Processing course. It spans core NLP topics such as language modeling, sequence tagging, parsing, semantics, and discourse, alongside modern machine learning methods used to solve them. Students work through programming exercises and problem sets that build intuition for both classical algorithms (like HMMs and CRFs) and neural approaches (like word embeddings and sequence models). ...
    Downloads: 2 This Week
  • 16

    JCLTP

    A Java Class Library for Text Processing

    JCLTP is a class library designed for processing text. JCLTP is free, open source, and developed in the Java programming language. It is distributed under the GNU license. It incorporates several technologies for processing information with AI techniques in order to build predictive models for text classification. A flexible structure of interfaces and classes makes it possible to extend, adapt, and add functionality to JCLTP.
    Downloads: 2 This Week
  • 17
    NobleTools

    Collection of NLP Tools developed at DBMI at University of Pittsburgh

    A collection of NLP tools developed at the Department of BioMedical Informatics at the University of Pittsburgh. The set of tools includes a generic Terminology and Ontology API, a Named Entity Recognition (NER) tool called NobleCoder, an Information Extraction (IE) framework, negation detection, and more.
    Downloads: 0 This Week
  • 18
    Ansj Chinese word segmentation

    Ansj word segmentation

    ...At present, it implements Chinese word segmentation, Chinese name recognition, user-defined dictionaries, keyword extraction, automatic summarization, and keyword tagging. It can be applied to natural language processing and related tasks, and suits projects that require high-quality word segmentation.
    Downloads: 4 This Week
  • 19

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: simple code to hold/read/write data and perform sample processing; BioC-formatted corpora; BioC tools that work with BioC corpora. BioC goals: simplicity, interoperability, broad use, reuse. There should be little investment required to learn to use a format or a software module to process that format. We are...
    Downloads: 5 This Week
  • 20
    OpenCCG, the OpenNLP CCG Library, is a collection of natural language processing components and tools which provide support for parsing and realization with Combinatory Categorial Grammar (CCG).
    Downloads: 4 This Week
  • 21
    This ohnlp project has released "pipelines" that were contributed by members of the OHNLP Consortium. The pipelines are based on the Apache UIMA framework. medKAT/P, MedCoref, MedTagger, MedXN, and cTAKES are licensed under Apache License V2.0. MedTime is licensed under GNU General Public License version 3.0 (GPLv3). cTAKES development has moved to apache.org. See http://ctakes.apache.org/
    Downloads: 0 This Week
  • 22
    Phrasal

    Statistical phrase-based machine translation system

    ...Distinctive features include an easy-to-use API for implementing new decoding model features, the ability to translate using phrases that include gaps (Galley et al. 2010), and conditional extraction of phrase-tables and lexical reordering models. Developed by The Natural Language Processing Group at Stanford University, a team of faculty, postdocs, programmers and students who work together on algorithms that allow computers to process and understand human languages. Our work ranges from basic research in computational linguistics to key applications in human language technology, and covers areas such as sentence understanding, automatic question answering, machine translation, syntactic parsing and tagging, and sentiment analysis.
    Downloads: 1 This Week
  • 23

    Rootvole

    a text parsing library that matches text with concepts.

    ...The central class is the parser class, which is instantiated as a series of vocabularies, simple text lists that describe tokens and synonyms, and value descriptors. It's meant as a lightweight alternative to "real" NLP (natural language processing).
    Downloads: 0 This Week
  • 24

    cyborg

    Cyborg is a cybernetic organism project written with Ada programming

    Cyborg is a cybernetic organism project written in the Ada programming language using the PragmARC library; developed on a GNU/Linux system with a 160x64-character terminal.
    Downloads: 0 This Week
  • 25
    Welsh Natural Language Toolkit

    WNLT is a suite of open source natural language modules for the Welsh

    The project supports the Welsh Language Technology domain with a set of NLP tools that drive innovation and advance the development of sophisticated textual analysis solutions. The WNLT project delivers four core NLP modules: a) Word Segmentation, for separating text into words; b) Sentence Boundary Disambiguation, for finding sentence boundaries; c) Part of Speech Tagger, for determining the part of speech of each word; d) Morphological Analyser, for identifying the root form (lemma) of words....
    Downloads: 0 This Week