Search Results for "process"

Showing 18 open source projects for "process"

Filters: Natural Language Processing (NLP), Windows

1

PaddleNLP

Easy-to-use and powerful NLP library with Awesome model zoo

PaddleNLP is a natural language processing development library for PaddlePaddle. Its three major features are an easy-to-use text-domain API, application examples for multiple scenarios, and high-performance distributed training. It aims to improve PaddlePaddle developers' modeling efficiency in the text domain and provides rich examples of NLP applications. It offers Taskflow, which provides rich industry-level preset task capabilities, and a text-domain API that covers the whole process: the Dataset API supports loading rich Chinese datasets, the Data API completes data preprocessing flexibly and efficiently, the Embedding API presets 60+ pre-trained word vectors, and the Transformer API provides 100+ pre-trained models. Together these can greatly improve the efficiency of NLP task modeling.

Downloads: 0 This Week

Last Update: 2025-05-21
See Project
2

Datasets

Hub of ready-to-use datasets for ML models

...Smart caching: never wait for your data to process several times.

Downloads: 0 This Week

Last Update: 2026-06-05
See Project
3

ModelScope

Bring the notion of Model-as-a-Service to life

ModelScope is built upon the notion of “Model-as-a-Service” (MaaS). It seeks to bring together the most advanced machine learning models from the AI community and to streamline the process of leveraging AI models in real-world applications. The core ModelScope library open-sourced in this repository provides the interfaces and implementations that allow developers to perform model inference, training, and evaluation. In particular, with rich layers of API abstraction, the ModelScope library offers a unified experience for exploring state-of-the-art models across domains such as CV, NLP, Speech, Multi-Modality, and Scientific Computation. ...

Downloads: 2 This Week

Last Update: 2026-05-21
See Project
4

Spark NLP

State of the Art Natural Language Processing

...Spark ML provides a set of machine learning applications that can be built using two main components: estimators and transformers. An estimator has a method that fits (trains) on a piece of data for such an application. A transformer is generally the result of a fitting process and applies changes to the target dataset. These components have been embedded so that they apply to Spark NLP. Pipelines are a mechanism for combining multiple estimators and transformers in a single workflow. They allow multiple chained transformations along a machine-learning task.

Downloads: 0 This Week

Last Update: 2026-05-25
See Project
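
The estimator, transformer, and pipeline pattern described in the Spark NLP entry can be sketched in plain Python. This is only an illustration of the idea, not the Spark ML or Spark NLP API: every class name below (Lowercaser, VocabFilter, VocabFilterModel, Pipeline, PipelineModel) is hypothetical, and the data is a list of strings rather than a Spark DataFrame.

```python
from collections import Counter


class Lowercaser:
    """Transformer (hypothetical): applies a fixed change, needs no fitting."""

    def transform(self, rows):
        return [row.lower() for row in rows]


class VocabFilterModel:
    """Transformer (hypothetical) that results from fitting a VocabFilter."""

    def __init__(self, vocab):
        self.vocab = vocab

    def transform(self, rows):
        # Keep only the words that were frequent enough in the training data.
        return [" ".join(w for w in row.split() if w in self.vocab) for row in rows]


class VocabFilter:
    """Estimator (hypothetical): fit() learns from data and returns a transformer."""

    def __init__(self, min_count):
        self.min_count = min_count

    def fit(self, rows):
        counts = Counter(w for row in rows for w in row.split())
        return VocabFilterModel({w for w, c in counts.items() if c >= self.min_count})


class PipelineModel:
    """Fitted pipeline: a chain of transformers applied in order."""

    def __init__(self, stages):
        self.stages = stages

    def transform(self, rows):
        for stage in self.stages:
            rows = stage.transform(rows)
        return rows


class Pipeline:
    """Combines estimators and transformers in a single workflow."""

    def __init__(self, stages):
        self.stages = stages

    def fit(self, rows):
        fitted = []
        for stage in self.stages:
            # Estimators are fitted on the data as transformed so far;
            # plain transformers are used as they are.
            model = stage.fit(rows) if hasattr(stage, "fit") else stage
            rows = model.transform(rows)
            fitted.append(model)
        return PipelineModel(fitted)


model = Pipeline([Lowercaser(), VocabFilter(min_count=2)]).fit(
    ["The cat", "the dog", "A cat"]
)
print(model.transform(["The DOG and the cat"]))
```

Fitting the pipeline turns each estimator into a transformer, so the fitted result is itself a chain of transformers that can be applied to new data.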
5

find-similar

User-friendly library to find similar objects

The mission of the FindSimilar project is to provide a powerful and versatile open source library that empowers developers to efficiently find similar objects and perform comparisons across a variety of data types. Whether dealing with texts, images, audio, or more, our project aims to simplify the process of identifying similarities and enhancing decision-making.
https://github.com/findsimilar/find-similar - GitHub repo
http://demo.findsimilar.org/ - Demo project and tutorial
https://docs.findsimilar.org/ - Documentation

1 Review

Downloads: 0 This Week

Last Update: 2023-11-12
See Project
6

Graph4NLP

Graph4nlp is the library for the easy use of Graph Neural Networks

Graph4NLP is an easy-to-use library for R&D at the intersection of Deep Learning on Graphs and Natural Language Processing (i.e., DLG4NLP). It provides both full implementations of state-of-the-art models for data scientists and flexible interfaces to build customized models for researchers and developers, with whole-pipeline support. Built upon highly optimized runtime libraries including DGL, Graph4NLP has both high running efficiency and great extensibility. The architecture of...

Downloads: 0 This Week

Last Update: 2022-08-16
See Project
7

MITRE Annotation Toolkit

A toolkit for managing and manipulating text annotations

The MITRE Annotation Toolkit (MAT) is a suite of tools which can be used for automated and human tagging of annotations. Annotation is a process, used mostly by researchers in natural language processing, of enhancing documents with information about the various phrase types the documents contain. MAT supports both UI interaction and command-line interaction, and provides various levels of control over the overall annotation process. It can be customized for specific tasks (e.g., named entity identification, de-identification of medical records). ...

Downloads: 1 This Week

Last Update: 2023-04-19
See Project
8

fastNLP

fastNLP: A Modularized and Extensible NLP Framework

fastNLP is a lightweight framework for natural language processing (NLP). Its goal is to quickly implement NLP tasks and build complex models. A unified Tabular data container simplifies data preprocessing. Built-in Loader and Pipe components cover multiple datasets, eliminating the need for preprocessing code. It offers various convenient NLP tools, such as Embedding loading (including ELMo and BERT) and an intermediate data cache. It also provides a variety of neural network components and recurrence models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, metaphor resolution, summarization, etc.). ...

Downloads: 0 This Week

Last Update: 2022-08-05
See Project
9

GluonNLP

NLP made easy

GluonNLP is a toolkit that helps you solve NLP problems. It provides easy-to-use tools that help you load text data, process text data, and train models. To support both engineers and researchers, we provide command-line toolkits for downloading and processing NLP datasets. GluonNLP makes it easy to evaluate and train word embeddings. Here are examples to evaluate the pre-trained embeddings included in the GluonNLP toolkit, as well as example scripts for training embeddings on custom datasets. ...

Downloads: 0 This Week

Last Update: 2022-08-08
See Project
10

Bracket Based Arabic Annotation

...Different types of tag markers can be incorporated, e.g. grammatical, functional, semantic, and linguistic markers. Tag sets can be configured (modified or extended) by accessing the related table in the supporting database. The user can upload text files, whose sentences are normalized and inserted into the supporting database. Multiple narratives can be listed in one text file, separated by a # symbol. The text upload process entails the initial part-of-speech (POS) tagging of the uploaded text using the Stanford POS tagger. The user can later modify and extend the initial tagging. The resulting annotations are stored in the supporting database. These results can be exported to Excel or text files for further processing.

Downloads: 1 This Week

Last Update: 2017-02-20
See Project
11

BioC

We describe a simple XML format to share text documents and annotation

...
- BioC-formatted corpora
- BioC tools that work with BioC corpora

BioC goals
- simplicity
- interoperability
- broad use
- reuse

There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for text mining.

Downloads: 0 This Week

Last Update: 2016-08-08
See Project
12

Phrasal

Statistical phrase-based machine translation system

...Developed by The Natural Language Processing Group at Stanford University, a team of faculty, postdocs, programmers and students who work together on algorithms that allow computers to process and understand human languages. Our work ranges from basic research in computational linguistics to key applications in human language technology, and covers areas such as sentence understanding, automatic question answering, machine translation, syntactic parsing and tagging, sentiment analysis.

Downloads: 0 This Week

Last Update: 2021-01-19
See Project
13

KALIMAT Multipurpose Arabic Corpus

A corpus that could be of help for researchers working on Arabic NLP

KALIMAT: a Multipurpose Arabic Corpus

We are pleased to announce the immediate availability of KALIMAT 1.0. KALIMAT is an Arabic natural language resource that consists of:
1) 20,291 Arabic articles collected from the Omani newspaper Alwatan by Abbas et al. (2011).
2) 20,291 extractive single-document system summaries.
3) 2,057 extractive multi-document system summaries.
4) 20,291 named-entity-recognised articles.
5) 20,291 part-of-speech-tagged articles.
6) 20,291 morphologically analysed articles.

The articles in the data collection fall into six categories: culture, economy, local-news, international-news, religion, and sports. The process of creating KALIMAT was applied to the entire data collection (20,291 articles).

Downloads: 9 This Week

Last Update: 2015-04-09
See Project
14

romanize

Romanizing 9 Indian languages (Unicode) to English alphabets

This project is step one in any NLP project. Romanization is normally done using ASCII and extended ASCII syllables, which are easy to process but difficult to work with. The Romanize project converts Indian languages in their Unicode form to the English alphabet. Compared with existing romanization schemes, this project focuses on a few main points: readability, easy typability, English alphabet combinations only, incorporation of existing popular schemes, phonetically equivalent transliterations, and, most importantly, non-ambiguity across the nine languages using the same transliteration mapping set.

Downloads: 0 This Week

Last Update: 2013-05-30
See Project
15

CRFSharp

CRFSharp is a .NET(C#) implementation of Conditional Random Field

...Currently, when training on a corpus, CRF# can, compared with CRF++, make full use of multi-core CPUs while using very little memory, and its memory usage grows smoothly and slowly as the size of the training corpus and the number of tags increase. With multi-threaded processing, CRF# is now more suitable than CRF++ for training on large data and tag sets. For example, on a machine with 64 GB, CRF# quickly encodes a model with more than 450 million features.

Downloads: 0 This Week

Last Update: 2015-08-03
See Project
16

Alkhalil Morpho Sys

Alkhalil Morpho Sys is a morphosyntactic parser of Arabic words. The system can process non-vocalized texts as well as partially or totally vocalized ones. Our approach is based on modelling a very large set of Arabic morphological rules, and also on integrating linguistic resources, such as the root database, vocalized patterns associated with roots, and proclitic and enclitic tables. The output of the analysis is a highly informative table that mainly contains the vocalization of the stem, its grammatical category, its possible roots with their corresponding patterns, and its proclitics and enclitics. ...

Downloads: 6 This Week

Last Update: 2017-06-05
See Project
17

Birbal

Birbal is an AI project for answering common questions. It uses natural language processing to accept queries in any form and searches a database for the most appropriate answers. The project includes a user-guided learning process.

Downloads: 0 This Week

Last Update: 2016-08-07
See Project
18

JWebPro: A Java Web Processing Toolkit

JWebPro: A Java tool that can interact with Google search and then process the returned Web documents in a couple of ways. The outputs can serve as inputs for NLP, IR, information extraction, Web mining, and online social network extraction/analysis applications.

Downloads: 0 This Week

Last Update: 2013-03-13
See Project

© 2026 Slashdot Media. All Rights Reserved.
