Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Natural Language Processing (NLP) Tools
Search Results

Search Results for "processing" - Page 5

x

Sort By:

Relevance

Clear All Filters

OS

Linux 118
Windows 117
Mac 111
More...
BSD 10
ChromeOS 9
Desktop Operating Systems 1
Mobile Operating Systems 1

Category

Artificial Intelligence 124
Software Development 16
Scientific/Engineering 11
Business 4
Education 4
Communications 1
Formats and Protocols 1
Games 1
Multimedia 1
Religion and Philosophy 1
Security 1
Social sciences 1
System 1
Text Editors 1

License

OSI-Approved Open Source 112
Creative Commons Attribution License 2
Public Domain 1

Translations

French 2
Arabic 1
Dutch 1
English 1

Programming Language

Python 124
C++ 4
Java 3
Perl 3
JavaScript 2
More...
Ruby 1
Zope 1

Status

Beta 6
Production/Stable 5
Pre-Alpha 2
Planning 1
More...
Alpha 1

Showing 124 open source projects for "processing"

View related business solutions

Natural Language Processing (NLP) Python Clear Filters & Widen Search

Cloud tools for web scraping and data extraction
Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.

Explore 10,000+ tools
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)

Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.

Learn More
1

Deep Learning Drizzle

Drench yourself in Deep Learning, Reinforcement Learning

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures! Optimization courses which form the foundation for ML, DL, RL. Computer Vision courses which are DL & ML heavy. Speech recognition courses which are DL heavy. Structured Courses on Geometric, Graph Neural Networks. Section on Autonomous Vehicles. Section on Computer Graphics with ML/DL focus.

Downloads: 0 This Week

Last Update: 2022-07-29
See Project
2

InferSent

InferSent sentence embeddings

InferSent is a supervised sentence embedding method that learns universal representations from Natural Language Inference data and transfers well to many downstream tasks. It uses a BiLSTM encoder with max-pooling to produce fixed-length sentence vectors that capture semantics beyond bag-of-words statistics. Trained on large NLI datasets, the embeddings generalize across tasks like sentiment analysis, entailment, paraphrase detection, and semantic similarity with simple linear classifiers....

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
3

Safe Harbor Deidentification

Safe Harbor Deidentification for medical documents

Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.

Downloads: 0 This Week

Last Update: 2019-09-10
See Project
4

NeuroNER

Named-entity recognition using neural networks

...Identified entities can be used in various downstream applications such as patient note de-identification and information extraction systems. They can also be used as features for machine learning systems for other natural language processing tasks. Leverages the state-of-the-art prediction capabilities of neural networks (a.k.a. "deep learning") Is cross-platform, open source, freely available, and straightforward to use. Enables the users to create or modify annotations for a new or existing corpus. Train the neural network that performs the NER. During the training, NeuroNER allows monitoring of the network. ...

Downloads: 0 This Week

Last Update: 2022-08-12
See Project
Grafana: The open and composable observability platform
Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

Grafana is the open source analytics & monitoring solution for every database.

Learn More
5

lazynlp

Library to scrape and clean web pages to create massive datasets

LazyNLP is a lightweight tool for collecting and curating large-scale text datasets for machine learning and NLP applications with minimal manual effort.

Downloads: 0 This Week

Last Update: 2025-01-22
See Project
6

TextRank

TextRank implementation for Python 3

TextRank is an implementation of the TextRank algorithm for extractive text summarization and keyword extraction, inspired by Google’s PageRank.

Downloads: 0 This Week

Last Update: 2025-01-24
See Project
7

Arabic Corpus

Text categorization, arabic language processing, language modeling

...Berkani, (2011) Evaluation of Topic Identification Methods on Arabic Corpora,JOURNAL OF DIGITAL INFORMATION MANAGEMENT,vol. 9, N. 5, pp.185-192. 2) For Khaleej-2004 corpus --------------------------------- M. Abbas, K. Smaili (2005) Comparison of Topic Identification Methods for Arabic Language, RANLP05 : Recent Advances in Natural Language Processing ,pp. 14-17, 21-23 september 2005, Borovets, Bulgary. More useful references to check: ------------------------------------------- https://sites.google.com/site/mouradabbas9/corpora

Downloads: 1 This Week

Last Update: 2019-03-05
See Project
8

CRP - Chemical Reaction Prediction

Predicting Organic Reactions using Neural Networks.

The intend is to solve the forward-reaction prediction problem, where the reactants are known and the interest is in generating the reaction products using Deep learning. This Graphical User Interface takes simplified molecular-input line-entry system (SMILES) as an input and generates the product SMILE & molecule. Beam search is used in Version 2, to generate top 5 predictions. Maximum input length for the model is 15 (excluding spaces).

Downloads: 0 This Week

Last Update: 2018-11-07
See Project
9

DeepLearn

Implementation of research papers on Deep Learning+ NLP+ CV in Python

Welcome to DeepLearn. This repository contains an implementation of the following research papers on NLP, CV, ML, and deep learning. The required dependencies are mentioned in requirement.txt. I will also use dl-text modules for preparing the datasets. If you haven't use it, please do have a quick look at it. CV, transfer learning, representation learning.

Downloads: 0 This Week

Last Update: 2022-08-11
See Project
Rent Manager Software
Landlords, multi-family homes, manufactured home communities, single family homes, associations, commercial properties and mixed portfolios.

Rent Manager is award-winning property management software built for residential, commercial, and short-term-stay portfolios of any size. The program’s fully customizable features include a double-entry accounting system, maintenance management/scheduling, marketing integration, mobile applications, more than 450 insightful reports, and an API that integrates with the best PropTech providers on the market.

Learn More
10

AI learning

AiLearning, data analysis plus machine learning practice

We actively respond to the Research Open Source Initiative (DOCX) . Open source today is not just open source, but datasets, models, tutorials, and experimental records. We are also exploring other categories of open source solutions and protocols. I hope you will understand this initiative, combine this initiative with your own interests, and do what you can. Everyone's tiny contributions, together, are the entire open source ecosystem. We are iBooker, a large open-source community,...

Downloads: 0 This Week

Last Update: 2022-02-18
See Project
11

Seq2Seq Chatbot

Chatbot in 200 lines of code using TensorLayer

Seq2Seq Chatbot is an implementation of a sequence-to-sequence chatbot model using TensorLayer, demonstrating how to build conversational agents with minimal code.

Downloads: 0 This Week

Last Update: 2025-01-30
See Project
12

Seq2seq Chatbot for Keras

This repository contains a new generative model of chatbot

This repository contains a new generative model of chatbot based on seq2seq modeling. The trained model available here used a small dataset composed of ~8K pairs of context (the last two utterances of the dialogue up to the current point) and respective response. The data were collected from dialogues of English courses online. This trained model can be fine-tuned using a closed-domain dataset to real-world applications. The canonical seq2seq model became popular in neural machine...

Downloads: 0 This Week

Last Update: 2023-03-21
See Project
13

TEES

Turku Event Extraction System

Turku Event Extraction System (TEES) is a free and open source natural language processing system developed for the extraction of events and relations from biomedical text. It is written mostly in Python, and should work in generic Unix/Linux environments. Currently, the TEES source code repository still remains on GitHub at http://jbjorne.github.com/TEES/ where there is also a wiki with more information.

Downloads: 0 This Week

Last Update: 2017-05-23
See Project
14

GT NLP Class

Course materials for Georgia Tech CS 4650 and 7650

This repository contains lecture notes, slides, assignments, and code for a university-level Natural Language Processing course. It spans core NLP topics such as language modeling, sequence tagging, parsing, semantics, and discourse, alongside modern machine learning methods used to solve them. Students work through programming exercises and problem sets that build intuition for both classical algorithms (like HMMs and CRFs) and neural approaches (like word embeddings and sequence models). ...

Downloads: 0 This Week

Last Update: 2025-09-28
See Project
15

BioC

We describe a simple XML format to share text documents and annotation

A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for textmining.

Downloads: 0 This Week

Last Update: 2016-08-08
See Project
16

VADER

Lexicon and rule-based sentiment analysis tool

VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool designed for analyzing the sentiment of text, particularly in social media and short text formats. It is optimized for quick and accurate analysis of positive, negative, and neutral sentiments.

Downloads: 7 This Week

Last Update: 2025-02-20
See Project
17

tnets

Theano-based (Deep) Neural Networks for Speech Research

Alpha tool build on the top of Theano and delibaretly tuned for ASR/NLP tasks.

Downloads: 0 This Week

Last Update: 2015-11-19
See Project
18

TextBlob

TextBlob is a Python library for processing textual data

Simple, Pythonic, text processing, Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more. TextBlob stands on the giant shoulders of NLTK and pattern, and plays nicely with both.

Downloads: 1 This Week

Last Update: 2021-07-23
See Project
19

Corpus redundancy manager

Redundancy due to cut-paste operations in text creates bias in machine learning for NLP. This module takes a directory and produces a subset of the files in that directory (in a list) with an upper bound on similarity between two files.

Downloads: 0 This Week

Last Update: 2014-06-30
See Project
20

Sylli

Sylli is a universal syllabifier. Developed for Italian, it can easily be adapted to any language that is claimed to respect the SSP. Sylli divides timit, strings, files and directories into syllables.

Downloads: 0 This Week

Last Update: 2012-10-15
See Project
21

MutationFinder

MutationFinder is a biomedical natural language processing (NLP) system for extracting mentions of point mutations from free text. MutationFinder achieves high performance (99% precision, 81% recall on blind test data) as an information extraction system

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
22

Cathnet

Cathnet is developing the infrastructure for the Catholic Semantic Web. Technologies involved include, but are not limited to, XML, RDF, NLP, Zope, Plone and Plone products.

Downloads: 0 This Week

Last Update: 2015-12-07
See Project
23

Net4MRLW

Collection of Python scripts providing interactions between the sociological investigation platform and webservices (such as NLP, search engine, web database).

Downloads: 0 This Week

Last Update: 2013-03-14
See Project
24

webXcreta

webXcreta users natural language processing to create grammatical averages of textual communication and then generate original content based on these statistics.

Downloads: 0 This Week

Last Update: 2014-05-06
See Project

Previous
1
2
3
4
You're on page 5
Next

Related Searches

annotation

deep learning

arabic corpus

chemical reaction software

ai

sentiment analysis

syllables

arabiccorpus

thermodynamic

cheminformatics

Related Categories

Artificial Intelligence

Software Development

Scientific/Engineering

Business

Education

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

×

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: