Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Natural Language Processing (NLP) Tools
Search Results

Search Results for "command-line"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 44
Windows 39
Mac 36
More...
BSD 17
ChromeOS 13
Desktop Operating Systems 5
Server Operating Systems 2
Game Consoles 1
Mobile Operating Systems 1

Category

Artificial Intelligence 46
Scientific/Engineering 25
Software Development 7
Education 3
System 3
Business 2
Internet 2
Multimedia 2
Text Editors 2
Communications 1
Database 1
Games 1

License

OSI-Approved Open Source 40
Public Domain 2
Creative Commons Attribution License 1
GNU Free Documentation License 1
More...
Other License 1

Translations

English 14
Chinese (Simplified) 2
Arabic 1
Dutch 1
More...
French 1
German 1
Greek 1
Hebrew 1
Italian 1
Latin 1
Norwegian 1
Portuguese 1
Romanian 1
Russian 1
Spanish 1
Swedish 1

Programming Language

Java 22
Python 18
Perl 7
C 2
More...
C++ 2
JavaScript 2
C# 1
IDL 1
TypeScript 1
Unix Shell 1
XSL (XSLT/XPath/XSL-FO) 1

Status

Beta 11
Production/Stable 9
Alpha 6
Planning 3
More...
Pre-Alpha 2

Showing 46 open source projects for "command-line"

View related business solutions

Natural Language Processing (NLP) Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
Context for your AI agents
Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.

Try for free
1

Stanford CoreNLP

Stanford CoreNLP, a Java suite of core NLP tools

CoreNLP is your one stop shop for natural language processing in Java! CoreNLP enables users to derive linguistic annotations for text, including token and sentence boundaries, parts of speech, named entities, numeric and time values, dependency and constituency parses, coreference, sentiment, quote attributions, and relations. CoreNLP currently supports 6 languages, Arabic, Chinese, English, French, German, and Spanish. The centerpiece of CoreNLP is the pipeline. Pipelines take in raw text,...

Downloads: 1 This Week

Last Update: 2025-06-07
See Project
2

diff2html

Pretty diff to html javascript library (diff2html)

Each diff provides a comprehensive visualization of the code changes, helping developers identify problems and better understand the changes. Each diff features a line-by-line and side-by-side preview of your changes. All the code changes are syntax highlighted using highlight.js, providing more readability. Similar lines are paired, allowing for easier change tracking. We work hard to make sure you can have your diffs in a simple and flexible way. The AI community building the future. Build, train and deploy state of the art models powered by the reference open source in natural language processing. ...

Downloads: 2 This Week

Last Update: 2026-01-02
See Project
3

Open Interpreter

A natural language interface for computers

Open Interpreter is an open-source tool that provides a natural-language interface for interacting with your computer. It lets large language models (LLMs) run code locally (Python, JavaScript, shell, etc.), enabling you to ask your computer to do tasks like data analysis, file manipulation, browsing, etc. in human terms (“chat with your computer”), with safeguards. Runs locally or via configured remote LLM servers/inference backends, giving flexibility to use models you trust or have...

Downloads: 22 This Week

Last Update: 2025-09-12
See Project
4

Datasets

Hub of ready-to-use datasets for ML models

Datasets is a library for easily accessing and sharing datasets, and evaluation metrics for Natural Language Processing (NLP), computer vision, and audio tasks. Load a dataset in a single line of code, and use our powerful data processing methods to quickly get your dataset ready for training in a deep learning model. Backed by the Apache Arrow format, process large datasets with zero-copy reads without any memory constraints for optimal speed and efficiency. We also feature a deep integration with the Hugging Face Hub, allowing you to easily load and share a dataset with the wider NLP community. ...

Downloads: 4 This Week

Last Update: 2026-01-14
See Project
G-P - Global EOR Solution
Companies searching for an Employer of Record solution to mitigate risk and manage compliance, taxes, benefits, and payroll anywhere in the world

With G-P's industry-leading Employer of Record (EOR) and Contractor solutions, you can hire, onboard and manage teams in 180+ countries — quickly and compliantly — without setting up entities.

Learn More
5

DataProfiler

Extract schema, statistics and entities from datasets

DataProfiler is an AI-powered tool for automatic data analysis and profiling, designed to detect patterns, anomalies, and schema inconsistencies in structured and unstructured datasets. The DataProfiler is a Python library designed to make data analysis, monitoring, and sensitive data detection easy. Loading Data with a single command, the library automatically formats & loads files into a DataFrame. Profiling the Data, the library identifies the schema, statistics, entities (PII / NPI), and more. Data Profiles can then be used in downstream applications or reports.

Downloads: 0 This Week

Last Update: 2025-07-30
See Project
6

Chinese-LLaMA-Alpaca 2

Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project

...These models expand and optimize the Chinese vocabulary on the basis of the original Llama-2, use large-scale Chinese data for incremental pre-training, and further improve the basic semantics and command understanding of Chinese. Performance improvements. The related model supports FlashAttention-2 training, supports 4K context and can be extended up to 18K+ through the NTK method.

Downloads: 0 This Week

Last Update: 2024-01-23
See Project
7

Amoeba

Linux Command Line Learning Program

Amoeba is a Linux command-line learning program that observes and adapts to the Linux command line storing learned strings and their usage data. It enhances command-line proficiency by capturing command outputs, adapting string lengths, and periodically saving knowledge. Sandboxing is essential for security, and optionally a virtual machine would further isolates it from the host system.

Downloads: 0 This Week

Last Update: 2025-08-09
See Project
8

NLG-Eval

Evaluation code for various unsupervised automated metrics

NLG-Eval is a toolkit for evaluating the quality of natural language generation (NLG) outputs using multiple automated metrics such as BLEU, METEOR, and ROUGE.

Downloads: 0 This Week

Last Update: 2025-01-24
See Project
9

Common Resource Grep - crgrep

Common Resource Grep

CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. ...

3 Reviews

Downloads: 1 This Week

Last Update: 2023-04-23
See Project
Say goodbye to broken revenue funnels and poor customer experiences
Connect and coordinate your data, signals, tools, and people at every step of the customer journey.

LeanData is a Demand Management solution that supports all go-to-market strategies such as account-based sales development, geo-based territories, and more. LeanData features a visual, intuitive workflow native to Salesforce that enables users to view their entire lead flow in one interface. LeanData allows users to access the drag-and-drop feature to route their leads. LeanData also features an algorithms match that uses multiple fields in Salesforce.

Learn More
10

Automatic text summarizer

Module for automatic summarization of text documents and HTML pages

Sumy is an automatic text summarization library that provides multiple algorithms for extracting key content from documents and articles. Simple library and command line utility for extracting summary from HTML pages or plain texts. The package also contains a simple evaluation framework for text summaries. Implemented summarization methods are described in the documentation. I also maintain a list of alternative implementations of the summarizers in various programming languages.

Downloads: 2 This Week

Last Update: 2025-01-21
See Project
11

AllenNLP

An open-source NLP research library, built on PyTorch

...AllenNLP will automatically find any official AI2-maintained plugins that you have installed, but for AllenNLP to find personal or third-party plugins you've installed, you also have to create either a local plugins file named .allennlp_plugins in the directory where you run the allennlp command.

Downloads: 0 This Week

Last Update: 2022-10-18
See Project
12

Emb-GAM

An interpretable and efficient predictor using pre-trained models

Deep learning models have achieved impressive prediction performance but often sacrifice interpretability, a critical consideration in high-stakes domains such as healthcare or policymaking. In contrast, generalized additive models (GAMs) can maintain interpretability but often suffer from poor prediction performance due to their inability to effectively capture feature interactions. In this work, we aim to bridge this gap by using pre-trained neural language models to extract embeddings for...

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
13

MITRE Annotation Toolkit

A toolkit for managing and manipulating text annotations

...Annotation is a process, used mostly by researchers in natural language processing, of enhancing documents with information about the various phrase types the documents contain. MAT supports both UI interaction and command-line interaction, and provides various levels of control over the overall annotation process. It can be customized for specific tasks (e.g., named entity identification, de-identification of medical records). The goal of MAT is not to help you configure your training engine (in the default case, the Carafe CRF system) to achieve the best possible performance on your data. ...

Downloads: 0 This Week

Last Update: 2023-04-19
See Project
14

NLP Architect

A model library for exploring state-of-the-art deep learning

NLP Architect is an open-source Python library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing and Natural Language Understanding neural networks. The library includes our past and ongoing NLP research and development efforts as part of Intel AI Lab. NLP Architect is designed to be flexible for adding new models, neural network components, data handling methods, and for easy training and running models. NLP Architect is a...

Downloads: 0 This Week

Last Update: 2022-08-05
See Project
15

DeText

A Deep Neural Text Understanding Framework

DeText is a Deep Text understanding framework for NLP-related ranking, classification, and language generation tasks. It leverages semantic matching using deep neural networks to understand member intents in search and recommender systems. As a general NLP framework, DeText can be applied to many tasks, including search & recommendation ranking, multi-class classification and query understanding tasks.

Downloads: 0 This Week

Last Update: 2025-01-24
See Project
16

GluonNLP

NLP made easy

...It provides easy-to-use tools that helps you load the text data, process the text data, and train models. To facilitate both the engineers and researchers, we provide command-line-toolkits for downloading and processing the NLP datasets. Gluon NLP makes it easy to evaluate and train word embeddings. Here are examples to evaluate the pre-trained embeddings included in the Gluon NLP toolkit as well as example scripts for training embeddings on custom datasets. Fasttext models trained with the library of Facebook research are exported both in text and a binary format. ...

Downloads: 0 This Week

Last Update: 2022-08-08
See Project
17

Delta ML

Deep learning based natural language and speech processing platform

DELTA is a deep learning-based end-to-end natural language and speech processing platform. DELTA aims to provide easy and fast experiences for using, deploying, and developing natural language processing and speech models for both academia and industry use cases. DELTA is mainly implemented using TensorFlow and Python 3. DELTA has been used for developing several state-of-the-art algorithms for publications and delivering real production to serve millions of users. It helps you to train,...

Downloads: 0 This Week

Last Update: 2022-08-15
See Project
18

VecText

Converting text to a structured representation

...All preprocessing actions are specified using common graphical elements organized into logically related blocks. The graphical user interface is implemented in Perl/Tk. In the command-line interface mode, all options need to be specified using the command line parameters. This way of non-interactive communication enables incorporating the application into a more complicated data mining process integrating several software packages or performing multiple conversions in a batch.

Downloads: 0 This Week

Last Update: 2020-02-25
See Project
19

Dragonfire

The open-source virtual assistant for Ubuntu based Linux distributions

Dragonfire is the open-source virtual assistant project for Ubuntu-based Linux distributions. Her main objective is to serve as a command and control interface to the helmet user. So that you will be able to give orders just by using your voice commands and your eye movements. That makes the helmet handsfree. We are planning to ship Dragonfire as a preinstalled software package on DragonOS Linux Distribution. DragonOS will be a Linux distribution specially designed for the helmet. ...

Downloads: 0 This Week

Last Update: 2022-01-13
See Project
20

Safe Harbor Deidentification

Safe Harbor Deidentification for medical documents

Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.

Downloads: 0 This Week

Last Update: 2019-09-10
See Project
21

CRP - Chemical Reaction Prediction

Predicting Organic Reactions using Neural Networks.

The intend is to solve the forward-reaction prediction problem, where the reactants are known and the interest is in generating the reaction products using Deep learning. This Graphical User Interface takes simplified molecular-input line-entry system (SMILES) as an input and generates the product SMILE & molecule. Beam search is used in Version 2, to generate top 5 predictions. Maximum input length for the model is 15 (excluding spaces).

Downloads: 0 This Week

Last Update: 2018-11-07
See Project
22

file_lemmater

text file quick lemmater

This executable get a text file (input name "in.txt" at the same folder where the executable is) and creates a file called "out.txt" with the same content but each noun, adjective or verb is lemmatized. From the Aseryla (https://memla.000webhostapp.com/index.html) system that combines the Stanford Core NLP (https://stanfordnlp.github.io/CoreNLP/index.html) and the CSTlemmatiser(http://cst.dk/online/lemmatiser/uk/)

Downloads: 0 This Week

Last Update: 2019-11-09
See Project
23

TEES

Turku Event Extraction System

Turku Event Extraction System (TEES) is a free and open source natural language processing system developed for the extraction of events and relations from biomedical text. It is written mostly in Python, and should work in generic Unix/Linux environments. Currently, the TEES source code repository still remains on GitHub at http://jbjorne.github.com/TEES/ where there is also a wiki with more information.

Downloads: 0 This Week

Last Update: 2017-05-23
See Project
24

Modular Audio Recognition Framework

MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.

3 Reviews

Downloads: 1 This Week

Last Update: 2015-10-06
See Project
25

JInsect

The JINSECT toolkit is a Java-based toolkit and library that supports and demonstrates the use of n-gram graphs within Natural Language Processing applications, ranging from summarization and summary evaluation to text classiﬁcation and indexing.

3 Reviews

Downloads: 0 This Week

Last Update: 2015-08-25
See Project

Previous
You're on page 1
2
Next

Related Searches

mfc file class

arabic speech to text

dataset

chinese

grep

summarizer

stakes mine predictor

medical diagnosis system

python ai

ai

Related Categories

Artificial Intelligence

Scientific/Engineering

Software Development

Education

System

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

×

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: