Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "learning language" - Page 14

x

Sort By:

Relevance

Clear All Filters

OS

Linux 362
Windows 360
Mac 338
More...
BSD 173
ChromeOS 168
Mobile Operating Systems 5
Desktop Operating Systems 2
Server Operating Systems 1

Category

Artificial Intelligence 317
Software Development 46
Education 27
Scientific/Engineering 17
Games 9
Business 7
System 6
Communications 4
Multimedia 4
Formats and Protocols 3
Database 2
Text Editors 2
Blockchain 1
Desktop Environment 1
Internet 1
Printing 1
Security 1

License

OSI-Approved Open Source 336
Creative Commons Attribution License 12
GNU Free Documentation License 2
Other License 1

Translations

English 13
French 3
Arabic 1
Brazilian Portuguese 1
More...
Chinese (Simplified) 1
Chinese (Traditional) 1
Dutch 1
Spanish 1
Tamil 1

Programming Language

Python 384
C++ 9
JavaScript 8
C 5
Unix Shell 5
More...
Java 4
Perl 3
C# 2
Julia 2
Common Lisp 1
Emacs-Lisp 1
Go 1
Kotlin 1
PHP 1
PL/SQL 1
R 1
Ruby 1
Rust 1
Scala 1
Tcl 1
VBScript 1

Status

Beta 18
Production/Stable 11
Pre-Alpha 8
Alpha 5
More...
Planning 3
Mature 1

Showing 384 open source projects for "learning language"

View related business solutions

Python Clear Filters & Widen Search

Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
1

cocoNLP

A Chinese information extraction tool

...Because it aims at utility over complexity, it’s useful for prototyping data products or building lightweight text analytics where large models would be overkill. The repository also includes examples and test snippets to help you understand expected inputs and typical outputs, which shortens the learning curve for newcomers.

Downloads: 0 This Week

Last Update: 2025-11-05
See Project
2

Deep Learning Drizzle

Drench yourself in Deep Learning, Reinforcement Learning

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures! Optimization courses which form the foundation for ML, DL, RL. Computer Vision courses which are DL & ML heavy. Speech recognition courses which are DL heavy. Structured Courses on Geometric, Graph Neural Networks. Section on Autonomous Vehicles. Section on Computer Graphics with ML/DL focus.

Downloads: 0 This Week

Last Update: 2022-07-29
See Project
3

Texar

Toolkit for Machine Learning, Natural Language Processing

Texar is a toolkit aiming to support a broad set of machine learning, especially natural language processing and text generation tasks. Texar provides a library of easy-to-use ML modules and functionalities for composing whatever models and algorithms. The tool is designed for both researchers and practitioners for fast prototyping and experimentation. Texar was originally developed and is actively contributed by Petuum and CMU in collaboration with other institutes.

Downloads: 0 This Week

Last Update: 2022-08-08
See Project
4

PyTorch Natural Language Processing

Basic Utilities for PyTorch Natural Language Processing (NLP)

PyTorch-NLP is a library for Natural Language Processing (NLP) in Python. It’s built with the very latest research in mind, and was designed from day one to support rapid prototyping. PyTorch-NLP comes with pre-trained embeddings, samplers, dataset loaders, metrics, neural network modules and text encoders. It’s open-source software, released under the BSD3 license. With your batch in hand, you can use PyTorch to develop and train your model using gradient descent. For example, check out...

Downloads: 0 This Week

Last Update: 2022-08-09
See Project
Stop Storing Third-Party Tokens in Your Database
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.

Try Auth0 for Free
5

InferSent

InferSent sentence embeddings

InferSent is a supervised sentence embedding method that learns universal representations from Natural Language Inference data and transfers well to many downstream tasks. It uses a BiLSTM encoder with max-pooling to produce fixed-length sentence vectors that capture semantics beyond bag-of-words statistics. Trained on large NLI datasets, the embeddings generalize across tasks like sentiment analysis, entailment, paraphrase detection, and semantic similarity with simple linear classifiers....

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
6

Project Malmo

A platform for Artificial Intelligence experimentation on Minecraft

...The two components can run on Windows, Linux, or Mac OS, and researchers can program their agents in any programming language they’re comfortable with.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
7

NeuroNER

Named-entity recognition using neural networks

...Identified entities can be used in various downstream applications such as patient note de-identification and information extraction systems. They can also be used as features for machine learning systems for other natural language processing tasks. Leverages the state-of-the-art prediction capabilities of neural networks (a.k.a. "deep learning") Is cross-platform, open source, freely available, and straightforward to use. Enables the users to create or modify annotations for a new or existing corpus. Train the neural network that performs the NER. ...

Downloads: 0 This Week

Last Update: 2022-08-12
See Project
8

lazynlp

Library to scrape and clean web pages to create massive datasets

LazyNLP is a lightweight tool for collecting and curating large-scale text datasets for machine learning and NLP applications with minimal manual effort.

Downloads: 0 This Week

Last Update: 2025-01-22
See Project
9

Pipelines

An experimental programming language for data flow

Pipelines is a language and runtime for crafting massively parallel pipelines. Unlike other languages for defining data flow, the Pipeline language requires the implementation of components to be defined separately in the Python scripting language. This allows the details of implementations to be separated from the structure of the pipeline while providing access to thousands of active libraries for machine learning, data analysis, and processing.

Downloads: 0 This Week

Last Update: 2025-01-21
See Project
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
10

Arabic Corpus

Text categorization, arabic language processing, language modeling

The Arabic Corpus {compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on...

Downloads: 4 This Week

Last Update: 2019-03-05
See Project
11

Deepvoice3_pytorch

PyTorch implementation of convolutional neural networks

An open source implementation of Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning.

Downloads: 0 This Week

Last Update: 2024-08-13
See Project
12

CRP - Chemical Reaction Prediction

Predicting Organic Reactions using Neural Networks.

The intend is to solve the forward-reaction prediction problem, where the reactants are known and the interest is in generating the reaction products using Deep learning. This Graphical User Interface takes simplified molecular-input line-entry system (SMILES) as an input and generates the product SMILE & molecule. Beam search is used in Version 2, to generate top 5 predictions. Maximum input length for the model is 15 (excluding spaces).

Downloads: 0 This Week

Last Update: 2018-11-07
See Project
13

cnn-text-classification-tf

Convolutional Neural Network for Text Classification in Tensorflow

...By breaking down the model into understandable components, it serves as a practical reference for students and practitioners learning how deep learning models handle text beyond traditional bag-of-words approaches.

Downloads: 0 This Week

Last Update: 2026-02-13
See Project
14

anaGo

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition

anaGo is a Python library for sequence labeling(NER, PoS Tagging,...), implemented in Keras. anaGo can solve sequence labeling tasks such as named entity recognition (NER), part-of-speech tagging (POS tagging), semantic role labeling (SRL) and so on. Unlike traditional sequence labeling solver, anaGo doesn't need to define any language-dependent features. Thus, we can easily use anaGo for any language. In anaGo, the simplest type of model is the Sequence model. Sequence model includes...

Downloads: 0 This Week

Last Update: 2022-08-15
See Project
15

DeepLearn

Implementation of research papers on Deep Learning+ NLP+ CV in Python

Welcome to DeepLearn. This repository contains an implementation of the following research papers on NLP, CV, ML, and deep learning. The required dependencies are mentioned in requirement.txt. I will also use dl-text modules for preparing the datasets. If you haven't use it, please do have a quick look at it. CV, transfer learning, representation learning.

Downloads: 0 This Week

Last Update: 2022-08-11
See Project
16

ECDICT

Free English to Chinese Dictionary Database

ECDICT is a comprehensive English–Chinese dictionary dataset packaged for developers who need an offline, queryable lexicon for applications, NLP, or educational tools. It aggregates headwords, phonetics, parts of speech, translations, and example information into formats that are easy to integrate. The project provides multiple distribution forms—commonly SQLite/CSV/StarDict-style files—so you can choose the right storage and query approach for your app. Because it’s offline and local, it...

Downloads: 16 This Week

Last Update: 2025-11-13
See Project
17

EvalAI

Evaluating state of the art in AI

EvalAI is an open-source platform for evaluating and comparing machine learning (ML) and artificial intelligence (AI) algorithms at scale. We allow the creation of an arbitrary number of evaluation phases and dataset splits, compatibility using any programming language, and organizing results in both public and private leaderboards. Certain large-scale challenges need special computing capabilities for evaluation.

Downloads: 0 This Week

Last Update: 2022-09-02
See Project
18

AI learning

AiLearning, data analysis plus machine learning practice

We actively respond to the Research Open Source Initiative (DOCX) . Open source today is not just open source, but datasets, models, tutorials, and experimental records. We are also exploring other categories of open source solutions and protocols. I hope you will understand this initiative, combine this initiative with your own interests, and do what you can. Everyone's tiny contributions, together, are the entire open source ecosystem. We are iBooker, a large open-source community,...

Downloads: 0 This Week

Last Update: 2022-02-18
See Project
19

House3D

A Realistic and Rich 3D Environment

House3D is a large-scale virtual 3D simulation environment designed to support research in embodied AI, reinforcement learning, and vision-language navigation. It provides more than 45,000 richly annotated indoor scenes sourced from the SUNCG dataset, covering diverse architectural layouts such as studios, multi-floor homes, and spaces with detailed furnishings and room types. Each environment includes fully labeled 3D objects, allowing agents to perceive and interact with their surroundings through multiple sensory modalities including RGB images, depth maps, semantic segmentation masks, and top-down maps. ...

Downloads: 0 This Week

Last Update: 2026-04-25
See Project
20

WikiSQL

A large annotated semantic parsing corpus for developing NL interfaces

A large crowd-sourced dataset for developing natural language interfaces for relational databases. WikiSQL is the dataset released along with our work Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. Regarding tokenization and Stanza, when WikiSQL was written 3-years ago, it relied on Stanza, a CoreNLP python wrapper that has since been deprecated.

Downloads: 2 This Week

Last Update: 2022-07-26
See Project
21

Seq2Seq Chatbot

Chatbot in 200 lines of code using TensorLayer

Seq2Seq Chatbot is an implementation of a sequence-to-sequence chatbot model using TensorLayer, demonstrating how to build conversational agents with minimal code.

Downloads: 0 This Week

Last Update: 2025-01-30
See Project
22

Ezhil-Lang

தமிழில் கணினி மொழி

எழில் - ஒரு தமிழ் நிரலாக்க மொழி; தமிழ் மாணவர்களுக்கு இது முதல்முறை கணி Ezhil is a Tamil script based programming language for children and teens in the K-12 grade schools. Ezhil enables learning imperative programming like BASIC or LOGO in Tamil language.

1 Review

Downloads: 0 This Week

Last Update: 2017-09-19
See Project
23

Scattertext 0.2.1

Beautiful visualizations of how language differs among document types

A tool for finding distinguishing terms in corpora and displaying them in an interactive HTML scatter plot. Points corresponding to terms are selectively labeled so that they don't overlap with other labels or points.

Downloads: 0 This Week

Last Update: 2024-08-09
See Project
24

TEES

Turku Event Extraction System

Turku Event Extraction System (TEES) is a free and open source natural language processing system developed for the extraction of events and relations from biomedical text. It is written mostly in Python, and should work in generic Unix/Linux environments. Currently, the TEES source code repository still remains on GitHub at http://jbjorne.github.com/TEES/ where there is also a wiki with more information.

Downloads: 0 This Week

Last Update: 2017-05-23
See Project
25

RDRPOSTagger

A Rule-based Part-of-Speech and Morphological Tagging Toolkit

RDRPOSTagger is a robust, easy-to-use and language-independent rule-based toolkit for Part-of-Speech (POS) and morphological tagging. RDRPOSTagger obtains fast performance in both learning and tagging process. RDRPOSTagger also achieves a very competitive accuracy in comparison to the state-of-the-art results. RDRPOSTagger now supports pre-trained POS and morphological tagging models for Bulgarian, Czech, Dutch, English, French, German, Hindi, Italian, Portuguese, Spanish, Swedish, Thai and Vietnamese. ...

2 Reviews

Downloads: 0 This Week

Last Update: 2017-05-24
See Project

Previous
10
11
12
13
You're on page 14
15
16
Next

Related Searches

deep learning

natural language processing project

minecraft ai bot

arabic corpus

chemical reaction software

stardict

ai

ezhillang

morphological analysis for amharic language

natural language processing arabic

Related Categories

Artificial Intelligence

Software Development

Education

Scientific/Engineering

Games

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise