Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "learning language" - Page 12

x

Sort By:

Relevance

Clear All Filters

OS

Linux 362
Windows 360
Mac 338
More...
BSD 173
ChromeOS 168
Mobile Operating Systems 5
Desktop Operating Systems 2
Server Operating Systems 1

Category

Artificial Intelligence 317
Software Development 46
Education 27
Scientific/Engineering 17
Games 9
Business 7
System 6
Communications 4
Multimedia 4
Formats and Protocols 3
Database 2
Text Editors 2
Blockchain 1
Desktop Environment 1
Internet 1
Printing 1
Security 1

License

OSI-Approved Open Source 336
Creative Commons Attribution License 12
GNU Free Documentation License 2
Other License 1

Translations

English 13
French 3
Arabic 1
Brazilian Portuguese 1
More...
Chinese (Simplified) 1
Chinese (Traditional) 1
Dutch 1
Spanish 1
Tamil 1

Programming Language

Python 384
C++ 9
JavaScript 8
C 5
Unix Shell 5
More...
Java 4
Perl 3
C# 2
Julia 2
Common Lisp 1
Emacs-Lisp 1
Go 1
Kotlin 1
PHP 1
PL/SQL 1
R 1
Ruby 1
Rust 1
Scala 1
Tcl 1
VBScript 1

Status

Beta 18
Production/Stable 11
Pre-Alpha 8
Alpha 5
More...
Planning 3
Mature 1

Showing 384 open source projects for "learning language"

View related business solutions

Python Clear Filters & Widen Search

Auth0 B2B Essentials: SSO, MFA, and RBAC Built In
Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.

Sign Up Free
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
1

LM Human Preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

lm-human-preferences is the official OpenAI codebase that implements the method from the paper Fine-Tuning Language Models from Human Preferences. Its purpose is to show how to align language models with human judgments by training a reward model from human comparisons and then fine-tuning a policy model using that reward signal. The repository includes scripts to train the reward model (learning to rank or score pairs of outputs), and to fine-tune a policy (a language model) with reinforcement learning (or related techniques) guided by that reward model. ...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
2

MLPACK C++ machine learning library

MLPACK is a C++ machine learning library with emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and flexibility for expert users. * More info + downloads: https://mlpack.org * Git repo: https://github.com/mlpack/mlpack

Downloads: 0 This Week

Last Update: 2023-06-28
See Project
3

Transformers-Interpret

Model explainability that works seamlessly with Hugging Face

Transformers-Interpret is an interpretability tool for Transformer-based NLP models, providing insights into attention mechanisms and feature importance.

Downloads: 0 This Week

Last Update: 2025-01-24
See Project
4

VALL-E

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)

We introduce a language modeling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech which is hundreds of times larger than existing systems. ...

Downloads: 0 This Week

Last Update: 2023-04-14
See Project
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
5

GPT-NeoX

Implementation of model parallel autoregressive transformers on GPUs

This repository records EleutherAI's library for training large-scale language models on GPUs. Our current framework is based on NVIDIA's Megatron Language Model and has been augmented with techniques from DeepSpeed as well as some novel optimizations. We aim to make this repo a centralized and accessible place to gather techniques for training large-scale autoregressive language models, and accelerate research into large-scale training. For those looking for a TPU-centric codebase, we...

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
6

Simple LLM Finetuner

Simple UI for LLM Model Finetuning

Simple LLM Finetuner is a beginner-friendly interface designed to make the process of fine-tuning large language models more accessible by providing a simplified UI and workflow built around parameter-efficient techniques such as LoRA. It allows users to customize pre-trained models using relatively small datasets and modest hardware, making it feasible to experiment with LLM training even on consumer-grade GPUs or cloud environments like Google Colab.

Downloads: 0 This Week

Last Update: 2026-03-23
See Project
7

Alpa

Training and serving large-scale neural networks

Alpa is a system for training and serving large-scale neural networks. Scaling neural networks to hundreds of billions of parameters has enabled dramatic breakthroughs such as GPT-3, but training and serving these large-scale neural networks require complicated distributed system techniques. Alpa aims to automate large-scale distributed training and serving with just a few lines of code.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
8

d2l-zh

Chinese-language edition of Dive into Deep Learning

d2l‑zh is the Chinese-language edition of Dive into Deep Learning, an interactive, open‑source deep learning textbook that combines code, math, and explanatory text. It features runnable Jupyter notebooks compatible with multiple frameworks (e.g., PyTorch, MXNet, TensorFlow), comprehensive theoretical analysis, and exercises. Widely adopted in over 70 countries and used by more than 500 universities for teaching deep learning.

Downloads: 0 This Week

Last Update: 2025-07-29
See Project
9

CPT

CPT: A Pre-Trained Unbalanced Transformer

A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation. We replace the old BERT vocabulary with a larger one of size 51271 built from the training data, in which we 1) add missing 6800+ Chinese characters (most of them are traditional Chinese characters); 2) remove redundant tokens (e.g. Chinese character tokens with ## prefix); 3) add some English tokens to reduce OOV.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

AllenNLP

An open-source NLP research library, built on PyTorch

AllenNLP makes it easy to design and evaluate new deep learning models for nearly any NLP problem, along with the infrastructure to easily run them in the cloud or on your laptop. AllenNLP includes reference implementations of high quality models for both core NLP problems (e.g. semantic role labeling) and NLP applications (e.g. textual entailment). AllenNLP supports loading "plugins" dynamically. A plugin is just a Python package that provides custom registered classes or additional...

Downloads: 0 This Week

Last Update: 2022-10-18
See Project
11

Emb-GAM

An interpretable and efficient predictor using pre-trained models

...In this work, we aim to bridge this gap by using pre-trained neural language models to extract embeddings for each input before learning a linear model in the embedding space. The final model (which we call Emb-GAM) is a transparent, linear function of its input features and feature interactions. Leveraging the language model allows Emb-GAM to learn far fewer linear coefficients, model larger interactions, and generalize well to novel inputs.

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
12

DialoGPT

Large-scale pretraining for dialogue

DialoGPT is an open-source conversational language model developed by Microsoft Research for generating natural dialogue responses using large-scale transformer architectures. The system is built on the GPT-2 architecture and is designed specifically for multi-turn conversation tasks, enabling machines to produce coherent responses during interactive dialogue. The model was trained on a massive dataset of approximately 147 million conversational exchanges extracted from Reddit discussion...

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
13

Pattern

Web mining module for Python, with tools for scraping

Pattern is an open-source Python library that provides tools for web mining, natural language processing, machine learning, and network analysis. The project integrates multiple capabilities into a single framework that allows developers to collect, process, and analyze textual data from the web. It includes modules for web scraping and crawling that can retrieve information from sources such as social media platforms, search engines, and online knowledge bases.

Downloads: 1 This Week

Last Update: 2026-03-15
See Project
14

Hacker Scripts

Based on a true story

Hacker Scripts is a cheeky collection of small automation scripts and language ports collected under the tagline “Based on a true story.” The repository gathers playful utilities (originally shell and Ruby scripts) that automate short, real-world tasks — for example, sending a quick “late at work” text when SSH sessions are active, firing off an automated “I’m sick / working from home” email on certain mornings, or even talking to a networked coffee machine to start brewing at precisely the...

Downloads: 137 This Week

Last Update: 5 days ago
See Project
15

OpenPrompt

An Open-Source Framework for Prompt-Learning

Prompt-learning is the latest paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks, which modifies the input text with a textual template and directly uses PLMs to conduct pre-trained tasks. OpenPrompt is a library built upon PyTorch and provides a standard, flexible and extensible framework to deploy the prompt-learning pipeline.

Downloads: 0 This Week

Last Update: 2022-08-10
See Project
16

ASRT Speech Recognition

A Deep-Learning-Based Chinese Speech Recognition System

ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.

Downloads: 0 This Week

Last Update: 2025-07-03
See Project
17

Fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python

Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated in the...

Downloads: 0 This Week

Last Update: 2022-06-27
See Project
18

PromptSource

Toolkit for creating, sharing and using natural language prompts

PromptSource is a toolkit for creating, sharing and using natural language prompts. Recent work has shown that large language models exhibit the ability to perform reasonable zero-shot generalization to new tasks. For instance, GPT-3 demonstrated that large language models have strong zero- and few-shot abilities. FLAN and T0 then demonstrated that pre-trained language models fine-tuned in a massively multitask fashion yield even stronger zero-shot performance. A common denominator in these...

Downloads: 0 This Week

Last Update: 2024-08-07
See Project
19

Google Cloud Vision API examples

Sample code for Google Cloud Vision

...Although the repository has been marked as deprecated in favor of language-specific repositories for new work, it still serves as a broad reference hub for legacy examples and multi-language implementation patterns.

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
20

Machine Learning PyTorch Scikit-Learn

Code Repository for Machine Learning with PyTorch and Scikit-Learn

...For those who are interested in knowing what this book covers in general, I’d describe it as a comprehensive resource on the fundamental concepts of machine learning and deep learning. The first half of the book introduces readers to machine learning using scikit-learn, the defacto approach for working with tabular datasets. Then, the second half of this book focuses on deep learning, including applications to natural language processing and computer vision.

Downloads: 3 This Week

Last Update: 2022-08-22
See Project
21

Texar-PyTorch

Integrating the Best of TF into PyTorch, for Machine Learning

Texar-PyTorch is a toolkit aiming to support a broad set of machine learning, especially natural language processing and text generation tasks. Texar provides a library of easy-to-use ML modules and functionalities for composing whatever models and algorithms. The tool is designed for both researchers and practitioners for fast prototyping and experimentation. Texar-PyTorch was originally developed and is actively contributed by Petuum and CMU in collaboration with other institutes. ...

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
22

CodeSearchNet

Datasets, tools, and benchmarks for representation learning of code

CodeSearchNet is a large-scale dataset and research benchmark designed to advance the development of systems that retrieve source code using natural language queries. The project was created through collaboration between GitHub and Microsoft Research and aims to support research on semantic code search and program understanding. The dataset contains millions of pairs of source code functions and corresponding documentation comments extracted from open-source repositories. These pairs allow machine learning models to learn relationships between natural language descriptions and programming code. ...

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
23

Hugging Face Transformer

CPU/GPU inference server for Hugging Face transformer models

Optimize and deploy in production Hugging Face Transformer models in a single command line. At Lefebvre Dalloz we run in-production semantic search engines in the legal domain, in the non-marketing language it's a re-ranker, and we based ours on Transformer. In that setup, latency is key to providing a good user experience, and relevancy inference is done online for hundreds of snippets per user query. Most tutorials on Transformer deployment in production are built over Pytorch and FastAPI....

Downloads: 0 This Week

Last Update: 2022-08-22
See Project
24

Lines

Lines a game written in Python two players through internet

Lines is an old game that I had programed in python (pygame). I usually prefer to use visual c# as a programing language. However, I wrote this game in python in a time period, I was learning python. Here, I want to thank all people who have training videos in youtube, they helped me a lot to make this program. Some of the code of the program is from these videos. The game can be played from one or two persons through internet. The game is a good example for learning pygame.

Downloads: 0 This Week

Last Update: 2022-11-16
See Project
25

Awesome Decision Tree Papers

A collection of research papers on decision, classification, etc.

A collection of research papers on decision, classification and regression trees with implementations.

Downloads: 1 This Week

Last Update: 2024-08-09
See Project

Previous
8
9
10
11
You're on page 12
13
14
15
16
Next

Related Searches

roblox scripts

k nearest neighbor

text to speech

employee training records

stakes mine predictor

script

windows optimizer

machine learning

cuda c machine learning

tts

Related Categories

Artificial Intelligence

Software Development

Education

Scientific/Engineering

Games

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise