Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "learning language" - Page 5

x

Sort By:

Relevance

Clear All Filters

OS

Linux 368
Windows 365
Mac 343
More...
BSD 176
ChromeOS 171
Mobile Operating Systems 5
Desktop Operating Systems 2
Server Operating Systems 1

Category

Artificial Intelligence 321
Software Development 47
Education 29
Scientific/Engineering 17
Games 9
Business 7
System 6
Communications 4
Multimedia 4
Formats and Protocols 3
Database 2
Text Editors 2
Blockchain 1
Desktop Environment 1
Internet 1
Printing 1
Security 1

License

OSI-Approved Open Source 340
Creative Commons Attribution License 13
GNU Free Documentation License 2
Other License 1

Translations

English 13
French 3
Arabic 1
Brazilian Portuguese 1
More...
Chinese (Simplified) 1
Chinese (Traditional) 1
Dutch 1
Spanish 1
Tamil 1

Programming Language

Python 391
C++ 9
JavaScript 8
C 5
Unix Shell 5
More...
Java 4
Perl 3
C# 2
Julia 2
Common Lisp 1
Emacs-Lisp 1
Go 1
Kotlin 1
PHP 1
PL/SQL 1
R 1
Ruby 1
Rust 1
Scala 1
Tcl 1
VBScript 1

Status

Beta 18
Production/Stable 12
Pre-Alpha 8
Alpha 5
More...
Planning 3
Mature 1

Showing 391 open source projects for "learning language"

View related business solutions

Python Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
1

Liger Kernel

Efficient Triton Kernels for LLM Training

Liger Kernel is a unified kernel developed by LinkedIn to streamline data science and machine learning workflows across different languages and tools. It provides a consistent interface for running code in various languages (such as Python, R, SQL) within a single Jupyter-like environment, enhancing productivity and collaboration for data scientists working in mixed-language projects.

Downloads: 4 This Week

Last Update: 2026-04-30
See Project
2

MaxText

A simple, performant and scalable Jax LLM

...MaxText includes ready-to-use configurations and reproducible training examples that help developers understand how to deploy large-scale AI workloads with modern machine learning infrastructure.

Downloads: 0 This Week

Last Update: 2026-06-12
See Project
3

The Alignment Handbook

Robust recipes to align language models with human and AI preferences

The Alignment Handbook is an open-source resource created to provide practical guidance for aligning large language models with human preferences and safety requirements. The project focuses on the post-training stage of model development, where models are refined after pre-training to behave more helpfully, safely, and reliably in real-world applications. It provides detailed training recipes that explain how to perform tasks such as supervised fine-tuning, preference modeling, and reinforcement learning from human feedback. ...

Downloads: 0 This Week

Last Update: 2026-03-08
See Project
4

Web Dev for Beginners

About 24 Lessons, 12 Weeks, Get Started as a Web Developer

Web-Dev-For-Beginners is Microsoft’s open source, project-based curriculum for learning web development from scratch. Designed as a 12-week, 24-lesson course, it covers HTML, CSS, and JavaScript fundamentals through hands-on projects like terrariums, browser extensions, and space games. Each lesson includes a mix of pre-lecture quizzes, written content, assignments, challenges, and post-lecture quizzes to reinforce learning. The course also offers global accessibility with translations in...

Downloads: 2 This Week

Last Update: 8 hours ago
See Project
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
5

TorchDistill

A coding-free framework built on PyTorch

torchdistill (formerly kdkit) offers various state-of-the-art knowledge distillation methods and enables you to design (new) experiments simply by editing a declarative yaml config file instead of Python code. Even when you need to extract intermediate representations in teacher/student models, you will NOT need to reimplement the models, which often change the interface of the forward, but instead specify the module path(s) in the yaml file. In addition to knowledge distillation, this...

Downloads: 0 This Week

Last Update: 2025-12-24
See Project
6

MiMo Audio

Audio Language Models are Few-Shot Learners

MiMo Audio is an open-source audio language model project focused on few-shot learning across speech and audio tasks. It explores how large-scale next-token prediction can help audio models generalize from a few examples or simple instructions. The project includes MiMo-Audio-7B-Base and MiMo-Audio-7B-Instruct, along with a dedicated MiMo-Audio tokenizer. It supports audio understanding, speech intelligence, spoken dialogue, instruction-following audio generation, and text-to-speech-style tasks. ...

Downloads: 0 This Week

Last Update: 2026-06-29
See Project
7

FastDeploy

High-performance Inference and Deployment Toolkit for LLMs and VLMs

FastDeploy is an open-source inference and deployment toolkit designed to simplify the process of running and serving deep learning models across a wide range of hardware platforms. Developed within the PaddlePaddle ecosystem, the toolkit focuses on providing high-performance deployment capabilities for modern AI models including large language models and vision-language systems. The platform enables developers to deploy trained models quickly using optimized inference pipelines that support GPUs, specialized AI accelerators, and other hardware architectures. ...

Downloads: 5 This Week

Last Update: 2026-04-08
See Project
8

fugue

A unified interface for distributed computing

Fugue is a unified interface for distributed computing that lets users execute Python, Pandas, and SQL code on Spark, Dask, and Ray with minimal rewrites.

Downloads: 5 This Week

Last Update: 2026-02-20
See Project
9

Artificial Intelligence for Beginners

12 Weeks, 24 Lessons, AI for All

...The repository provides a 12-week program composed of 24 lessons that combine theory, code examples, quizzes, and laboratory exercises. It covers a broad range of topics including neural networks, computer vision, natural language processing, and AI ethics. The curriculum is intentionally beginner-friendly while still exposing learners to widely used frameworks such as TensorFlow and PyTorch. It also supports many languages, making the material accessible to a global audience. Overall, the project functions as a complete self-paced learning pathway for students, educators, and developers who want a practical introduction to modern AI concepts.

Downloads: 3 This Week

Last Update: 2026-07-06
See Project
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
10

Elyra

Elyra extends JupyterLab with an AI centric approach

Elyra is a set of AI-centric extensions to JupyterLab Notebooks. The Elyra Getting Started Guide includes more details on these features. A version-specific summary of new features is located on the releases page.

Downloads: 7 This Week

Last Update: 2025-08-16
See Project
11

DB-GPT

Revolutionizing Database Interactions with Private LLM Technology

DB-GPT is an experimental open-source project that uses localized GPT large models to interact with your data and environment. With this solution, you can be assured that there is no risk of data leakage, and your data is 100% private and secure.

Downloads: 9 This Week

Last Update: 2026-06-18
See Project
12

Hermes Agent

The agent that grows with you

...It supports scheduled automation in natural language, allowing users to set up recurring tasks such as daily briefings or system audits that it runs unattended.

Downloads: 40 This Week

Last Update: 6 days ago
See Project
13

Sparrow

Structured data extraction and instruction calling with ML, LLM

Sparrow is an open-source platform designed to extract structured information from documents, images, and other unstructured data sources using machine learning and large language models. The system focuses on transforming complex documents such as invoices, receipts, forms, and scanned pages into structured formats like JSON that can be processed by downstream applications. It combines several components, including OCR pipelines, vision-language models, and LLM-based reasoning modules to identify and extract meaningful data fields from heterogeneous document layouts. ...

Downloads: 7 This Week

Last Update: 2026-06-05
See Project
14

NNCF

Neural Network Compression Framework for enhanced OpenVINO

NNCF (Neural Network Compression Framework) is an optimization toolkit for deep learning models, designed to apply quantization, pruning, and other techniques to improve inference efficiency.

Downloads: 7 This Week

Last Update: 2026-06-01
See Project
15

LlamaIndex

Central interface to connect your LLM's with external data

LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM's with external data. LlamaIndex is a simple, flexible interface between your external data and LLMs. It provides the following tools in an easy-to-use fashion. Provides indices over your unstructured and structured data for use with LLM's. These indices help to abstract away common boilerplate and pain points for in-context learning. Dealing with prompt limitations (e.g. 4096 tokens for Davinci) when...

Downloads: 9 This Week

Last Update: 2026-06-24
See Project
16

slime LLM

slime is an LLM post-training framework for RL Scaling

slime is an open-source large language model (LLM) post-training framework developed to support reinforcement learning (RL)-based scaling and high-performance training workflows for advanced LLMs, blending training and rollout modules into an extensible system. It offers a flexible architecture that connects high-throughput training (e.g., via Megatron-LM) with a customizable data generation pipeline, enabling researchers and engineers to iterate on new RL training paradigms effectively. ...

Downloads: 2 This Week

Last Update: 2026-05-30
See Project
17

nano-graphrag

A simple, easy-to-hack GraphRAG implementation

nano-graphrag is a lightweight implementation of the GraphRAG approach designed to simplify experimentation with graph-based retrieval-augmented generation systems. GraphRAG expands traditional RAG pipelines by constructing knowledge graphs from documents and using relationships between entities to improve the quality and reasoning of AI responses. The nano-GraphRAG project focuses on reducing complexity by providing a compact and readable codebase that preserves the core functionality of...

Downloads: 7 This Week

Last Update: 2026-03-05
See Project
18

OpenBB

Investment Research for Everyone, Everywhere

Customize and speed up your analysis, bring your own data, and create instant reports to gain a competitive edge. Whether it’s a CSV file, a private endpoint, an RSS feed, or even embed an SEC filing directly. Chat with financial data using large language models. Don’t waste time reading, create summaries in seconds and ask how that impacts investments. Create your dashboard with your favorite widgets. Create charts directly from raw data in seconds. Create charts directly from raw data in...

Downloads: 2 This Week

Last Update: 2026-04-23
See Project
19

Lingua-Py

The most accurate natural language detection library for Python

Language detection is often done as part of large machine learning frameworks or natural language processing applications. In cases where you don't need the full-fledged functionality of those systems or don't want to learn the ropes of those, a small flexible library comes in handy.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
20

tiny-llm

A course of learning LLM inference serving on Apple Silicon

tiny-llm is an educational open-source project designed to teach system engineers how large language model inference and serving systems work by building them from scratch. The project is structured as a guided course that walks developers through the process of implementing the core components required to run a modern language model, including attention mechanisms, token generation, and optimization techniques. Rather than relying on high-level machine learning frameworks, the codebase uses mostly low-level array and matrix manipulation APIs so that developers can understand exactly how model inference works internally. ...

Downloads: 0 This Week

Last Update: 2026-06-13
See Project
21

gensim

Topic Modelling for Humans

Gensim is a Python library for topic modeling, document indexing, and similarity retrieval with large corpora. The target audience is the natural language processing (NLP) and information retrieval (IR) community.

Downloads: 5 This Week

Last Update: 2025-10-16
See Project
22

docext

An on-premises, OCR-free unstructured data extraction

docext is a document intelligence toolkit that uses vision-language models to extract structured information from documents such as PDFs, forms, and scanned images. The system is designed to operate entirely on-premises, allowing organizations to process sensitive documents without relying on external cloud services. Unlike traditional document processing pipelines that rely heavily on optical character recognition, docext leverages multimodal AI models capable of understanding both visual...

Downloads: 4 This Week

Last Update: 2026-03-12
See Project
23

Agent Behavior Monitoring

The open source post-building layer for agents

Agent Behavior Monitoring is an open-source framework designed to monitor, evaluate, and improve the behavior of AI agents operating in real or simulated environments. The system focuses on agent behavior monitoring by collecting interaction data and analyzing how agents perform across different scenarios and tasks. Developers can use the framework to observe agent actions in both online production environments and offline evaluation settings, making it useful for debugging and performance...

Downloads: 7 This Week

Last Update: 2026-06-25
See Project
24

spacy-transformers

Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

spaCy supports a number of transfer and multi-task learning workflows that can often help improve your pipeline’s efficiency or accuracy. Transfer learning refers to techniques such as word vector tables and language model pretraining. These techniques can be used to import knowledge from raw text into your pipeline, so that your models are able to generalize better from your annotated examples.

Downloads: 2 This Week

Last Update: 2026-03-17
See Project
25

ModelScope

Bring the notion of Model-as-a-Service to life

ModelScope is built upon the notion of “Model-as-a-Service” (MaaS). It seeks to bring together most advanced machine learning models from the AI community, and streamlines the process of leveraging AI models in real-world applications. The core ModelScope library open-sourced in this repository provides the interfaces and implementations that allow developers to perform model inference, training and evaluation. In particular, with rich layers of API abstraction, the ModelScope library offers...

Downloads: 11 This Week

Last Update: 2026-07-06
See Project

Previous
1
2
3
4
You're on page 5
6
7
8
9
Next

Related Searches

artificial intelligence projects

ai agent

offline artificial intelligence\

ai

create db data with ai

intelligence

hermes

agent ai

ai pro free

windows 12 lite

Related Categories

Artificial Intelligence

Software Development

Education

Scientific/Engineering

Games

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise