Search Results for "distributed shared memory" - Page 2

Showing 65 open source projects for "distributed shared memory"


Active filters: Artificial Intelligence, Mac

1

Kalavai

Turn everyday devices into your own AI cluster

Kalavai is a self-hosted platform that turns everyday devices into your very own AI cluster. Do you have an old desktop or a gaming laptop gathering dust? Aggregate resources from multiple machines and say goodbye to CUDA out-of-memory errors. Deploy your favorite open-source LLM, fine-tune it with your own data, or simply run your distributed work, zero-DevOps. Simple. Private. Yours.

Downloads: 0 This Week

Last Update: 6 days ago
2

Xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

...Its architecture incorporates memory-efficient optimizations that allow researchers to train large models even when computational resources are limited. XTuner is also designed to integrate with modern AI ecosystems, supporting multimodal training, reinforcement learning optimization, and instruction tuning pipelines.

Downloads: 0 This Week

Last Update: 2026-03-04
3

Liveblocks

Liveblocks gives you the building blocks and infrastructure

Liveblocks is an open-source collaboration infrastructure and toolkit that enables developers to integrate real-time collaborative features into web and mobile applications with minimal effort. It provides building blocks like multiplayer cursors, comments, notifications, and AI-agent hooks that can be composed to support shared experiences such as collaborative editing, synchronized state, or embedded AI collaboration within apps. Rather than building real-time synchronization from scratch, developers can leverage Liveblocks’ SDKs and APIs to focus on their product’s unique logic while relying on robust back-end support for distributed state and event propagation. ...

Downloads: 0 This Week

Last Update: 1 day ago
4

Ludwig AI

Low-code framework for building custom LLMs, neural networks

...Support for multi-task and multi-modality learning. Comprehensive config validation detects invalid parameter combinations and prevents runtime failures. Automatic batch size selection, distributed training (DDP, DeepSpeed), parameter efficient fine-tuning (PEFT), 4-bit quantization (QLoRA), and larger-than-memory datasets. Retain full control of your models down to the activation functions. Support for hyperparameter optimization, explainability, and rich metric visualizations. Experiment with different model architectures, tasks, features, and modalities with just a few parameter changes in the config. ...

Downloads: 4 This Week

Last Update: 21 hours ago
5

Extractous

Fast and efficient unstructured data extraction

...Its purpose is to extract text and metadata efficiently from formats such as PDF, Word, HTML, email archives, images, and more, without depending on external APIs or separate parsing servers. The project emphasizes performance and low memory usage, and its maintainers describe it as a local-first alternative to heavier extraction stacks. For broader format support, the system combines its Rust core with ahead-of-time compiled Apache Tika shared libraries, which allows it to extend parsing coverage while still avoiding traditional server-based overhead. It also supports OCR for images and scanned documents through Tesseract, making it useful for document ingestion pipelines that include image-based or scanned inputs.

Downloads: 0 This Week

Last Update: 2026-03-06
6

Refly

The first open-source agent skills builder

...With a focus on making automation accessible, it provides a visual canvas and low-code components that feel similar to drag-and-drop builders but backed by powerful AI orchestration, memory handling, and integrations with external services. Refly’s approach bridges the gap between workflow ideas and stable, deterministic infrastructure: skills become governed capabilities that can be versioned, shared, and monetized, not just temporary scripts.

Downloads: 0 This Week

Last Update: 2026-02-10
7

OpenTinker

OpenTinker is an RL-as-a-Service infrastructure for foundation models

...Traditional RL setups can be monolithic and difficult to configure, but OpenTinker separates concerns across agent definition, environment interaction, and execution, which lets developers focus on defining the logic of agents and environments separately from how training and inference are run. It introduces a centralized scheduler to manage distributed training jobs and shared compute resources, enabling workloads like reinforcement learning, supervised fine-tuning, and inference to run across multiple settings. The architecture supports a range of single-turn and multi-turn agentic tasks with a design that abstracts away infrastructure complexity while offering flexible Python APIs to define environments and workflows.

Downloads: 0 This Week

Last Update: 2026-03-01
8

LingBot-World

Advancing Open-source World Models

LingBot-World is an open-source, high-fidelity world simulator designed to advance the state of world models through video generation. Built on top of Wan2.2, it enables realistic, dynamic environment simulation across diverse styles, including real-world, scientific, and stylized domains. LingBot-World supports long-term temporal consistency, maintaining coherent scenes and interactions over minute-level horizons. With real-time interactivity and sub-second latency at 16 FPS, it is...

Downloads: 6 This Week

Last Update: 1 day ago
9

super-agent-party

All-in-one AI companion! Desktop girlfriend + virtual streamer

Super Agent Party is an open-source experimental framework designed to demonstrate collaborative multi-agent AI systems interacting within a shared environment. The project explores how multiple specialized AI agents can coordinate to solve complex tasks by communicating with each other and sharing information. Instead of relying on a single monolithic model, the framework organizes agents with different roles or capabilities that cooperate to achieve goals. Each agent may handle different...

Downloads: 0 This Week

Last Update: 2026-05-01
10

Omnilingual ASR

Omnilingual ASR Open-Source Multilingual Speech Recognition

Omnilingual-ASR is a research codebase exploring automatic speech recognition that generalizes across a very large number of languages using shared modeling and training recipes. It focuses on leveraging self-supervised audio pretraining and scalable fine-tuning so low-resource languages can benefit from high-resource data. The project provides data preparation pipelines, training scripts, decoding utilities, and evaluation tools so researchers can reproduce results and extend to new...

Downloads: 0 This Week

Last Update: 2025-12-12
11

NeuralForecast

Scalable and user friendly neural forecasting algorithms.

NeuralForecast offers a large collection of neural forecasting models focusing on their performance, usability, and robustness. The models range from classic networks like RNNs to the latest transformers: MLP, LSTM, GRU, RNN, TCN, TimesNet, BiTCN, DeepAR, NBEATS, NBEATSx, NHITS, TiDE, DeepNPTS, TSMixer, TSMixerx, MLPMultivariate, DLinear, NLinear, TFT, Informer, AutoFormer, FedFormer, PatchTST, iTransformer, StemGNN, and TimeLLM. There is a shared belief in Neural forecasting methods'...

Downloads: 0 This Week

Last Update: 2026-05-06
12

.NET for Apache Spark

A free, open-source, and cross-platform big data analytics framework

.NET for Apache Spark provides high-performance APIs for using Apache Spark from C# and F#. With these .NET APIs, you can access the most popular DataFrame and Spark SQL aspects of Apache Spark, for working with structured data, and Spark Structured Streaming, for working with streaming data. .NET for Apache Spark is compliant with .NET Standard, a formal specification of .NET APIs that are common across .NET implementations. This means you can use .NET for Apache Spark anywhere you write...

Downloads: 0 This Week

Last Update: 2026-02-13
13

TensorRT LLM

TensorRT LLM provides users with an easy-to-use Python API

...It provides a Python-based API built on top of PyTorch that allows developers to define, customize, and deploy LLMs efficiently across a variety of hardware configurations, from single GPUs to large multi-node clusters. The library focuses on maximizing throughput and minimizing latency through advanced techniques such as quantization, custom attention kernels, and optimized memory management strategies. It includes support for cutting-edge inference methods like speculative decoding and in-flight batching, enabling real-time and large-scale AI applications. TensorRT-LLM integrates seamlessly with NVIDIA’s broader inference ecosystem, including Triton Inference Server and distributed deployment frameworks, making it suitable for production environments.

Downloads: 0 This Week

Last Update: 2026-04-16
14

Swarms

Enterprise multi-agent orchestration framework for scalable AI apps

...Swarms also includes mechanisms for agent lifecycle management, memory handling, and dynamic composition, making it adaptable to evolving workloads. Additionally, it focuses on developer productivity through APIs, CLI tools, and templates that simplify building and deploying agent-based applications.

Downloads: 0 This Week

Last Update: 2026-03-17
15

Tiny CUDA Neural Networks

Lightning fast C++/CUDA neural network framework

...We provide a sample application where an image function (x,y) -> (R,G,B) is learned. The fully fused MLP component of this framework requires a very large amount of shared memory in its default configuration. It will likely only work on an RTX 3090, an RTX 2080 Ti, or high-end enterprise GPUs. Lower-end cards must reduce the n_neurons parameter or use the CutlassMLP (better compatibility but slower) instead. tiny-cuda-nn comes with a PyTorch extension that allows using the fast MLPs and input encodings from within a Python context. ...

Downloads: 0 This Week

Last Update: 2025-07-08
16

Chitu

High-performance inference framework for large language models

...It supports heterogeneous computing environments, including CPUs, GPUs, and various specialized AI accelerators, allowing models to run across a wide range of infrastructure configurations. Chitu is designed to scale from small single-machine deployments to large distributed clusters that handle high volumes of concurrent inference requests. The system also includes performance optimizations for large models, including support for quantized formats and efficient computation operators that reduce memory usage and latency. Its architecture aims to support enterprise adoption by ensuring stable long-term operation under production workloads.

Downloads: 0 This Week

Last Update: 3 days ago
17

DeepSpeed

Deep learning optimization library: makes distributed training easy

DeepSpeed is an easy-to-use deep learning optimization software suite that enables unprecedented scale and speed for deep learning training and inference. With DeepSpeed you can: (1) train or run inference on dense or sparse models with billions or trillions of parameters; (2) achieve excellent system throughput and efficiently scale to thousands of GPUs; (3) train or run inference on resource-constrained GPU systems; (4) achieve unprecedented low latency and high throughput for inference; (5) achieve extreme...

Downloads: 1 This Week

Last Update: 2026-05-06
18

OneFlow

OneFlow is a deep learning framework designed to be user-friendly

...An extension lets OneFlow target third-party compilers such as XLA, TensorRT, and OpenVINO. The CUDA runtime is statically linked into OneFlow. OneFlow works on the minimum supported driver and any later driver. Distributed performance (efficiency) is the core technical difficulty of a deep learning framework. OneFlow focuses on performance improvement and heterogeneous distributed expansion. It adheres to the core concept and architecture of static compilation and streaming parallelism, and addresses the memory wall challenge at the cluster level. ...

Downloads: 0 This Week

Last Update: 2024-03-11
19

FastChat

Open platform for training, serving, and evaluating language models

FastChat is an open platform for training, serving, and evaluating large language model-based chatbots. If you do not have enough memory, you can enable 8-bit compression by adding the --load-8bit flag to the command. This can reduce memory usage by around half with slightly degraded model quality. It is compatible with the CPU, GPU, and Metal backends. Vicuna-13B with 8-bit compression can run on a single NVIDIA 3090/4080/T4/V100 (16GB) GPU. In addition to that, you can add --cpu-offloading to...

Downloads: 1 This Week

Last Update: 2024-02-11
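
The "around half" figure in the FastChat description can be checked with simple arithmetic on weight storage. The following is a minimal sketch, not FastChat code: it assumes a 16-bit baseline precision (the listing does not state the baseline), a nominal 13 billion parameters for a "13B" model, and it counts weights only, ignoring activations, the KV cache, and any quantization metadata. The function name `weight_memory_gb` is hypothetical.

```python
def weight_memory_gb(n_params: float, bits_per_weight: int) -> float:
    """Approximate memory needed for model weights only, in GB (10**9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Nominal parameter count for a "13B" model (assumption for illustration).
N_PARAMS_13B = 13e9

fp16_gb = weight_memory_gb(N_PARAMS_13B, 16)  # assumed 16-bit baseline
int8_gb = weight_memory_gb(N_PARAMS_13B, 8)   # 8-bit compressed weights

print(f"16-bit weights: {fp16_gb:.0f} GB")
print(f"8-bit weights:  {int8_gb:.0f} GB")
print(f"ratio:          {int8_gb / fp16_gb:.2f}")
```

Under these assumptions the 8-bit weights take half the space of the 16-bit weights, which is consistent with the listing's statement that a 13B model with 8-bit compression can run on a single 16GB GPU. Real usage will be somewhat higher because of the overheads the sketch ignores.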
20

Punica

Serving multiple LoRA fine-tuned LLMs as one

Punica is a system designed to efficiently serve multiple LoRA-fine-tuned large language models within a shared GPU environment. LoRA is a parameter-efficient fine-tuning method that allows developers to adapt large pretrained models to specific tasks by adding lightweight adapter layers rather than retraining the entire model. Punica introduces a serving architecture that allows multiple LoRA adapters to share the same base model during inference, significantly reducing memory consumption and computational overhead. ...

Downloads: 0 This Week

Last Update: 2026-03-09
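
The idea of several LoRA adapters sharing one base model can be illustrated with a toy single-layer example. This is a minimal sketch in plain Python and does not reflect Punica's actual kernels or API: the class `SharedBaseServer`, the adapter names, and the sizes are all hypothetical. Each adapter is a low-rank pair (A, B), and a request is served as `base @ x + B @ (A @ x)`, so the large base matrix is stored once.

```python
import random


def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]


class SharedBaseServer:
    """Toy layer: one shared d x d base weight plus per-task low-rank adapters."""

    def __init__(self, d, rank, adapter_names, seed=0):
        rng = random.Random(seed)
        self.d, self.rank = d, rank
        self.base = rand_matrix(d, d, rng)  # stored once, shared by all adapters
        # Each adapter holds A (rank x d) and B (d x rank).
        self.adapters = {
            name: (rand_matrix(rank, d, rng), rand_matrix(d, rank, rng))
            for name in adapter_names
        }

    def forward(self, name, x):
        a, b = self.adapters[name]
        base_out = matvec(self.base, x)   # shared computation
        delta = matvec(b, matvec(a, x))   # adapter-specific low-rank update
        return [u + v for u, v in zip(base_out, delta)]

    def param_count(self):
        return self.d * self.d + len(self.adapters) * 2 * self.d * self.rank


def separate_copies_param_count(d, n_models):
    """Parameters needed if every fine-tuned model kept its own full weight."""
    return n_models * d * d


server = SharedBaseServer(d=64, rank=4, adapter_names=["chat", "code", "summarize"])
x = [1.0] * 64
y_chat = server.forward("chat", x)
y_code = server.forward("code", x)

shared = server.param_count()
separate = separate_copies_param_count(64, 3)
print(f"shared base + adapters: {shared} parameters")
print(f"three full copies:      {separate} parameters")
```

In this toy setting the shared layout stores fewer parameters than three full copies, and the gap widens as the base matrix grows relative to the adapter rank. That is the memory argument the description makes; how much is saved in a real deployment depends on the model and adapter configuration.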
21

Metaseq

Repo for external large-scale work

Metaseq is a flexible, high-performance framework for training and serving large-scale sequence models, such as language models, translation systems, and instruction-tuned LLMs. Built on top of PyTorch, it provides distributed training, model sharding, mixed-precision computation, and memory-efficient checkpointing to support models with hundreds of billions of parameters. The framework was used internally at Meta to train models like OPT (Open Pre-trained Transformer) and serves as a reference implementation for scaling transformer architectures efficiently across GPUs and nodes. ...

Downloads: 0 This Week

Last Update: 2025-10-06
22

ParlAI

A framework for training and evaluating AI models

...The library integrates tightly with PyTorch and supports both generative and retrieval-augmented models, along with utilities for multitask training and model selection. A large set of built-in tasks and dataset loaders (with consistent preprocessing and metrics) makes it easy to compare methods under shared conditions. Tools for distributed training, mixed precision, and model zoos help scale experiments from laptops to multi-GPU clusters.

Downloads: 0 This Week

Last Update: 2025-10-06
23

Mars Framework

Mars is a tensor-based unified framework for large-scale data

Mars is a distributed computing framework designed to scale scientific computing and data science workloads across large clusters while preserving the familiar programming interfaces of common Python libraries. The project provides a tensor-based execution model that extends the capabilities of tools such as NumPy, pandas, and scikit-learn so that large datasets can be processed in parallel without rewriting code for distributed environments.

Downloads: 0 This Week

Last Update: 2026-03-11
24

MXNet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning

Apache MXNet is a scalable, efficient open-source deep learning framework designed for training and deploying neural networks across heterogeneous systems. It offers a flexible hybrid programming model that lets you mix symbolic and imperative programming to maximize efficiency and productivity, and it supports a wide array of languages. At its core, MXNet contains a dynamic...

Downloads: 1 This Week

Last Update: 2025-08-18
25

TensorFlowOnSpark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters

By combining salient features from the TensorFlow deep learning framework with Apache Spark and Apache Hadoop, TensorFlowOnSpark enables distributed deep learning on a cluster of GPU and CPU servers. It supports both distributed TensorFlow training and inference on Spark clusters, with the goal of minimizing the code changes required to run existing TensorFlow programs on a shared grid.

Downloads: 0 This Week

Last Update: 2024-08-05

© 2026 Slashdot Media. All Rights Reserved.
