distributed computing free download

fugue

A unified interface for distributed computing

Fugue is a unified interface for distributed computing that lets users execute Python, Pandas, and SQL code on Spark, Dask, and Ray with minimal rewrites.

Downloads: 7 This Week

Last Update: 2026-02-20

See Project

EdgeChains

EdgeChains.js is a full-stack GenAI library

EdgeChains.js is a full-stack generative AI library that provides front-end, back-end, APIs, prompt management, and distributed computing capabilities, with core prompts and chains managed declaratively in Jsonnet. At EdgeChains, we take a unique approach to Generative AI - we think Generative AI is a deployment and configuration management challenge rather than a UI and library design pattern challenge. We build on top of a tech that has solved this problem in a different domain - Kubernetes Config Management - and bring that to Generative AI. ...

Downloads: 7 This Week

Last Update: 2025-01-29

See Project

Parallax

Parallax is a distributed model serving framework

Parallax is a decentralized inference framework designed to run large language models across distributed computing resources. Instead of relying on centralized GPU clusters in data centers, the system allows multiple heterogeneous machines to collaborate in serving AI inference workloads. Parallax divides model layers across different nodes and dynamically coordinates them to form a complete inference pipeline. A two-stage scheduling architecture determines how model layers are allocated to available hardware and how requests are routed across nodes during execution. ...

Downloads: 3 This Week

Last Update: 2026-03-09

See Project
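
The layer-splitting idea in the Parallax description can be illustrated with a small sketch. This is not Parallax's scheduler or API: the function names, the capacity numbers, and the proportional-split rule are all assumptions made for illustration. It assigns contiguous layer ranges to nodes in proportion to a made-up capacity score, then runs an input through the stages in order to form one pipeline.

```python
def partition_layers(num_layers, capacities):
    """Split layers 0..num_layers-1 into contiguous ranges, one per node.

    `capacities` maps a node name to a relative capacity score (hypothetical
    units). Range sizes are roughly proportional to capacity.
    """
    total = sum(capacities.values())
    names = list(capacities)
    ranges = {}
    start = 0
    cumulative = 0
    for i, name in enumerate(names):
        cumulative += capacities[name]
        # The last node always takes whatever layers remain.
        if i == len(names) - 1:
            end = num_layers
        else:
            end = round(num_layers * cumulative / total)
        ranges[name] = range(start, end)
        start = end
    return ranges


def run_pipeline(x, layers, ranges):
    """Pass x through every node's layers, in node order."""
    for layer_ids in ranges.values():
        for i in layer_ids:
            x = layers[i](x)
    return x


# Hypothetical cluster: three machines with different capacity scores.
RANGES = partition_layers(32, {"a": 24, "b": 8, "c": 16})

# Toy "layers": layer k adds k to its input.
LAYERS = [lambda v, k=k: v + k for k in range(32)]
RESULT = run_pipeline(0, LAYERS, RANGES)
```

A real system would also have to weigh network latency between nodes and route each request at run time, which the description attributes to the second scheduling stage; the sketch leaves that out.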

Kubeflow Trainer

Distributed AI Model Training and LLM Fine-Tuning on Kubernetes

...The platform supports a wide range of machine learning frameworks, including PyTorch, JAX, Hugging Face, DeepSpeed, and XGBoost, making it highly flexible for different AI use cases. One of its key innovations is the integration of MPI-based distributed computing within Kubernetes, allowing efficient communication between nodes for high-performance training. It also includes advanced scheduling capabilities through integrations with tools like Kueue and Volcano, enabling topology-aware resource allocation and multi-cluster job orchestration.

Downloads: 4 This Week

Last Update: 2026-03-20

See Project

Ray

A unified framework for scalable computing

Modern workloads like deep learning and hyperparameter tuning are compute-intensive and require distributed or parallel execution. Ray makes it effortless to parallelize single-machine code: go from a single CPU to multi-core, multi-GPU or multi-node with minimal code changes. Accelerate your PyTorch and TensorFlow workloads with a more resource-efficient and flexible distributed execution framework powered by Ray. Accelerate your hyperparameter search workloads with Ray Tune. Find the best...

Downloads: 4 This Week

Last Update: 2026-04-19

See Project

Data-Juicer

Data processing for and with foundation models

Data-Juicer is an open-source data processing and augmentation framework designed to enhance the quality and diversity of datasets for machine learning tasks. It includes a modular pipeline for scalable data transformation.

Downloads: 0 This Week

Last Update: 2026-05-29

See Project

XTuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

XTuner is a large-scale training engine designed for efficient training and fine-tuning of modern large language models, particularly mixture-of-experts architectures. The framework focuses on enabling scalable training for extremely large models while maintaining efficiency across distributed computing environments. Unlike traditional 3D parallel training strategies, XTuner introduces optimized parallelism techniques that simplify scaling and reduce system complexity when training massive models. The engine supports training models with hundreds of billions of parameters and enables long-context training with sequence lengths reaching tens of thousands of tokens. ...

Downloads: 2 This Week

Last Update: 2026-03-04

See Project

Matrix

Multi-Agent daTa geneRation Infra and eXperimentation framework

Matrix is a distributed, large-scale engine for multi-agent synthetic data generation and experiments: it provides the infrastructure to run thousands of “agentic” workflows concurrently (e.g. multiple LLMs interacting, reasoning, generating content, data-processing pipelines) by leveraging distributed computing (like Ray + cluster management). The idea is to treat data generation as a “data-to-data” transformation: each input item defines a task, and the runtime orchestrates asynchronous, peer-to-peer agent workflows, avoiding global synchronization bottlenecks. ...

Downloads: 0 This Week

Last Update: 2026-03-05

See Project

Angel

A Flexible and Powerful Parameter Server for large-scale ML

Angel is a high-performance distributed machine learning and graph computing platform based on the philosophy of Parameter Server. It is tuned for performance with big data from Tencent and has a wide range of applicability and stability, demonstrating an increasing advantage in handling higher-dimension models. Angel is jointly developed by Tencent and Peking University, taking account of both high availability in industry and innovation in academia.

Downloads: 5 This Week

Last Update: 2025-09-03

See Project

OSMO

The developer-first platform for scaling complex Physical AI workloads

OSMO is a developer-first orchestration platform designed to scale complex physical AI workflows across heterogeneous computing environments, including cloud GPUs, simulation clusters, and edge devices. It was originally built internally at NVIDIA to support robotics and embodied AI systems, where workflows span multiple stages such as data generation, training, simulation, and hardware testing. The platform addresses what NVIDIA refers to as the “three computer problem” by unifying these...

Downloads: 4 This Week

Last Update: 2026-05-22

See Project

mlforecast

Scalable machine learning for time series forecasting

...It supports multi-series forecasting, meaning you can train one model that forecasts many time series at once (common in retail, demand forecasting, etc.), rather than one model per series. The library is built to scale: behind the scenes, it can leverage distributed computing frameworks (Spark, Dask, Ray) when datasets or the number of series grow large.

Downloads: 5 This Week

Last Update: 2026-03-10

See Project

HolmesGPT

CNCF Sandbox Project

HolmesGPT is an open-source AI agent designed to help DevOps and site reliability engineering teams diagnose and resolve production incidents. The system aggregates signals from observability tools such as logs, metrics, alerts, and distributed traces, then analyzes them using large language models to identify potential root causes. Rather than requiring engineers to manually correlate large volumes of monitoring data, HolmesGPT automatically synthesizes evidence and presents explanations in natural language. The project is developed by Robusta and has been accepted as a Cloud Native Computing Foundation Sandbox project, highlighting its relevance to the cloud-native ecosystem. ...

Downloads: 8 This Week

Last Update: 2026-05-28

See Project

EvoTorch

Advanced evolutionary computation library built on top of PyTorch

EvoTorch is an evolutionary optimization framework built on top of PyTorch, developed by NNAISENSE. It is designed for large-scale optimization problems, particularly those that require evolutionary algorithms rather than gradient-based methods.

Downloads: 0 This Week

Last Update: 2025-05-14

See Project

Apache Hamilton

Helps data scientists define testable self-documenting dataflows

Apache Hamilton is an open-source Python framework designed to simplify the creation and management of dataflows used in analytics, machine learning pipelines, and data engineering workflows. The framework enables developers to define data transformations as simple Python functions, where each function represents a node in a dataflow graph and its parameters define dependencies on other nodes. Hamilton automatically analyzes these functions and constructs a directed acyclic graph...

Downloads: 4 This Week

Last Update: 2026-04-04

See Project
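
The function-as-node idea in the Hamilton description can be sketched with the standard library alone. This is not Hamilton's API: `build_graph`, `execute`, and the example functions are hypothetical names chosen for illustration. Each function's name is a node, and its parameter names are the nodes it depends on.

```python
import inspect


def build_graph(functions):
    """Map each function name to the parameter names it depends on."""
    return {f.__name__: list(inspect.signature(f).parameters) for f in functions}


def execute(functions, inputs, target):
    """Compute `target` by recursively resolving its dependencies.

    `inputs` supplies values for nodes that no function defines.
    Cycle detection is omitted to keep the sketch short.
    """
    by_name = {f.__name__: f for f in functions}
    cache = dict(inputs)

    def resolve(name):
        if name in cache:
            return cache[name]
        if name not in by_name:
            raise KeyError(f"no input or function named {name!r}")
        fn = by_name[name]
        args = {p: resolve(p) for p in inspect.signature(fn).parameters}
        cache[name] = fn(**args)
        return cache[name]

    return resolve(target)


# Hypothetical dataflow: parameter names refer to inputs or other functions.
def spend_per_signup(spend, signups):
    return spend / signups


def spend_doubled(spend):
    return spend * 2


def report(spend_per_signup, spend_doubled):
    return {"per_signup": spend_per_signup, "doubled": spend_doubled}


FUNCS = [spend_per_signup, spend_doubled, report]
GRAPH = build_graph(FUNCS)
OUTPUT = execute(FUNCS, {"spend": 100.0, "signups": 4}, "report")
```

Because dependencies are read from the signatures, each transformation stays an ordinary function that can be unit-tested on its own, which fits the "testable" claim in the tagline.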

Chitu

High-performance inference framework for large language models

...The framework focuses on improving efficiency, flexibility, and scalability for organizations that need to run LLM inference workloads across different hardware platforms. It supports heterogeneous computing environments, including CPUs, GPUs, and various specialized AI accelerators, allowing models to run across a wide range of infrastructure configurations. Chitu is designed to scale from small single-machine deployments to large distributed clusters that handle high volumes of concurrent inference requests. The system also includes performance optimizations for large models, including support for quantized formats and efficient computation operators that reduce memory usage and latency. ...

Downloads: 7 This Week

Last Update: 7 days ago

See Project

AlphaZero.jl

A generic, simple and fast implementation of DeepMind's AlphaZero

...Because AlphaZero is resource-hungry, successful open-source implementations (such as Leela Zero) are written in low-level languages (such as C++) and optimized for highly distributed computing environments. This makes them hard to access for students, researchers and hackers. Many simple Python implementations can be found on GitHub, but none of them is able to beat a reasonable baseline on games such as Othello or Connect Four. As an illustration, the benchmark in the README of the most popular of them only features a random baseline, along with a greedy baseline that does not appear to be significantly stronger.

Downloads: 16 This Week

Last Update: 2025-12-12

See Project

DGL

Python package built to ease deep learning on graphs

...DGL provides a powerful graph object that can reside on either CPU or GPU. It bundles structural data as well as features for better control. We provide a variety of functions for computing with graph objects, including efficient and customizable message passing primitives for Graph Neural Networks.

Downloads: 4 This Week

Last Update: 2024-08-29

See Project

TAME LLM

Traditional Mandarin LLMs for Taiwan

TAME LLM is an open-source initiative focused on building and releasing large language models optimized for Traditional Mandarin and the linguistic context of Taiwan. The project includes models such as Llama-3-Taiwan-70B, which are fine-tuned versions of large transformer architectures trained on extensive corpora containing both Traditional Mandarin and English text. These models are designed to support applications such as conversational AI, knowledge retrieval, and domain-specific...

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

FRODO 2

Open-Source Framework for Distributed Constraint Optimization (DCOP)

FRODO is a Java platform to solve Distributed Constraint Satisfaction Problems (DisCSPs) and Optimization Problems (DCOPs). It provides implementations for a variety of algorithms, including DPOP (and its variants), ADOPT, SynchBB, DSA...

3 Reviews

Downloads: 2 This Week

Last Update: 2025-10-05

See Project

Bandicoot

fast C++ library for GPU linear algebra & scientific computing

* Fast GPU linear algebra library (matrix maths) for the C++ language, aiming towards a good balance between speed and ease of use
* Provides high-level syntax and functionality deliberately similar to Matlab
* Provides an API that is aiming to be compatible with Armadillo for easy transition between CPU and GPU linear algebra code
* Useful for algorithm development directly in C++, or quick conversion of research code into production environments
* Distributed under the permissive...

Downloads: 8 This Week

Last Update: 2026-05-08

See Project

Gorgonia

Gorgonia is a library that helps facilitate machine learning in Go

Gorgonia is a library that helps facilitate machine learning in Go. Write and evaluate mathematical equations involving multidimensional arrays easily. If this sounds like Theano or TensorFlow, it's because the idea is quite similar. Specifically, the library is pretty low-level, like Theano, but has higher goals like TensorFlow. The primary goal for Gorgonia is to be a highly performant machine...

Downloads: 1 This Week

Last Update: 2023-10-13

See Project

LLM Applications

A comprehensive guide to building RAG-based LLM applications

...It provides step-by-step guidance for constructing systems that ingest documents, split them into chunks, generate embeddings, index them in vector databases, and retrieve relevant context during inference. The repository also shows how these components can be scaled and deployed using distributed computing frameworks such as Ray. In addition to development workflows, the project includes notebooks, datasets, and evaluation tools that help developers experiment with different retrieval strategies and model configurations.

Downloads: 0 This Week

Last Update: 2026-03-15

See Project

Mars Framework

Mars is a tensor-based unified framework for large-scale data

Mars is a distributed computing framework designed to scale scientific computing and data science workloads across large clusters while preserving the familiar programming interfaces of common Python libraries. The project provides a tensor-based execution model that extends the capabilities of tools such as NumPy, pandas, and scikit-learn so that large datasets can be processed in parallel without rewriting code for distributed environments.

Downloads: 7 This Week

Last Update: 2026-03-11

See Project

Agentopia

Java5 mobile agents in peer2peer containers without stubs/skeletons.

Agentopia is a programming framework (API) for Java 5 mobile agents in peer-to-peer networks. Main features: routing around firewalls, anonymity, and very easy authoring of new agents. No RMI, no CORBA, just plain Java bytecode loading.

Downloads: 0 This Week

Last Update: 2020-10-21

See Project

spark-ml-source-analysis

Analysis of Spark ML algorithm principles and the corresponding source code

...Each section discusses both the mathematical principles behind the algorithms and how Spark implements them in a distributed computing environment. By studying these implementations, readers gain insight into how large-scale machine learning pipelines operate across distributed data systems.

Downloads: 0 This Week

Last Update: 2026-03-12

See Project

Search Results for "distributed computing"

Showing 84 open source projects for "distributed computing"


