Search Results for "gpu hardware" - Page 5

Showing 240 open source projects for "gpu hardware"

1

MSI Afterburner

MSI Afterburner: Overclock, monitor, and optimize your GPU.

...Furthermore, MSI Afterburner supports video recording, enabling users to capture their gameplay or overclocking sessions with ease. Its compatibility with all major graphics card brands makes it an indispensable tool for anyone looking to push their hardware to the limit.

Downloads: 77 This Week

Last Update: 2025-07-13
See Project
2

OpenVINO

OpenVINO™ Toolkit repository

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference. Boost deep learning performance in computer vision, automatic speech recognition, natural language processing and other common tasks. Use models trained with popular frameworks like TensorFlow, PyTorch and more. Reduce resource demands and efficiently deploy on a range of Intel® platforms from edge to cloud. This open-source version includes several components: namely Model Optimizer, OpenVINO™ Runtime,...

Downloads: 21 This Week

Last Update: 2026-03-25
See Project
3

gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models

gpt-oss is OpenAI’s open-weight family of large language models designed for powerful reasoning, agentic workflows, and versatile developer use cases. The series includes two main models: gpt-oss-120b, a 117-billion parameter model optimized for general-purpose, high-reasoning tasks that can run on a single H100 GPU, and gpt-oss-20b, a lighter 21-billion parameter model ideal for low-latency or specialized applications on smaller hardware. Both models use a native MXFP4 quantization for efficient memory use and support OpenAI’s Harmony response format, enabling transparent full chain-of-thought reasoning and advanced tool integrations such as function calling, browsing, and Python code execution. ...

1 Review

Downloads: 11 This Week

Last Update: 2026-01-13
See Project
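
As a rough check on the single-GPU claim in the gpt-oss description above, the weight footprint can be estimated from the parameter counts it quotes. This is a back-of-the-envelope sketch, not a statement of how the models are actually stored: it assumes every parameter is held in MXFP4 (4-bit elements plus one shared 8-bit scale per block of 32 values), it ignores activations and the KV cache, and the 80 GB capacity used for the H100 is an assumption that is not stated in the listing. The function names are hypothetical.

```python
def mxfp4_bits_per_weight(block_size: int = 32,
                          element_bits: int = 4,
                          scale_bits: int = 8) -> float:
    """Average storage cost per weight: the element plus its share of the block scale."""
    return element_bits + scale_bits / block_size


def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


ASSUMED_H100_MEMORY_GB = 80  # assumption: 80 GB H100 variant

bits = mxfp4_bits_per_weight()
for name, n_params in [("gpt-oss-120b", 117e9), ("gpt-oss-20b", 21e9)]:
    gb = weight_memory_gb(n_params, bits)
    fits = gb < ASSUMED_H100_MEMORY_GB
    print(f"{name}: ~{gb:.1f} GB of weights at {bits} bits/weight, "
          f"fits in {ASSUMED_H100_MEMORY_GB} GB: {fits}")
```

If some layers are kept at higher precision, the real footprint is larger than this estimate, so the result should be read as a lower bound on weight memory.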
4

CUDA-QX

Accelerated libraries for quantum-classical computing built on CUDA-Q

CUDA-QX is a collection of accelerated libraries built on top of the CUDA-Q platform, designed to enable rapid development of hybrid quantum-classical applications. It extends the CUDA-Q programming model by providing optimized implementations of domain-specific quantum computing primitives and workflows. The libraries are intended to help researchers and developers leverage GPUs, CPUs, and quantum processing units together in a unified computational model. CUDA-QX focuses on key areas such...

Downloads: 0 This Week

Last Update: 2026-04-10
See Project
5

JAX Toolbox

Public CI, Docker images for popular JAX libraries

JAX Toolbox is a development toolkit designed to streamline and optimize the use of JAX for machine learning and high-performance computing on NVIDIA GPUs. It provides prebuilt Docker images, continuous integration pipelines, and optimized example implementations that help developers quickly set up and run JAX workloads without complex configuration. The project supports popular JAX-based frameworks and models, including architectures used for large-scale pretraining such as GPT and LLaMA...

Downloads: 0 This Week

Last Update: 7 days ago
See Project
6

Diffrax

Numerical differential equation solvers in JAX

Diffrax is a numerical differential equation solving library built for the JAX ecosystem, with a strong focus on composability, differentiability, and high-performance scientific computing. The project provides tools for solving ordinary differential equations, stochastic differential equations, controlled differential equations, and related systems in a way that fits naturally into modern machine learning and differentiable programming workflows. Because it is written to work closely with...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
7

CUDA Containers for Edge AI & Robotics

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

CUDA Containers for Edge AI & Robotics is an open-source project that provides a modular container build system designed for running machine learning and AI workloads on NVIDIA Jetson devices. The repository contains container configurations that package the latest AI frameworks and dependencies optimized for Jetson hardware. These containers simplify the deployment of complex machine learning environments by bundling libraries such as CUDA, TensorRT, and deep learning frameworks into reproducible container images. The project is particularly useful for developers building edge AI and robotics systems that rely on GPU-accelerated inference and real-time computer vision. ...

Downloads: 0 This Week

Last Update: 6 days ago
See Project
8

CUDA Agent

Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

CUDA Agent is a research-driven agentic reinforcement learning system designed to automatically generate and optimize high-performance CUDA kernels for GPU workloads. The project addresses the long-standing challenge that efficient CUDA programming typically requires deep hardware expertise by training an autonomous coding agent capable of iterative improvement through execution feedback. Its architecture combines large-scale data synthesis, a skill-augmented CUDA development environment, and long-horizon reinforcement learning to build intrinsic optimization capability rather than relying on simple post-hoc tuning. ...

Downloads: 0 This Week

Last Update: 2026-03-03
See Project
9

EPLB

Expert Parallelism Load Balancer

EPLB is DeepSeek’s open implementation of a load balancing algorithm designed for expert parallelism (EP) settings in MoE architectures. In EP, different “experts” are mapped to different GPUs or nodes, so load imbalance becomes a performance bottleneck if certain experts are invoked much more often. EPLB solves this by duplicating heavily used experts (redundancy) and then placing those duplicates across GPUs to even out computational load. It uses policies like hierarchical load balancing...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
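
The idea in the EPLB description above (duplicate heavily used experts, then spread the copies across GPUs) can be illustrated with a small greedy sketch. This is not DeepSeek's implementation and does not use the EPLB API: the function names, the greedy heuristics, and the example loads are all hypothetical, and the hierarchical policies mentioned in the description are not modeled. It assumes each replica of an expert receives an equal share of that expert's load.

```python
import heapq


def replicate_experts(loads, num_slots):
    """Give every expert one replica, then hand each spare slot to the
    expert whose per-replica load is currently the highest."""
    if num_slots < len(loads):
        raise ValueError("need at least one slot per expert")
    replicas = [1] * len(loads)
    # Max-heap on per-replica load (negated, since heapq is a min-heap).
    heap = [(-load, i) for i, load in enumerate(loads)]
    heapq.heapify(heap)
    for _ in range(num_slots - len(loads)):
        _, i = heapq.heappop(heap)
        replicas[i] += 1
        heapq.heappush(heap, (-loads[i] / replicas[i], i))
    return replicas


def place_replicas(loads, replicas, num_gpus):
    """Place the heaviest replicas first, each on the least-loaded GPU
    that still has a free slot."""
    total = sum(replicas)
    if total % num_gpus:
        raise ValueError("replica count must divide evenly across GPUs")
    slots_per_gpu = total // num_gpus
    items = sorted(
        ((loads[i] / replicas[i], i)
         for i in range(len(loads)) for _ in range(replicas[i])),
        reverse=True,
    )
    gpu_load = [0.0] * num_gpus
    gpu_experts = [[] for _ in range(num_gpus)]
    for load, i in items:
        candidates = [g for g in range(num_gpus)
                      if len(gpu_experts[g]) < slots_per_gpu]
        g = min(candidates, key=lambda g: gpu_load[g])
        gpu_load[g] += load
        gpu_experts[g].append(i)
    return gpu_experts, gpu_load


# Hypothetical token counts for four experts; expert 0 is the hot one.
loads = [90, 30, 20, 20]
replicas = replicate_experts(loads, num_slots=6)
gpu_experts, gpu_load = place_replicas(loads, replicas, num_gpus=2)
print("replicas per expert:", replicas)
print("experts on each GPU:", gpu_experts)
print("load on each GPU:", gpu_load)
```

Without replication, whichever GPU hosts expert 0 would carry at least 90 units of load on its own; splitting the hot expert lets the two GPUs end up with equal totals in this example.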
10

Selkies-GStreamer

Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop

selkies-gstreamer is a GStreamer-based media streaming component used in the Selkies project, a cloud-native platform designed for interactive desktop and application streaming. This module acts as a high-performance media pipeline that captures video, encodes it with low latency, and streams it via WebRTC to client browsers. It is optimized for GPU-accelerated encoding and integrates with Kubernetes-based deployments to enable scalable, real-time remote desktop sessions. This component...

Downloads: 0 This Week

Last Update: 2025-03-27
See Project
11

WebLLM

Bringing large-language models and chat to web browsers

WebLLM is a modular, customizable JavaScript package that brings language model chats directly onto web browsers with hardware acceleration. Everything runs inside the browser with no server support and is accelerated with WebGPU. This opens up a lot of fun opportunities to build AI assistants for everyone and to preserve privacy while enjoying GPU acceleration. WebLLM offers a minimalist and modular interface to access the chatbot in the browser. The WebLLM package itself does not come with a UI and is designed in a modular way to hook into any UI component. ...

Downloads: 0 This Week

Last Update: 2026-03-13
See Project
12

CosyVoice

Multi-lingual large voice generation model, providing inference

CosyVoice is a multilingual large voice generation model that offers a full-stack solution for training, inference, and deployment of high-quality TTS systems. The model supports multiple languages, including Chinese, English, Japanese, Korean, and a range of Chinese dialects such as Cantonese, Sichuanese, Shanghainese, Tianjinese, and Wuhanese. It is designed for zero-shot voice cloning and cross-lingual or mix-lingual scenarios, so a single reference voice can be used to synthesize speech...

Downloads: 2 This Week

Last Update: 2025-11-30
See Project
13

MaxText

A simple, performant and scalable JAX LLM

MaxText is a high-performance, highly scalable open-source framework designed to train and fine-tune large language models using the JAX ecosystem. The project acts as both a reference implementation and a practical training library that demonstrates best practices for building and scaling transformer-based language models on modern accelerator hardware. It is optimized to run efficiently on Google Cloud TPUs and GPUs, enabling researchers and engineers to train models ranging from small...

Downloads: 0 This Week

Last Update: 2026-03-23
See Project
14

Unsloth-MLX

Bringing the Unsloth experience to Mac users via Apple's MLX framework

...This project removes traditional barriers that prevent Mac users from prototyping and experimenting with LLM training locally by allowing the same code used in cloud GPU environments to run on M-series hardware, improving workflow continuity and reducing iteration costs. It supports loading and training Hugging Face models with fine-tuning strategies like SFT, DPO, ORPO, and GRPO and even handles exporting models to formats like GGUF for downstream use, although some limitations apply with quantized models. ...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
15

TensorFlow Probability

Probabilistic reasoning and statistical analysis in TensorFlow

TensorFlow Probability is a library for probabilistic reasoning and statistical analysis. TensorFlow Probability (TFP) is a Python library built on TensorFlow that makes it easy to combine probabilistic models and deep learning on modern hardware (TPU, GPU). It's for data scientists, statisticians, ML researchers, and practitioners who want to encode domain knowledge to understand data and make predictions. Since TFP inherits the benefits of TensorFlow, you can build, fit, and deploy a model using a single language throughout the lifecycle of model exploration and production. ...

Downloads: 0 This Week

Last Update: 2024-11-08
See Project
16

AIDA64 Extreme

AIDA64 Extreme: Ultimate PC diagnostics & system info tool

AIDA64 Extreme - The Ultimate System Diagnostics & Benchmarking Tool

Unlock the full potential of your PC with AIDA64, the industry-leading system information, diagnostics, and benchmarking software. Trusted by PC enthusiasts, IT professionals, and overclockers, AIDA64 provides detailed insights into your hardware, software, and system performance. Optimize your device, troubleshoot issues, and push performance to the max.

Why Choose AIDA64?
- Comprehensive System Info: Get in-depth details on CPU, GPU, RAM, motherboard, and more.
- Advanced Diagnostics: Identify hardware issues and monitor system health in real-time.
- Benchmarking Power: Test CPU, GPU, and memory performance with accurate metrics...

1 Review

Downloads: 112 This Week

Last Update: 2025-06-03
See Project
17

powerMAX

powerMAX is a CPU and GPU burn-in test

powerMAX is a CPU and GPU burn-in tool designed to push your hardware to its absolute thermal and power limits. It helps users uncover stability issues, cooling weaknesses, and power delivery problems by applying maximum, sustained stress to both the processor and graphics card. The utility supports dedicated CPU tests—SSE or AVX—and a demanding GPU 3D rendering test, with the option to run both simultaneously for full-system power load evaluation.

1 Review

Downloads: 24 This Week

Last Update: 2025-11-22
See Project
18

Superposition Benchmark (Unigine)

GPU benchmark testing graphics performance with realistic 3D scenes.

...Widely used by gamers and hardware reviewers, it is proprietary but offers a free edition.

Downloads: 92 This Week

Last Update: 2025-10-07
See Project
19

Deep Java Library (DJL)

An engine-agnostic deep learning framework in Java

...Because DJL is deep learning engine agnostic, you don't have to make a choice between engines when creating your projects. You can switch engines at any point. To ensure the best performance, DJL also provides automatic CPU/GPU choice based on hardware configuration.

1 Review

Downloads: 1 This Week

Last Update: 2025-12-15
See Project
20

MegEngine

Easy-to-use deep learning framework with 3 key features

MegEngine is a fast, scalable, and easy-to-use deep learning framework with 3 key features. You can represent quantization, dynamic shapes, image pre-processing, and even derivation in one model. After training, just put everything into your model and run inference on any platform with ease. Speed and precision problems won't bother you anymore because training and inference share the same core. In training, GPU memory usage could go down to one-third at the cost of only one additional line, which enables the DTR...

Downloads: 0 This Week

Last Update: 2024-04-30
See Project
21

local-llm

Run LLMs locally on Cloud Workstations

local-llm is a development framework that enables developers to run large language models locally within Google Cloud Workstations or standard environments without requiring GPU hardware. It focuses on making generative AI development more accessible by leveraging quantized models and CPU-based execution, eliminating the dependency on expensive GPU infrastructure. The repository includes tools, Docker configurations, and command-line utilities that simplify the process of downloading, running, and interacting with language models directly on local or cloud-based workstations. ...

Downloads: 1 This Week

Last Update: 2026-03-17
See Project
22

VCClient

Software that uses AI to perform real-time voice conversion

...It provides both a graphical user interface and API access, making it suitable for casual users as well as developers who want to integrate voice transformation into their own applications. The project also supports GPU acceleration, enabling faster inference and smoother real-time performance on compatible hardware. Additionally, it includes tools for training and managing voice models, giving users the ability to create personalized voice profiles.

Downloads: 13 This Week

Last Update: 2026-03-23
See Project
23

CPU-Z

Utility for detailed CPU and system hardware information.

CPU-Z for Windows® x86/x64 is a freeware utility that gathers information on some of the main devices of your system: processor name and number, codename, process, package, and cache levels; mainboard and chipset; memory type, size, timings, and module specifications (SPD). It also provides real-time measurement of each core's internal frequency and of the memory frequency. CPU-Z is fully supported on Windows® 11.

1 Review

Downloads: 261 This Week

Last Update: 2025-10-10
See Project
24

MSI Kombustor

Advanced OpenGL and Vulkan graphics card stress testing utility

...The tool provides MSI users with an exclusive, streamlined interface for testing their hardware safely and effectively. By driving high temperatures and peak loads, it reveals whether a graphics card can sustain extended heavy usage. Kombustor is ideal for anyone looking to test, validate, or tune their GPU setup.

1 Review

Downloads: 77 This Week

Last Update: 2025-11-22
See Project
25

StressLab

All-In-One Tool to Test CPU, GPU, RAM, SSDs, TPM/Secure Boot, & more!

StressLab is a free, open-source single-file Windows GUI application built for PC technicians and hardware testers. It combines automated health checks, hardware stress testing with live temperature monitoring, crash dump analysis, and system utilities into one tool — with exportable pass/fail certificates for documentation. Checks for Missing Drivers, BIOS Age, TPM/Secure Boot Status, Windows Activation (with key displayed if activated), and SMART Disk Health. ...

Downloads: 17 This Week

Last Update: 2026-02-27
See Project