Showing 457 open source projects for "gpu max performance"

  • 1
    GPU Hot

    Real-time NVIDIA GPU dashboard

    GPU Hot is an open-source, lightweight monitoring dashboard designed to provide real-time visibility into NVIDIA GPU performance across single machines or entire clusters. The project offers a self-hosted web interface that streams hardware metrics directly from GPU servers, enabling developers, ML engineers, and system administrators to observe GPU utilization and system behavior in real time through a browser.
    Downloads: 1 This Week
  • 2
    Fan Control

    Highly customizable fan controlling software for Windows

    ...The tool supports both manual mode (user directly sets fan speeds) and automated mode (fans follow sensor-driven graphs), enabling flexible tradeoffs between noise, cooling, and performance. Users can create and save different “profiles” (e.g. silent mode, high-performance mode, custom GPU-heavy workloads), then load them as needed. Recent versions switched their driver backend (migrating from WinRing0 to a PawnIO-based driver) to improve compatibility and avoid security/AV-flagging issues.
    Downloads: 214 This Week
  • 3
    Gupax

    GUI Uniting P2Pool And XMRig

    Gupax is a graphical front end that brings P2Pool and XMRig together in one application for Monero mining.
    Downloads: 31 This Week
  • 4
    Ricks-Lab GPU Utilities

    A set of utilities for monitoring and customizing GPU performance

    A set of utilities for monitoring GPU performance and modifying control settings. To get the most out of these utilities, run a kernel that supports the GPUs you have installed. With AMD GPUs, installing the latest AMD GPU driver or ROCm package may provide additional capabilities. With NVIDIA GPUs, nvidia-smi must be installed so that the utilities can read the cards. ...
    Downloads: 0 This Week
  • 5
    AGI (Android GPU Inspector)

    Android GPU Inspector

    Android GPU Inspector (AGI) is a desktop tool for profiling, tracing, and debugging graphics workloads running on Android devices. It helps developers analyze Vulkan and OpenGL ES applications at the system, frame, and draw-call levels to uncover GPU and CPU bottlenecks. AGI captures detailed performance counters, timelines, and pipeline state to reveal stalls, overdraw, shader hotspots, and inefficient resource usage.
    Downloads: 0 This Week
  • 6
    Apple Silicon Guide

    Learn all about the A17 Pro, A16 Bionic, R1, M1-series

    The Apple Silicon Guide repository is a detailed resource for users adopting or migrating to Apple Silicon (M1/M2/Pro/Max/Ultra) Macs, offering a wealth of information on hardware capabilities, software compatibility, performance optimization, and tooling best practices. It covers differences between Apple Silicon and legacy Intel architectures, helping users understand how to choose software builds (arm64 vs. universal), deal with Rosetta 2 translation, and optimize workloads for native performance. ...
    Downloads: 1 This Week
  • 7
    nviwatch

    A blazingly fast Rust-based TUI for managing and monitoring NVIDIA GPUs

    NviWatch is an interactive terminal user interface (TUI) application for monitoring NVIDIA GPU devices and processes. Built with Rust, it provides real-time insights into GPU performance metrics, including temperature, utilization, memory usage, and power consumption.
    Downloads: 0 This Week
  • 8
    how-to-optim-algorithm-in-cuda

    How to optimize some algorithms in CUDA

    ...These examples show how different optimization techniques influence performance on modern GPU hardware and allow readers to experiment with real implementations. The repository also contains extensive learning notes that summarize CUDA programming concepts, GPU architecture details, and performance engineering strategies.
    Downloads: 0 This Week
  • 9
    ParallelStencil.jl

    Package for writing high-level code for parallel stencil computations

    ParallelStencil empowers domain scientists to write architecture-agnostic high-level code for parallel high-performance stencil computations on GPUs and CPUs. Performance similar to CUDA C / HIP can be achieved, which is typically a large improvement over the performance reached when using only CUDA.jl or AMDGPU.jl GPU Array programming. For example, a 2-D shallow ice solver presented at JuliaCon 2020 [1] achieved nearly 20 times the performance of a corresponding GPU Array programming implementation; in absolute terms, it reached 70% of the theoretical upper performance bound of the Nvidia P100 GPU used, as defined by the effective throughput metric, T_eff. ...
    Downloads: 0 This Week
  • 10
    Alacritty

    A cross-platform, GPU-accelerated terminal emulator

    Alacritty describes itself as the fastest open source terminal emulator available. With a strong focus on simplicity and performance, its included features are very carefully considered so that it stays fast. It renders with the GPU, which makes many optimizations possible. In benchmarks against other terminals, Alacritty has been shown to be faster, sometimes by a wide margin. Alacritty requires no additional setup, but still allows configuration of many aspects of the terminal. ...
    Downloads: 2 This Week
  • 11
    CatBoost

    High-performance library for gradient boosting on decision trees

    CatBoost is a fast, high-performance open source library for gradient boosting on decision trees. It supports ranking, classification, regression, and other machine learning tasks, with interfaces for Python, R, Java, and C++. According to the project, CatBoost outperforms other GBDT libraries on many datasets.
    Downloads: 8 This Week
  • 12
    XMRig

    RandomX, KawPow, CryptoNight, AstroBWT and GhostRider unified miner

    XMRig is a high-performance, open-source, cross-platform unified CPU/GPU miner for RandomX, KawPow, CryptoNight, and AstroBWT, and also provides a RandomX benchmark and a stratum proxy. Official binaries are available for Windows, Linux, macOS, and FreeBSD.
    Downloads: 36 This Week
  • 13
    SwiftShader

    SwiftShader is a high-performance CPU-based implementation of the Vulkan graphics API

    SwiftShader is Google’s high-performance CPU-based implementation of the Vulkan 1.3 graphics API, designed to provide a hardware-independent rendering solution for 3D graphics. Unlike traditional GPU drivers, SwiftShader executes graphics commands entirely on the CPU, making it ideal for environments where dedicated graphics hardware is unavailable or unsuitable. It acts as a drop-in replacement for Vulkan drivers, allowing existing applications to run seamlessly by redirecting API calls through its software-based rendering engine. ...
    Downloads: 139 This Week
  • 14
    Fabulously Optimized

    A simple Minecraft modpack focusing on performance and graphics

    A simple Minecraft modpack focused on performance and graphics enhancements, providing a smooth experience through multiple optimization mods.
    Downloads: 19 This Week
  • 15
    CubeCL

    Multi-platform high-performance compute language extension for Rust

    CubeCL is a low-level compute language and compiler framework designed to simplify and optimize GPU programming for high-performance workloads, particularly in machine learning and numerical computing. It provides an abstraction layer that allows developers to write portable, hardware-efficient compute kernels without directly dealing with complex GPU APIs such as CUDA or OpenCL. CubeCL focuses on delivering predictable performance and composability by exposing explicit control over memory layouts, parallelism, and execution patterns while still maintaining a developer-friendly syntax. ...
    Downloads: 0 This Week
  • 16
    Citron Neo

    Research software designed to orchestrate virtual environments

    Citron Neo is an advanced emulator project focused on replicating complex system environments with high performance and flexibility. It is designed to emulate modern console behavior while integrating improvements in CPU emulation, GPU rendering, and memory management. The project incorporates optimizations such as dynamic recompilation and Vulkan-based rendering to enhance performance across supported platforms. It also includes continuous updates that improve compatibility with games and system firmware, reflecting an active development cycle. ...
    Downloads: 199 This Week
  • 17
    llmfit

    157 models, 30 providers, one command to find what runs on your hardware

    llmfit is a terminal-based utility that helps developers determine which large language models can realistically run on their local hardware by analyzing system resources and model requirements. The tool automatically detects CPU, RAM, GPU, and VRAM specifications, then ranks available models based on performance factors such as speed, quality, and memory fit. It provides both an interactive terminal user interface and a traditional CLI mode, enabling flexible workflows for different user preferences. llmfit also supports advanced configurations including multi-GPU setups, mixture-of-experts architectures, and dynamic quantization recommendations. ...
    Downloads: 19 This Week
  • 18
    CUDA Core Compute Libraries (CCCL)

    CUDA Core Compute Libraries

    ...By unifying these components, CCCL reduces duplication and improves developer productivity while maintaining performance across different GPU architectures.
    Downloads: 1 This Week
  • 19
    KonaBess

    A GPU overclock & undervolt tool for various Snapdragon chips

    KonaBess is a straightforward application designed to customize GPU frequency and voltage tables without the need for kernel recompilation. It does this by unpacking the Boot/Vendor Boot image, decompiling and editing the relevant dtb (device tree binary) files, and finally repacking and flashing the modified image. The extent of improvement varies, with some users reporting a roughly 25% reduction in power consumption in a graphics benchmark (4.2 W to 3.2 W) after undervolting...
    Downloads: 51 This Week
  • 20
    NVIDIA Profile Inspector

    Modify game profiles inside the internal driver database

    NVIDIA Profile Inspector is a specialized utility that allows users to view and modify hidden graphics driver settings within NVIDIA’s internal profile database, providing deeper control than the official NVIDIA Control Panel. It exposes advanced and undocumented configuration options that can influence rendering behavior, performance optimization, and compatibility for specific games. Users can create, edit, and assign profiles for individual applications, enabling fine-grained tuning of GPU behavior beyond standard settings. The tool is particularly popular among enthusiasts who want to optimize performance, troubleshoot graphical issues, or enable experimental features such as custom DLSS configurations. ...
    Downloads: 59 This Week
  • 21
    OpenLIT

    OpenLIT is an open-source LLM Observability tool

    OpenLIT is an OpenTelemetry-native tool designed to help developers gain insights into the performance of their LLM applications in production. It automatically collects LLM input and output metadata and monitors GPU performance for self-hosted LLMs. Integrating it into a GenAI project takes a single line of code. Whether you're working with LLM providers such as OpenAI and HuggingFace or with vector databases like ChromaDB, OpenLIT monitors your applications to help improve performance and reliability. ...
    Downloads: 4 This Week
  • 22
    GameMode

    Optimise Linux system performance on demand

    GameMode is a daemon/lib combo for Linux that allows games to request a set of optimizations be temporarily applied to the host OS and/or a game process. GameMode was designed primarily as a stop-gap solution to problems with the Intel and AMD CPU power save or on-demand governors but is now host to a range of optimization features and configurations.
    Downloads: 6 This Week
  • 23
    XenosRecomp

    A tool for converting Xbox 360 shaders to HLSL

    ...The project addresses one of the most complex aspects of console reverse engineering, which is accurately reproducing proprietary GPU behavior in a portable and efficient way. By reconstructing the graphics pipeline, XenosRecomp enables developers to render scenes correctly without relying on emulation layers that can introduce performance overhead or inaccuracies.
    Downloads: 1 This Week
  • 24
    The Ocular Migraine

    The Ocular Migraine {Dev Mode} Master Control Program

    The Ocular Migraine is a powerful utility application designed for Meta Quest VR headsets that provides deep control over system-level performance, recording, and application behavior through an advanced interface. It operates as a “master control program” that leverages ADB-level access to modify device settings that are typically hidden from standard users, enabling extensive customization of CPU, GPU, and rendering parameters. The application allows users to create custom performance profiles that can automatically apply when launching specific games or applications, optimizing both visuals and performance.
    Downloads: 34 This Week
  • 25
    7-max increases the performance of some applications by up to 10-20%. Windows uses small (4 KB) RAM pages by default; 7-max allows applications to use large (2 MB) RAM pages instead.
    Downloads: 47 This Week