42 projects for "gpu max performance" with 2 filters applied:

  • 1
    Numba CUDA Target

    The CUDA target for Numba

    Numba CUDA Target is the NVIDIA-maintained CUDA backend for the Numba JIT compiler, enabling developers to write GPU-accelerated code directly in Python. It allows users to define CUDA kernels using Python syntax, which are then compiled into efficient GPU code at runtime using LLVM-based toolchains. This approach significantly lowers the barrier to entry for GPU programming by eliminating the need to write CUDA C++ while still delivering high performance.
    Downloads: 2 This Week
  • 2
    Butterchurn

    Butterchurn is a WebGL implementation of the Milkdrop Visualizer

    ...The project emphasizes both artistic expression and technical performance, offering a balance between visual complexity and efficiency.
    Downloads: 5 This Week
  • 3
    lru-cache

    A fast cache that automatically deletes the least recently used items

    ...It offers flexible configuration options such as maximum size limits, time-based expiration, and custom disposal logic. Developers can use it to cache expensive computations, API responses, or frequently accessed data. The implementation focuses on correctness, speed, and compatibility with modern Node.js environments. Overall, node-lru-cache provides a reliable building block for performance optimization in JavaScript backends.
    Downloads: 2 This Week
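
To make the lru-cache description above concrete (a maximum size, time-based expiration, and a disposal hook), here is a minimal Python sketch of the same idea. It is not the lru-cache API, which is a JavaScript library; the class name `LRUCache`, the parameters `max_size`, `ttl`, `dispose`, and `clock`, and the reason strings are all hypothetical names chosen for this illustration.

```python
import time
from collections import OrderedDict


class LRUCache:
    """Illustrative LRU cache; names and behavior are assumptions, not lru-cache's API."""

    def __init__(self, max_size, ttl=None, dispose=None, clock=time.monotonic):
        if max_size <= 0:
            raise ValueError("max_size must be positive")
        self.max_size = max_size
        self.ttl = ttl              # seconds until an entry expires, or None
        self.dispose = dispose      # optional callback(key, value, reason)
        self._clock = clock         # injectable so expiry can be tested
        self._items = OrderedDict() # key -> (value, expires_at); oldest first

    def get(self, key, default=None):
        entry = self._items.get(key)
        if entry is None:
            return default
        value, expires_at = entry
        if expires_at is not None and self._clock() >= expires_at:
            self._remove(key, "expire")
            return default
        self._items.move_to_end(key)  # mark as most recently used
        return value

    def set(self, key, value):
        if key in self._items:
            self._remove(key, "set")
        expires_at = None if self.ttl is None else self._clock() + self.ttl
        self._items[key] = (value, expires_at)
        # Evict least recently used entries until the size limit holds.
        while len(self._items) > self.max_size:
            oldest_key = next(iter(self._items))
            self._remove(oldest_key, "evict")

    def _remove(self, key, reason):
        value, _ = self._items.pop(key)
        if self.dispose is not None:
            self.dispose(key, value, reason)

    def __len__(self):
        return len(self._items)
```

Reading a key moves it to the "most recently used" end, so the entry evicted on overflow is always the one that has gone longest without a read or write.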
  • 4
    webgl-plot

    A high-performance real-time 2D plotting library based on native WebGL

    ...Its minimal memory footprint and GPU acceleration ensure excellent performance even with tens of thousands of data points, and its simple API allows developers to get started quickly.
    Downloads: 1 This Week
  • 5
    libplacebo

    Official mirror of libplacebo

    libplacebo is a flexible, high-performance graphics library built on top of Vulkan, designed to provide reusable GPU-accelerated components for media applications. It originated as a core part of the rendering pipeline for the mpv media player and has since grown into a standalone library used for tone mapping, dithering, color space conversion, and more. libplacebo is ideal for developers looking to integrate sophisticated video rendering and post-processing into their own applications with full control over shaders and rendering stages.
    Downloads: 1 This Week
  • 6
    XFrames

    GPU-accelerated GUI development for Node.js and the browser

    xframes is a high-performance library that empowers developers to build native desktop applications using familiar web technologies, specifically Node.js and React, without the overhead of the DOM. xframes serves as a streamlined alternative to Electron, designed for developers looking to maximize performance and efficiency.
    Downloads: 0 This Week
  • 7
    FlashMLA

    FlashMLA: Efficient Multi-head Latent Attention Kernels

    FlashMLA is a high-performance decoding kernel library designed especially for Multi-Head Latent Attention (MLA) workloads, targeting NVIDIA Hopper GPU architectures. It provides optimized kernels for MLA decoding, including support for variable-length sequences, helping reduce latency and increase throughput in model inference systems using that attention style.
    Downloads: 0 This Week
  • 8
    Zoo Design Studio

    The Zoo Design Studio app

    ...Users can interact with the system through a familiar point-and-click interface, but every action is translated into code in the underlying modeling language, ensuring consistency between visual and programmatic representations. The application is powered by a GPU-first geometry engine that streams rendered output as video frames, enabling high-performance modeling even when heavy computation is offloaded to remote infrastructure. It uses WebSockets for real-time communication between the client and the modeling engine, allowing immediate feedback and interactive design updates.
    Downloads: 8 This Week
  • 9
    Floem

    A native Rust UI library with fine-grained reactivity

    Floem is a cross-platform GUI framework for Rust. It aims to be extremely performant while providing world-class developer ergonomics. Supporting both GPU and CPU rendering, Floem gives you performance that's closest to bare metal. It also provides primitives that help developers write performant UI code without too much effort.
    Downloads: 0 This Week
  • 10
    ChartGPU

    Beautiful, open source, WebGPU-based charting library

    The ChartGPU repository is an open-source, WebGPU-based charting library written in TypeScript that enables developers to visualize large datasets with high performance and smooth interactivity even when handling millions of data points. By leveraging WebGPU — the next-generation graphics API for the web — ChartGPU offloads rendering work to the GPU, allowing for fast panning, zooming, and real-time updates with minimal latency. This makes the library particularly valuable for data-intensive dashboards, scientific visualizations, and financial charting where performance bottlenecks of traditional canvas or SVG approaches become apparent. ...
    Downloads: 0 This Week
  • 11
    GPUPixel

    Real-time image and video processing library similar to GPUImage

    GPUPixel is a real-time image and video processing library written in C++11, based on OpenGL/ES. It offers functionalities similar to GPUImage, including built-in beauty filters, enabling efficient processing and rendering of visual effects on images and videos.
    Downloads: 1 This Week
  • 12
    PyOpenCL

    OpenCL integration for Python, plus shiny features

    PyOpenCL is a Python wrapper for the OpenCL framework, providing seamless access to parallel computing on CPUs, GPUs, and other accelerators. It enables developers to harness the full power of heterogeneous computing directly from Python, combining Python’s ease of use with the performance benefits of OpenCL. PyOpenCL also includes convenient features for managing memory, compiling kernels, and interfacing with NumPy, making it a preferred choice in scientific computing, data analysis, and...
    Downloads: 1 This Week
  • 13
    bitnet.cpp

    Official inference framework for 1-bit LLMs

    bitnet.cpp is the official open-source inference framework and ecosystem designed to enable ultra-efficient execution of 1-bit large language models (LLMs), which quantize most model parameters to ternary values (-1, 0, +1) while maintaining competitive performance with full-precision counterparts. At its core is bitnet.cpp, a highly optimized C++ backend that supports fast, low-memory inference on both CPUs and GPUs, enabling models such as BitNet b1.58 to run without requiring enormous...
    Downloads: 2 This Week
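
The bitnet.cpp description above mentions quantizing weights to the ternary values -1, 0, and +1. One published way to do this is the absmean scheme associated with BitNet b1.58: divide each weight by the mean absolute value of the tensor, round to the nearest integer, and clip to the range [-1, 1]. The Python sketch below assumes that scheme and is not taken from the bitnet.cpp codebase; the function names `quantize_ternary` and `dequantize` and the `eps` parameter are hypothetical.

```python
def quantize_ternary(weights, eps=1e-8):
    """Map a flat list of floats to values in {-1, 0, 1} plus one shared scale.

    Assumes absmean scaling; real kernels operate on packed tensors instead.
    """
    if not weights:
        raise ValueError("weights must be non-empty")
    # Shared scale: the mean absolute value of all weights.
    scale = sum(abs(w) for w in weights) / len(weights)
    # Round each scaled weight, then clip it into the ternary range.
    quantized = [max(-1, min(1, round(w / (scale + eps)))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate float weights from ternary values and the scale."""
    return [q * scale for q in quantized]


q, scale = quantize_ternary([0.9, -1.1, 0.05, 0.0])
approx = dequantize(q, scale)
```

Each weight then needs only a ternary value plus one shared scale per tensor, which is the source of the low memory footprint the description refers to.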
  • 14
    Recursive Language Models

    General plug-and-play inference library for Recursive Language Models

    RLM (short for Reinforcement Learning Models) is a modular framework that makes it easier to build, train, evaluate, and deploy reinforcement learning (RL) agents across a wide range of environments and tasks. It provides a consistent API that abstracts away many of the repetitive engineering patterns in RL research and application work, letting developers focus on modeling, experimentation, and fine-tuning rather than infrastructure plumbing. Within the framework, you can define custom...
    Downloads: 0 This Week
  • 15
    glsl-sandbox

    Shader editor and gallery

    GLSL Sandbox is an in-browser playground for writing and sharing fragment shaders with instant visual feedback. It provides a minimal editor and a fullscreen WebGL viewport so your shader takes center stage, making it perfect for learning, live-coding, and showcasing visual experiments. The environment injects a small set of uniforms—time, resolution, mouse—so you can animate and interact without boilerplate. A public gallery lets creators browse, fork, and remix shaders, turning the site...
    Downloads: 2 This Week
  • 16
    Hasktorch

    Tensors and neural networks in Haskell

    Hasktorch is a powerful Haskell library for tensor computation and neural network modeling, built on top of libtorch (the backend of PyTorch). It brings differentiable programming, automatic differentiation, and efficient tensor operations into Haskell’s strongly typed functional paradigm. This project is in active development, so expect changes to the library API as it evolves. New users are invited to join the Hasktorch Discord space for questions and discussions....
    Downloads: 0 This Week
  • 17
    Bottleneck Calculator

    Check CPU and GPU balance with real-time bottleneck analysis

    PC Bottleneck Calculator is a performance analysis tool that helps PC gamers and builders identify CPU or GPU bottlenecks in their systems. It compares hardware data with real-world benchmarks to estimate system balance. Users can instantly see how well their CPU and GPU pair together, test different configurations, and understand which component limits their gaming performance. Built with a clean, responsive interface, the tool offers quick, data-driven results without requiring downloads or complex setup. Website: www.pcbottleneckcalculator.io
    Downloads: 0 This Week
  • 18
    QtAV

    A multimedia framework based on Qt and FFmpeg

    QtAV is a cross-platform, high-performance multimedia playback framework based on Qt and FFmpeg. Features: timeline preview, GPU decoding, etc.
    Downloads: 28 This Week
  • 19
    Glumpy

    Python+NumPy+OpenGL, scalable and beautiful scientific visualization

    Glumpy is a Python library that simplifies the development of high-performance, interactive OpenGL visualizations. It abstracts complex OpenGL tasks into Pythonic constructs, making it easier for scientists, artists, and developers to harness the power of the GPU for real-time rendering and data visualization. Glumpy is particularly well-suited for rapid prototyping of graphical applications, and its integration with NumPy and shader programming makes it a powerful tool for both research and creative exploration.
    Downloads: 0 This Week
  • 20
    pipeless

    A computer vision framework to create and deploy apps in minutes

    ...You can easily use industry-standard models, such as YOLO, or load your custom model in one of the supported inference runtimes. Pipeless ships some of the most popular inference runtimes, such as the ONNX Runtime, allowing you to run inference with high performance on CPU or GPU out-of-the-box. You can deploy your Pipeless application with a single command to edge and IoT devices or the cloud.
    Downloads: 0 This Week
  • 21
    MLPACK is a C++ machine learning library with an emphasis on scalability, speed, and ease of use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and flexibility for expert users. More info and downloads: https://mlpack.org. Git repo: https://github.com/mlpack/mlpack
    Downloads: 0 This Week
  • 22
    MetalPetal

    A GPU accelerated image and video processing framework built on Metal

    MetalPetal is an image processing framework based on Metal, designed to provide real-time processing for still images and video with easy-to-use programming interfaces. This chapter covers the key concepts of MetalPetal, and will help you to get a better understanding of its design, implementation, performance implications, and best practices. An MTIImage object is a representation of an image to be processed or produced. It does not directly represent image bitmap data; instead, it has all the...
    Downloads: 0 This Week
  • 23
    Veldrid

    A low-level, portable graphics library for .NET

    Veldrid is a low-level, portable graphics library for .NET, providing a unified API over multiple graphics backends such as Direct3D, Vulkan, OpenGL, and Metal. It enables developers to write high-performance, cross-platform graphics applications without being tied to a specific graphics API. Veldrid is suitable for game development, simulations, and other applications requiring advanced graphics capabilities.
    Downloads: 0 This Week
  • 24
    FairScale

    PyTorch extensions for high performance and large scale training

    ...FairScale puts emphasis on correctness and debuggability, offering hook points, logging, and reference examples for common trainer patterns. Although many ideas have since landed in core PyTorch, FairScale remains a valuable reference and a practical toolbox for squeezing more performance out of multi-GPU and multi-node jobs.
    Downloads: 0 This Week
  • 25
    slide-element

    Promise-based library for animating elements with dynamic heights

    ...The animations themselves are powered by the same mechanics used within CSS transitions, making it one of the best ways to pull it off in terms of performance.
    Downloads: 0 This Week