Showing 114 open source projects for "gpu max performance"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    HTML GL

    HTML GL

    Get as many FPS as you need and amazing effects by rendering HTML/CSS

    60 FPS and amazing effects by rendering HTML/CSS in WebGL, framework agnostic. HTML GL solves "the slow DOM problem" by creating WebGL representations of DOM elements and hiding actual DOM after. This speeds up HTML/CSS animations and transformations by using 3D hardware acceleration and allows to apply OpenGL effects as modern 3D games have. Using HTML GL you still work with HTML/CSS as you are common to, but DOM elements are just facades to their WebGL representations. These GPU...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    WebGL Heatmap

    WebGL Heatmap

    A high performance WebGL/JS heatmap display library

    webgl-heatmap is a browser-side rendering library that uses the GPU via WebGL to draw smooth, continuous heatmaps from large numbers of data points. Instead of relying on CPU-bound canvas operations, it leverages fragment shaders and additive blending to accumulate intensity and colorize results in real time. The API lets you push points or weighted samples into a buffer and then renders a gradient map where hot areas emerge organically from density rather than discrete markers. Because most...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    MXLib is a C++ wrapper around the Intel® Integrated Performance Primitives (IPP) library and NVidia NPP CUDA library. You can use either IPP code (or a subset of functions that do not require IPP) on the CPU side, or use NPP/CUDA on the GPU side, or use both together. The function syntax is similar to that found in MatLab and the library is designed to make it easy to port your code from MatLab to C++.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    HIPAcc

    HIPAcc

    Heterogeneous Image Processing Acceleration (HIPACC) Framework

    HIPAcc development has moved to github: https://github.com/hipacc HIPAcc allows to design image processing kernels and algorithms in a domain-specific language (DSL). From this high-level description, low-level target code for GPU accelerators is generated using source-to-source translation. As back ends, the framework supports CUDA, OpenCL, and Renderscript. HIPAcc allows programmers to develop imaging applications while providing high productivity, flexibility and portability as well as competitive performance: the same algorithm description serves as basis for targeting different GPU accelerators and low-level languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5

    Java/C Comparative Benchmarks

    Java and C Comparative Performance Benchmarks

    A collection of software benchmarks developed to compare the performance of Java with C on identical code. No language libraries were used to avoid implementation differences. Some of the benchmarks are also implemented in Python and Scala. There are benchmarks for bit twiddling, numerical computing, data structure manipulation, concurrent computing, callouts to native libraries, and, graphics processing units (GPU) utilization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    StarlingPunk

    StarlingPunk

    StarlingPunk is a framework built on top the Starling library

    StarlingPunk is a game framework built on top of the Starling GPU-accelerated 2D library (AS3 / Flash / AIR). It is inspired by FlashPunk: it gives structure (entities, worlds), collision detection systems, tile maps, etc., and is intended to help developers organize 2D game code more cleanly while benefiting from Starling’s performance. It has features for quick prototyping and reusing code between projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Books (zengfeng)

    Books (zengfeng)

    Collection of scanned/digitized books, tutorials, and source code

    The “books” repository is a collection of scanned / digitized books, tutorials, source code and documents related to Flash / ActionScript 3.0 and graphics/game programming. It includes Chinese translations and source code for AS3 animation, Stage3D tutorials, 3D mathematics, etc. It is more of a resource repository than a software product.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    LBP in multiple platforms

    LBP implementation in multiple computing platforms (ARM,GPU, DSP...)

    The Local Binary Pattern (LBP) is a texture operator that is used in several different computer vision applications and implemented in a variety of platforms. When selecting a suitable LBP implementation platform, the specific application and its requirements in terms of performance, size, energy efficiency, cost and developing time has to be carefully considered. This is a software toolbox that collects software implementations of the Local Binary Pattern operator in several platforms: - OpenCL for CPU & GPU - OpenCL for GPU (branchless) - C code optimized for ARM - OpenGL ES 2.0 shaders mobile GPUs - C code for TI C64x DSP core (branchless) - C code for TTA processor synthesis If you use the code somewhere, please cite: Bordallo López M., Nieto A., Boutellier J., Hannuksela J., and Silvén O. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Java EE Two-Phase Commit Testkit

    A Portable Thin Client & Server Pair for XA / 2PC Testing

    A portable testing tool to validate the configuration of XA and two-phase commit for Java EE 6 Containers. The tool permits the stress-testing, performance characterization and correct transaction semantics of distributed transactions across Three XA resource managers by Exception Injection. The thin-client can be run from the command-line or inside the IDE. The server side component includes EJB packaging and some simple Web servlet support for retrieving diagnostic and performance data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    "UltraQueue" is a library (DLL) project, written in C++. This library is intended to be used for very fast temporary data storage, aquired by any possible source (File, Serial Port, ...). Users can create multiple queues of differrent sizes & types
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Dictmatch is a high performance multi-mode string match lib which supports full matching and max forward matching. It is used to match substrings which contented in dictionary from string user provides. Its throughout is more than 20w line per sec.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    fossa
    Fossa is a runtime CUDA memory extender for Linux. With fossa, users can run CUDA applications that require more than the available amount of GPU memory. Fossa is targeted toward command line driven applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Rapidlogix
    Rapidlogix is an open source application performance management tool optimized for distributed production environments. It allows to increase time-to-market of performance improvements and prevent performance regression
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A simple, cross platform performance monitoring application specifically designed to be used with nVidia's instrumented driver and the NVPerfSDK to give a graphical representation of internal GPU counters. Support for non-GPU counters is also available.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB