Showing 19 open source projects for "gpu max performance"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 1
    Newton

    Newton

    An open-source, GPU-accelerated physics simulation engine

    Newton is a high-performance, GPU-accelerated physics simulation engine designed primarily for robotics research, machine learning, and advanced simulation workflows. Built on top of NVIDIA Warp, it leverages GPU parallelism to deliver scalable and efficient simulation environments that support rapid iteration and experimentation. The engine extends previous simulation frameworks by introducing differentiable physics capabilities, allowing it to integrate seamlessly with machine learning models and optimization pipelines. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    JAX

    JAX

    Composable transformations of Python+NumPy programs

    With its updated version of Autograd, JAX can automatically differentiate native Python and NumPy functions. It can differentiate through loops, branches, recursion, and closures, and it can take derivatives of derivatives of derivatives. It supports reverse-mode differentiation (a.k.a. backpropagation) via grad as well as forward-mode differentiation, and the two can be composed arbitrarily to any order. What’s new is that JAX uses XLA to compile and run your NumPy programs on GPUs and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Parallel and Distributed Process System

    Parallel and Distributed Process System

    NOTICE OF CONSOLIDATION & PARTNERSHIP PENDING As of April 2026, the 20

    NOTICE OF CONSOLIDATION & PARTNERSHIP PENDING As of April 2026, the 20 pipelines of the QCAUS/PDPBioGen suites are undergoing consolidation for high-scale institutional research. Core 'Ford 2026' algorithms remain the proprietary IP of the Ford Peace and Justice Foundation. Academic users at partner institutions are currently performing validation; all other commercial inquiries must contact the author Computational Neuroscience: Large-scale neural population dynamics, brain-inspired...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    MLPACK is a C++ machine learning library with emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and flexibility for expert users. * More info + downloads: https://mlpack.org * Git repo: https://github.com/mlpack/mlpack
    Downloads: 0 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    Arduino HomeKit ESP8266

    Arduino HomeKit ESP8266

    Native Apple HomeKit accessory implementation for the ESP8266 Arduino

    This Arduino library is a native Apple HomeKit accessory implementation for the ESP8266 Arduino core and works without any additional bridges. This project is mainly based on an esp-home kit for ESP-OPEN-RTOS. I ported the RTOS-based implementation of the esp-home kit to the pure Arduino environment, aimed at easy and fast building projects using Arduino IDE (or Eclipse with sloeber, PlatformIO). Enjoy the "one-key" build, "one-key" upload, and work to link various other Arduino libraries...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    RMGDFT

    RMGDFT

    Real Space Multigrid based electronic structure code.

    News: active RMG development has moved to github https://github.com/RMGDFT News: V4.1.0 released on 09/29/2020 News: V4.0.0 released on 09/01/2020 with major updates. News: V3.0.0 released on 06/09/2018 with major updates. News: V2.2.2 released on 10/14/2017 with minor bug fixes. News: V2.2 with performance enhancements, bug fixes and new features released on 06/26/2017. Sources are available with binaries to follow soon. News: V2.1 with many improvements released on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Buddhabrot Max

    Buddhabrot Max

    A Buddhabrot generator for large (>20000² px) renderings.

    A escape time orbit fractal renderer written in C#. Consists of two separate programs, one focused on high resolution and high quality rendering ("Buddhabrot Max"), the other focused on realtime exploration / zooming ("Buddhabrot Mag"). Uses multithreading and hardware acceleration. Allows creating of huge renderings (if enough memory is available) and saving files with full 32 bit float dynamic range as OpenEXR files. You can easily add new formulas which are then runtime compiled for maximum performance.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8

    darc

    Durham Adaptive-optics Real-time Controller

    darc, the Durham Adaptive optics Real-time Controller. For documentation or darctalk client only, select "View all files". For the latest bleeding-edge version, please use: git clone git://git.code.sf.net/p/darc2/code darc (no password required) (this changed May 2013 due to a sourceforge update). If you use darc, please cite with: Basden, A and Myers, R, MNRAS Vol 242, page 1483, 2012
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    SoAx

    Structure of Arrays of multiple types

    Structures of arrays (SoA) are generally faster than arrays of structures (AoS) while AoS are more handy. This project (SoAx) combines the advantages of both. By means of C++(11) meta-template programming SoAx achieves maximal performance (efficient use of vector units and cache of modern CPUs) while providing a very convenient user interface (including object-oriented element handling) and flexibility. It has been designed to handle list-like sets of particles (similar to struct {int id;...
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Nifty Sim
    Nifty Sim is a high-performance nonlinear finite element solver, developed at University College London. A key feature is the option of GPU-based execution, which allows the solver to significantly out-perform equivalent commercial packages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Functy

    Functy

    Functy is a 3D graph drawing package that generates immediate results.

    ...It allows Cartesian, spherical and parametric curve functions to be plotted and altered quickly and easily. This immediacy and the vivid results promote fun exploration of 3D function. Rendering is performed entirely on the GPU to improve visuals and performance. Ubuntu functy PPA: https://launchpad.net/~flypig/+archive/ubuntu/ppa
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12

    cphcttoolbox

    Cph CT Toolbox is a selection of Computed Tomography tools

    Copenhagen Computed Tomography Toolbox is a collection of applications and libraries for flexible and efficient CT reconstruction. The toolbox apps generally take a set of projections (X-ray intensity measurements) and filter and back project them in order to recreate the image or volume that the projections represent. The project includes both mostly informative CPU implementations and highly efficient GPU implementations. Regular releases are hosted at the Python Package Index.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    LightSpMV

    lightweight GPU-based sparse matrix-vector multiplication (SpMV)

    LightSpMV is a novel CUDA-compatible sparse matrix-vector multiplication (SpMv) algorithm using the standard compressed sparse row (CSR) storage format. We have evaluated LightSpMV using various sparse matrices and further compared it to the CSR-based SpMV subprograms in the state-of-the-art CUSP and cuSPARSE. Performance evaluation reveals that on a single Tesla K40c GPU, LightSpMV is superior to both CUSP and cuSPARSE, with a speedup of up to 2.60 and 2.63 over CUSP, and up to 1.93 and 1.79 over cuSPARSE for single and double precision, respectively.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MXLib is a C++ wrapper around the Intel® Integrated Performance Primitives (IPP) library and NVidia NPP CUDA library. You can use either IPP code (or a subset of functions that do not require IPP) on the CPU side, or use NPP/CUDA on the GPU side, or use both together. The function syntax is similar to that found in MatLab and the library is designed to make it easy to port your code from MatLab to C++.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    FreDec

    Parallelized FREquency DEComposer algorithm

    ...After selection of the initial frequency candidates, the algorithm passes through all their possible combinations and estimates their multi-frequency statistical significance. In the end, it prints out the set of largest frequency tuples that were still found significant. The GPU computing is implemented through CUDA and brings a significant performance increase. It is still possible to run FreDec solely on CPU, if no suitable GPU device is available in the system. See the details of the underlying theory in Baluev 2013, MNRAS, V. 436, P. 807 The description of the algorithm itself can be found in arXiv:1309.0100. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    This program is able to detect circles in images. Therefor it uses the hough algorithm CvHoughCircles from the OpenCV library. You can define all parameters e.g. min-/max radius, min distance between circles, all kinds of thresholds, filters and so on.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    LBP in multiple platforms

    LBP implementation in multiple computing platforms (ARM,GPU, DSP...)

    The Local Binary Pattern (LBP) is a texture operator that is used in several different computer vision applications and implemented in a variety of platforms. When selecting a suitable LBP implementation platform, the specific application and its requirements in terms of performance, size, energy efficiency, cost and developing time has to be carefully considered. This is a software toolbox that collects software implementations of the Local Binary Pattern operator in several platforms: - OpenCL for CPU & GPU - OpenCL for GPU (branchless) - C code optimized for ARM - OpenGL ES 2.0 shaders mobile GPUs - C code for TI C64x DSP core (branchless) - C code for TTA processor synthesis If you use the code somewhere, please cite: Bordallo López M., Nieto A., Boutellier J., Hannuksela J., and Silvén O. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    SweetOnionCCG2PTBConverter

    SweetOnionCCG2PTBConverter

    A tool that converts CCGBank to PTB

    Conversion between different grammar frameworks is of great importance to comparative performance analysis of the parsers developed on them. This tool can convert CCG derivations to PTB trees by using Max Entropy models as well as visualizing the tree graphs. The main technical innovation presented here is the effective conversion method which achieves a F score over 95%.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    IC CAD tools, documentation, scripts, and libraries for designing high-performance ICs, including SUE for schematics, MAX for layouts, DPC for datapaths and MCC for megacells. Prebuilt binaries for Linux, Sparc-Solaris, and HP-PA.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB