Showing 27 open source projects for "compute"

  • 1
    CUDA Core Compute Libraries (CCCL)

    CUDA Core Compute Libraries

    CCCL, or CUDA Core Compute Libraries, is a unified repository that consolidates several foundational CUDA C++ libraries into a single, cohesive development platform. It brings together Thrust, CUB, and libcudacxx, which collectively provide high-level abstractions, low-level performance primitives, and a CUDA-compatible standard library for GPU programming.
    Downloads: 2 This Week
  • 2

    Halide

    A language for fast, portable data-parallel computation

    ...It was designed to make writing high-performance image and array processing code much easier on modern machines. It works on all major operating systems and supports several CPU architectures (x86, ARM, MIPS, Hexagon, PowerPC) and GPU compute APIs (CUDA, OpenCL, OpenGL, among others). It isn't a standalone programming language, however; rather, it is embedded in C++, which means that you write C++ code that builds an in-memory representation of a Halide pipeline using Halide's C++ API. This representation can then be compiled to an object file, or JIT-compiled and run in the same process. ...
    Downloads: 0 This Week
  • 3
    VulkanSceneGraph

    Vulkan & C++17 based Scene Graph Project

    VulkanSceneGraph (VSG) is a modern, cross-platform, high-performance scene graph library built upon the Vulkan graphics/compute API. The software is written in C++17 and follows the CppCoreGuidelines and FOSS Best Practices. The source code is published under the MIT License, with the exception of vulkan.h, used for Vulkan extensions, which is under Apache License 2.0. This repository contains C++ headers and source and CMake build scripts to build the libvsg library.
    Downloads: 6 This Week
  • 4
    FlashMLA

    FlashMLA: Efficient Multi-head Latent Attention Kernels

    ...The library supports both BF16 and FP16 data types, and includes a paged KV cache implementation with a block size of 64 to manage memory efficiently during decoding. In heavily compute-bound settings, it can reach up to ~660 TFLOPS on H800 SXM5 hardware, while in memory-bound configurations it can push memory throughput to ~3000 GB/s. The team regularly updates it with performance improvements; for example, a 2025 update claims 5% to 15% gains on compute-bound workloads while maintaining API compatibility.
    Downloads: 0 This Week
  • 5

    Microsoft SEAL

    Easy-to-use and powerful homomorphic encryption library

    ...Developed by the Cryptography and Privacy Research group at Microsoft, it enables software engineers to build end-to-end encrypted data storage and computation services that never have to obtain the customer's key. Microsoft SEAL is very easy to use, compile and run in many different environments. Homomorphic encryption is an encryption scheme that allows the cloud to compute directly on the encrypted data, without requiring the data to be decrypted first. The results of encrypted computations remain encrypted and can be decrypted only by the data owner using the secret key.
    Downloads: 2 This Week
  • 6
    NVTX (NVIDIA Tools Extension Library)

    C-based Application Programming Interface (API)

    ...It allows developers to insert markers, ranges, and events directly into their applications, providing contextual insight into how code executes on CPUs and GPUs. These annotations are visualized in tools such as NVIDIA Nsight Systems and Nsight Compute, enabling developers to identify performance bottlenecks, track execution flow, and correlate application behavior with hardware activity. The API is written in C and includes wrappers for C++ and Python, making it accessible across different programming environments and workloads. NVTX is particularly valuable in high-performance computing and AI workloads where understanding concurrency, memory usage, and kernel execution is critical for optimization.
    Downloads: 3 This Week
  • 7
    Point Cloud Library

    A standalone, large scale, open project for 2D/3D image processing

    The Point Cloud Library (PCL) is a standalone, large-scale, open project for 2D/3D image and point cloud processing. PCL is released under the terms of the BSD license, and is thus free for commercial and research use. Whether you’ve just discovered PCL or you’re a long-time veteran, this page contains links to a set of resources that will help consolidate your knowledge of PCL and 3D processing. An additional Wiki resource for developers is available too. To simplify both usage and...
    Downloads: 10 This Week
  • 8
    bitnet.cpp

    Official inference framework for 1-bit LLMs

    ...At its core is bitnet.cpp, a highly optimized C++ backend that supports fast, low-memory inference on both CPUs and GPUs, enabling models such as BitNet b1.58 to run without requiring enormous compute infrastructure. The project’s focus on extreme quantization dramatically reduces memory footprint and energy consumption compared with traditional 16-bit or 32-bit LLMs, making it practical to deploy advanced language understanding and generation models on everyday machines. BitNet is built to scale across architectures, with configurable kernels and tiling strategies that adapt to different hardware, and it supports large models with impressive throughput even on modest resources.
    Downloads: 2 This Week
  • 9
    oneDNN

    oneAPI Deep Neural Network Library (oneDNN)

    This software was previously known as Intel(R) Math Kernel Library for Deep Neural Networks (Intel(R) MKL-DNN) and Deep Neural Network Library (DNNL). oneAPI Deep Neural Network Library (oneDNN) is an open-source cross-platform performance library of basic building blocks for deep learning applications. oneDNN is part of oneAPI. The library is optimized for Intel(R) Architecture Processors, Intel Processor Graphics and Xe Architecture graphics. oneDNN has experimental support for the...
    Downloads: 3 This Week
  • 10
    cuDF

    GPU DataFrame Library

    ...The RAPIDS suite of open-source software libraries aims to enable the execution of end-to-end data science and analytics pipelines entirely on GPUs. It relies on NVIDIA® CUDA® primitives for low-level compute optimization but exposes that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.
    Downloads: 4 This Week
  • 11
    Raspberry Pi GCC Toolchains

    CI maintained precompiled GCC ARM/ARM64 Toolchains for Raspberry Pi

    This project provides the latest Raspberry Pi hardware-optimized GCC cross-compiler and native (ARM & ARM64) automated build scripts and precompiled standalone toolchain binaries, which will save you a great deal of time and help you get started quickly with software development on the Pi.
    Downloads: 166 This Week
  • 12

    UniSIMD-assembler

    SIMD macro assembler unified for ARM, MIPS, PPC and x86

    UniSIMD assembler is a high-level C/C++ macro assembler framework unified across ARM, MIPS, POWER and x86 architectures. It establishes a subset of both BASE and SIMD instruction sets with a clearly defined common API, so that application logic can be written and maintained in one place without code replication. The assembler itself isn't a separate tool, but rather a collection of C/C++ header files, which applications need to include directly in order to use it. At present, Intel...
    Downloads: 0 This Week
  • 13
    MACE

    Deep learning inference framework optimized for mobile platforms

    Mobile AI Compute Engine (MACE for short) is a deep learning inference framework optimized for mobile heterogeneous computing on Android, iOS, Linux and Windows devices. The runtime is optimized with NEON, OpenCL and Hexagon, and the Winograd algorithm is used to speed up convolution operations. Initialization is also optimized to be faster.
    Downloads: 0 This Week
  • 14
    progrep

    Utility to show live progress, status & stats for running simulations

    progrep is a command-line tool (Linux) that shows a live progress report, status & stats of a running simulation or compute job that executes a given number of iterations. It shows % completed, time remaining, time elapsed, number of threads, MPI_Rank (if any), CPU usage & speed (FPS). The FPS measurements may be used in benchmarking, e.g. while optimizing HPC algorithms for performance. progrep supports both single-threaded and parallel (multicore/multinode - e.g.
    Downloads: 0 This Week
  • 15
    Synthclipse

    GLSL shader prototyping tool based on Eclipse IDE.

    Synthclipse is a GLSL shader prototyping tool. It makes it possible to quickly develop all kinds of shaders: Fragment (FS), Vertex (VS), Geometry (GS), Tessellation (TS) and Compute (CS). Shaders can be developed either as Fragx shaders, which are compatible with Fragmentarium and Shadertoy (but limited to FS and VS), or with fully featured JavaScript JSX scripts. The JSX API is consistent with WebGL but in fact exposes the complete OpenGL 4+ API. To install Synthclipse, you can download a prepacked Eclipse with the Synthclipse feature (see the "Files" section above) or update an existing Eclipse using the following Update Site: https://kamil-kolaczynski.github.io/synthclipse/update/ The prepacked version has one advantage: it has nice console log coloring, since it ships with the Grep Console plugin ( https://marketplace.eclipse.org/content/grep-console ) configured specially for Synthclipse logs.
    Downloads: 0 This Week
  • 16
    "CUDA Templates" is a collection of C++ template classes and functions which provide a consistent interface to NVidia's "Compute Unified Device Architecture" (CUDA), hiding much of the complexity of the underlying CUDA functions from the programmer.
    Downloads: 0 This Week
  • 17
    Kalibr Allan

    IMU Allan standard deviation charts

    ...While manufacturers typically provide “white noise” values in IMU datasheets, the bias instability and random walk parameters must be determined experimentally. This project enables users to compute those values using Allan variance analysis from recorded IMU data. The workflow involves recording IMU measurements with the device stationary, converting ROS bag files into MATLAB-compatible formats, and then running MATLAB scripts to generate Allan deviation plots. These plots are analyzed to determine noise density and random walk parameters for both gyroscopes and accelerometers. ...
    Downloads: 3 This Week
  • 18
    properties

    Another C#-like property accessor for C++11 and above.

    A property is a member that provides a flexible mechanism to read, write, or compute the value of a private field. Properties can be used as if they were public data members, but they are actually special methods called accessors. This lets data be accessed easily while still promoting the safety and flexibility of methods.
      * GitHub repository: https://github.com/gammasoft71/properties
      * Homepage: https://gammasoft71.wixsite.com/properties
      * Reference guide: https://codedocs.xyz/gammasoft71/properties/
      * Examples: https://github.com/gammasoft71/properties/tree/master/examples
    Downloads: 1 This Week
  • 19
    SoftPixel Engine

    Realtime 3D engine for game development

    The SoftPixel Engine is an open-source, high-level, real-time 3D engine for C++ (GNU C++ & Visual C++). It supports Direct3D11, Direct3D9, OpenGL 1.1 - 4.1, OpenGL|ES 1.1 and OpenGL|ES 2.0 and runs on Windows, Linux, Android and iOS.
    Downloads: 0 This Week
  • 20
    Parallel Compute
    Center for Parallel Computing
    Downloads: 0 This Week
  • 21
    IterInt is a numerical integration package that implements iterated numerical integration methods. Iterated methods can be applied to low-dimensional integration problems (fewer than ten dimensions, for example) to obtain very accurate results.
    Downloads: 0 This Week
  • 22
    An algorithm to efficiently compute the number of products and the commonality of the features modeled by a Feature Diagram
    Downloads: 0 This Week
  • 23
    HASEM

    HASEM

    C++ matrix library for solving any linear algebra problem

    HASEM is a C++ template library that can easily be configured as an expression template. HASEM solves linear equations of any kind, homogeneous or non-homogeneous, computes the inverse of square and non-square matrices, finds the LU decomposition, determinant, and rank of matrices, calculates the characteristic polynomial of matrices, finds eigenvalues and eigenvectors, and so on. HASEM implements many methods, so you are not limited to just one; you can choose which method to use as the default. ...
    Downloads: 1 This Week
  • 24
    Brook+ is a high-level C-like language with extensions for stream programming on different compute devices such as CPUs and GPUs. It supports ATI CAL and x86 CPU backends. Keywords: GPGPU, GPU Computing, HPC
    Downloads: 0 This Week
  • 25
    A program to compute the dimension of the Hochschild homology k-vector spaces of a k-algebra A.
    Downloads: 0 This Week