Showing 114 open source projects for "gpu max performance"

View related business solutions
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    FairScale

    FairScale

    PyTorch extensions for high performance and large scale training

    ...FairScale puts emphasis on correctness and debuggability, offering hook points, logging, and reference examples for common trainer patterns. Although many ideas have since landed in core PyTorch, FairScale remains a valuable reference and a practical toolbox for squeezing more performance out of multi-GPU and multi-node jobs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    slide-element

    slide-element

    Promise-based library for animating elements with dynamic heights

    ...The animations themselves are powered by the same mechanics used within CSS transitions, making it one of the best ways to pull it off in terms of performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Darknet

    Darknet

    Convolutional Neural Networks

    ...Darknet is lightweight, fast, and easy to compile, making it suitable for research and production use. The repository provides pre-trained models, configuration files, and tools for training custom object detection models. With GPU acceleration via CUDA and OpenCV integration, it achieves high performance in image recognition tasks. Its simplicity, combined with powerful capabilities, has made Darknet one of the most influential projects in the computer vision community.
    Downloads: 28 This Week
    Last Update:
    See Project
  • 4

    TOTimer

    Time-Out-Timer for single threaded environement

    Serve timeouts by calling a service-function periodically and an independent time-source (toticker). If you need a timer and like to try this one, please write your experiences to me in brief! And may be suggestions, of course. 10ms tick time should be possible. May be less. DO NOT USE Version less 0.1.1 - it is definatly buggy! Easy to control timers via Handle, eg auto create&start a timer via totimer_setTimeout function. Optionally use Callback-Functions if timeout...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    Ion

    Ion

    Portable suite of libraries and tools for building client applications

    Ion is a modular C++ toolkit for building high-performance 2D/3D graphics applications with a strong emphasis on portability, correctness, and developer ergonomics. Rather than a monolithic engine, it offers focused libraries—math, image, GPU resource management, shader utilities, remote inspection, and platform abstractions—that you can adopt à la carte. The rendering layer wraps modern OpenGL/OpenGL ES concepts with a carefully layered API that tracks object lifetimes, deduplicates resources, and enables safe multithreaded recording of draw calls. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    JAMon API

    JAMon API

    Monitor Java applications - SQL, HTTP, Methods, Exceptions and more.

    JAMon API is a free, simple, high performance, thread safe, Java API that allows developers to easily monitor the performance and scalability of production applications. JAMon tracks hits, execution times (total, avg, min, max, std dev), and more. * JAMon Users Manual: For more on the JAMon, including installing, configuring, and using it, see http://jamonapi.sourceforge.net/
    Downloads: 28 This Week
    Last Update:
    See Project
  • 7
    Big Sleep

    Big Sleep

    A simple command line tool for text to image generation

    ...You can set the number of classes that you wish to restrict Big Sleep to use for the Big GAN with the --max-classes flag as follows (ex. 15 classes). This may lead to extra stability during training, at the cost of lost expressivity.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    feathersui-starling

    feathersui-starling

    User interface components for Starling Framework, ActionScript 3

    Feathers UI (Starling edition) is a lightweight, open-source library of user interface components designed specifically for use with the Starling Framework. It allows ActionScript developers to build GPU-accelerated interfaces for games and applications that run on desktop and mobile platforms. With a focus on performance and flexibility, Feathers UI includes buttons, sliders, lists, navigators, and layout containers optimized for Starling's rendering pipeline.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Arraymancer

    Arraymancer

    A fast, ergonomic and portable tensor library in Nim

    Arraymancer is a tensor and deep learning library for the Nim programming language, designed for high-performance numerical computations and machine learning applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 10
    sc

    sc

    Common libraries and data structures for C

    Portable, stand-alone C libraries and data structures. Each folder is stand-alone with a single header/source pair in it. There is no build for libraries, just copy files you want. e.g If you want the logger, copy sc_log.h and sc_log.c to your project. High performance & minimal memory usage. Portability between many operating systems and architectures. Tests with 100% branch coverage and multiple sanitizers. Drag & drop source code distribution. There is 100% branch coverage on Linux....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    macOS Simple KVM

    macOS Simple KVM

    Tools to set up a quick macOS VM in QEMU, accelerated by KVM

    ...The repository includes tools for preparing installation media, configuring virtual hardware, and managing VM launch scripts. By using KVM acceleration, the virtual machine runs with near-native performance, making it useful for testing, development, or personal experimentation. The project also supports GPU passthrough and other advanced configurations for users who want a more optimized macOS VM environment. While primarily intended for educational and testing purposes, it demonstrates how macOS can be virtualized outside of Apple hardware.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Tone.js

    Tone.js

    A Web Audio framework for making interactive music in the browser

    ...It has common DAW (digital audio workstation) features for those looking to schedule events and tinker with pre-built synths and effects. There’s also a great selection of high-performance building blocks for signal-processing programmers familiar with languages like Max/MSP. With Tone.js they can create their own synthesizers, effects, and complex control signals.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    GPUImage 2

    GPUImage 2

    Framework for GPU-accelerated video and image processing

    ...By relying on the GPU to run these operations, performance improvements of 100X or more over CPU-bound code can be realized. This is particularly noticeable in mobile or embedded devices. On an iPhone 4S, this framework can easily process 1080p video at over 60 FPS. On a Raspberry Pi 3, it can perform Sobel edge detection on live 720p video at over 20 FPS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Effeckt.css

    Effeckt.css

    A Performant Transitions and Animations Library

    Effeckt.css is a showcase and toolkit of high-performance UI transitions and animations for the web. It catalogs common interaction patterns—button states, list reveals, modals, off-canvas menus, and page transitions—implemented with CSS transforms and opacity for smooth, GPU-friendly motion. The project focuses on practical details such as using compositing-friendly properties, keeping DOM structures lean, and providing hooks so JavaScript can toggle classes without micromanaging animation state. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Video Nonlocal Net

    Video Nonlocal Net

    Non-local Neural Networks for Video Classification

    video-nonlocal-net implements Non-local Neural Networks for video understanding, adding long-range dependency modeling to 2D/3D ConvNet backbones. Non-local blocks compute attention-like responses across all positions in space-time, allowing a feature at one frame and location to aggregate information from distant frames and regions. This formulation improves action recognition and spatiotemporal reasoning, especially for classes requiring context beyond short temporal windows. The repo...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Mixup-CIFAR10

    Mixup-CIFAR10

    mixup: Beyond Empirical Risk Minimization

    mixup-cifar10 is the official PyTorch implementation of “mixup: Beyond Empirical Risk Minimization” (Zhang et al., ICLR 2018), a foundational paper introducing mixup, a simple yet powerful data augmentation technique for training deep neural networks. The core idea of mixup is to generate synthetic training examples by taking convex combinations of pairs of input samples and their labels. By interpolating both data and labels, the model learns smoother decision boundaries and becomes more...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Mocha.jl

    Mocha.jl

    Deep Learning framework for Julia

    ...It offers efficient implementations of gradient descent solvers and common neural network layers, supports optional unsupervised pre-training, and allows switching to a GPU backend for accelerated performance. The development of Mocha.jl happens in relative early days of Julia. Now that both Julia and the ecosystem has evolved significantly, and with some exciting new tech such as writing GPU kernels directly in Julia and general auto-differentiation supports, the Mocha codebase becomes excessively old and primitive. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Deeplearning-papernotes

    Deeplearning-papernotes

    Summaries and notes on Deep Learning research papers

    Deeplearning-papernotes is an implementation of Convolutional Neural Networks for sentence and text classification in TensorFlow, based on a well-known research paper that applies CNN architectures to natural language processing tasks with strong performance in sentiment analysis and similar classification problems. The repository provides the complete network definition, including an embedding layer to convert words into dense representations, convolution and max-pooling layers to extract informative features, and a final softmax classifier to distinguish between target classes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    PixelFlow

    PixelFlow

    A Processing/Java library for high performance GPU-Computing (GLSL)

    PixelFlow is a Processing library focused on advanced graphics and visual effects, offering an extensive suite of GPU-based tools for visual artists, researchers, and creative coders. It enables real-time simulation and rendering of complex effects such as fluid dynamics, reaction-diffusion systems, soft shadows, and more, all powered by GLSL shaders. Its modular structure allows for chaining and composing various visual effects easily, making it ideal for installations, performances, and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Intel neon

    Intel neon

    Intel® Nervana™ reference deep learning framework

    neon is Intel's reference deep learning framework committed to best performance on all hardware. Designed for ease of use and extensibility. See the new features in our latest release. We want to highlight that neon v2.0.0+ has been optimized for much better performance on CPUs by enabling Intel Math Kernel Library (MKL). The DNN (Deep Neural Networks) component of MKL that is used by neon is provided free of charge and downloaded automatically as part of the neon installation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    SoAx

    Structure of Arrays of multiple types

    Structures of arrays (SoA) are generally faster than arrays of structures (AoS) while AoS are more handy. This project (SoAx) combines the advantages of both. By means of C++(11) meta-template programming SoAx achieves maximal performance (efficient use of vector units and cache of modern CPUs) while providing a very convenient user interface (including object-oriented element handling) and flexibility. It has been designed to handle list-like sets of particles (similar to struct {int id;...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Caffe2

    Caffe2

    Caffe2 is a lightweight, modular, and scalable deep learning framework

    Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind. Caffe2 is a deep learning framework that provides an easy and straightforward way for you to experiment with deep learning and leverage community contributions of new models and algorithms. You can bring your creations to scale using the power of GPUs in the cloud or to the masses on mobile with Caffe2’s cross-platform...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Starling Extension Graphics

    Starling Extension Graphics

    flash.display.Graphics style extension for the Starling Flash GPU

    Starling-Extension-Graphics is an extension for the Starling framework (which itself is a GPU-accelerated 2D framework for Flash/AIR via Stage3D). This extension adds graphics primitives (fills, strokes, planes etc.) that mimic flash.display.Graphics-style drawing but implemented in a GPU-friendly manner. It automatically triangulates vector shapes, letting developers use familiar drawing APIs but get performance benefits of GPU rendering via Starling. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Starling Extension Particle System

    Starling Extension Particle System

    A particle system for the Starling framework

    The Starling Extension Particle System is an ActionScript extension for the Starling framework that enables developers to integrate particle effects created with the "Particle Designer" tool by 71squared into Starling-based applications. The demo-directory contains a sample project. To compile it, add a reference to the Starling library and add the source directory that contains the particle system classes. The project contains 4 sample configurations. Switch between configurations in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ND2D

    ND2D

    A Flash Molehill (Stage3D) GPU accelerated 2D game engine

    ND2D is a 2D game framework for Flash that uses Stage3D / Molehill (i.e. the GPU acceleration in newer Flash Player versions). It allows game developers to build 2D games with lots of sprites, leveraging GPU for better performance. It includes display tree constructs, sprite sheets, particle systems, cameras, post-processing etc., made to simplify building high-performance 2D content in Flash. ND2D was built to make an ease use of hardware accelerated 2D content in the Flashplayer. ...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB