Search Results for "gpu max performance" - Page 5

Showing 388 open source projects for "gpu max performance"

1

FurMark

GPU stress test and OpenGL/Vulkan graphics benchmark for Windows/Linux

FurMark is an intensive benchmarking tool designed to evaluate the performance of graphics cards using fur rendering algorithms. This tool is particularly effective in generating high workloads that can significantly increase the temperature of the GPU, making it a useful utility for testing the stability and stress tolerance of graphics cards. By simulating demanding rendering tasks, FurMark serves as a comprehensive test for assessing the robustness and thermal performance of GPUs under extreme conditions. ...

Downloads: 311 This Week

Last Update: 2024-10-28
See Project
2

HLSL++

Math library using HLSL syntax with multiplatform SIMD support

HLSL++ is a header-only C++ math library designed to replicate the syntax and functionality of the HLSL shading language, making it easier for developers to write CPU-side code that mirrors GPU shader logic. It provides vector, matrix, and math operations with a syntax identical or very similar to HLSL, allowing seamless transition between shader code and application code. The library is optimized for performance and supports SIMD instructions across multiple architectures, including SSE, AVX, AVX2, AVX512, and ARM NEON, ensuring high efficiency on modern hardware. ...

Downloads: 2 This Week

Last Update: 2026-04-08
See Project
3

LiveAvatar

Streaming Real-time Audio-Driven Avatar Generation

LiveAvatar is an open-source research and implementation project that provides a unified framework for real-time, streaming, interactive avatar video generation driven by audio and other control signals. It implements techniques from state-of-the-art diffusion-based avatar modeling to support infinite-length continuous video generation with low latency, enabling interactive AI avatars that maintain continuity and realism over extended sessions. The project co-designs algorithms and system...

Downloads: 2 This Week

Last Update: 2026-04-08
See Project
4

Intel Extension for PyTorch

A Python package for extending the official PyTorch

Intel® Extension for PyTorch* extends PyTorch* with up-to-date feature optimizations for an extra performance boost on Intel hardware. Optimizations take advantage of Intel® Advanced Vector Extensions 512 (Intel® AVX-512) Vector Neural Network Instructions (VNNI) and Intel® Advanced Matrix Extensions (Intel® AMX) on Intel CPUs, as well as Intel Xe Matrix Extensions (XMX) AI engines on Intel discrete GPUs. Moreover, Intel® Extension for PyTorch* provides easy GPU acceleration for Intel discrete GPUs through the PyTorch* xpu device.

Downloads: 2 This Week

Last Update: 2025-08-08
See Project
5

OGRE-Next 3D

aka ogre v2 - scene-oriented, flexible 3D C++ engine

OGRE-Next is the next-generation iteration of the OGRE (Object-Oriented Graphics Rendering Engine), a powerful open-source 3D rendering engine designed for real-time applications, games, simulations, and visualizations. It focuses on high-performance rendering pipelines, especially Vulkan and modern OpenGL, offering tools for photorealistic and stylized rendering. OGRE-Next is modular and flexible, providing a developer-friendly environment with scene management, lighting, shadowing, and...

Downloads: 1 This Week

Last Update: 2025-03-21
See Project
6

ChartGPU

Beautiful, open source, WebGPU-based charting library

The ChartGPU repository is an open-source, WebGPU-based charting library written in TypeScript that enables developers to visualize large datasets with high performance and smooth interactivity even when handling millions of data points. By leveraging WebGPU — the next-generation graphics API for the web — ChartGPU offloads rendering work to the GPU, allowing for fast panning, zooming, and real-time updates with minimal latency. This makes the library particularly valuable for data-intensive dashboards, scientific visualizations, and financial charting where performance bottlenecks of traditional canvas or SVG approaches become apparent. ...

Downloads: 0 This Week

Last Update: 2026-02-26
See Project
7

Floem

A native Rust UI library with fine-grained reactivity

Floem is a cross-platform GUI framework for Rust. It aims to be extremely performant while providing world-class developer ergonomics. Supporting both GPU and CPU rendering, Floem gives you performance that's closest to bare metal. Primitives are also provided to help developers write performant UI code without too much effort.

Downloads: 0 This Week

Last Update: 2024-11-15
See Project
8

Arcan

Powerful development framework for creating virtually anything

...At its heart lies a robust and portable multimedia engine, with a well-tested and well-documented Lua scripting interface. The development emphasizes security, debuggability and performance, guided by a principle of least surprise in terms of API design. The main engine has been substantially refactored to reduce input latency; better accommodate variable-refresh-rate displays; prepare for asymmetric uncooperative multi-GPU and GPU handover; and support explicit synchronization and runtime transitions back and forth between low (16-bit), standard (32-bit), and high-definition rendering (10-bit + fp16/fp32).

Downloads: 0 This Week

Last Update: 2025-01-29
See Project
9

GPT4All

Run Local LLMs on Any Device. Open-source

GPT4All is an open-source project that allows users to run large language models (LLMs) locally on their desktops or laptops, eliminating the need for API calls or GPUs. The software provides a simple, user-friendly application that can be downloaded and run on various platforms, including Windows, macOS, and Ubuntu, without requiring specialized hardware. It integrates with the llama.cpp implementation and supports multiple LLMs, allowing users to interact with AI models privately. This...

1 Review

Downloads: 130 This Week

Last Update: 2025-03-17
See Project
10

MESHROOM

3D reconstruction software

Photogrammetry is the science of making measurements from photographs. It infers the geometry of a scene from a set of unordered photographs or videos. Photography is the projection of a 3D scene onto a 2D plane, losing depth information. The goal of photogrammetry is to reverse this process. The dense modeling of the scene is the result yielded by chaining two computer vision-based pipelines, “Structure-from-Motion” (SfM) and “Multi View Stereo” (MVS). Fusion of Multi-bracketing LDR...

1 Review

Downloads: 118 This Week

Last Update: 2025-08-19
See Project
11

QuickViewer

An image/comic viewer application for Windows, Mac and Linux

QuickViewer is a fast and lightweight image viewer designed to handle large image collections and archive formats with blazing performance. Built using C++ and Qt, QuickViewer is optimized for viewing manga, comics, and large photo folders with instant loading and minimal lag. It includes a streamlined interface, hardware-accelerated rendering, and features tailored for browsing through image series efficiently. With support for compressed formats like ZIP and RAR, QuickViewer is especially...

Downloads: 31 This Week

Last Update: 2025-12-12
See Project
12

The Futhark Programming Language

A data-parallel functional programming language

Futhark is a small programming language designed to be compiled into efficient parallel code. It is a statically typed, data-parallel, and purely functional array language in the ML family, and comes with a heavily optimizing ahead-of-time compiler that presently generates either GPU code via CUDA and OpenCL, or multi-threaded CPU code. Futhark is not designed for graphics programming, but can instead use the compute power of the GPU to accelerate data-parallel array computations. The...

Downloads: 3 This Week

Last Update: 2026-04-30
See Project
13

Apollo

The easiest way to stream with the native resolution of your client

Apollo is a self-hosted desktop streaming host designed to enable low-latency game streaming from a personal computer to remote clients using protocols compatible with Moonlight and Artemis. It acts as a server that captures, encodes, and streams desktop or game sessions while supporting hardware acceleration across AMD, Intel, and NVIDIA GPUs. The project includes a web-based interface that allows users to configure streaming settings, manage connected clients, and control application...

Downloads: 58 This Week

Last Update: 2026-03-18
See Project
14

stt

Voice Recognition to Text Tool

...The project is designed to be easy to deploy: you can run a local Python server that exposes an HTTP API for uploading audio/video files and retrieving transcriptions in different formats. It supports GPU acceleration where available, which speeds up processing on compatible hardware, but still offers reliable performance on CPUs alone.

Downloads: 0 This Week

Last Update: 2026-02-17
See Project
15

WhisperLive

A nearly-live implementation of OpenAI's Whisper

WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently. It can handle microphone input, pre-recorded audio files, and...

Downloads: 9 This Week

Last Update: 2026-03-17
See Project
16

Qwen

The official repo of Qwen chat & pretrained large language model

Qwen is a series of large language models developed by Alibaba Cloud, consisting of various pretrained versions like Qwen-1.8B, Qwen-7B, Qwen-14B, and Qwen-72B. These models, which range from smaller to larger configurations, are designed for a wide range of natural language processing tasks. They are openly available for research and commercial use, with Qwen's code and model weights shared on GitHub. Qwen's capabilities include text generation, comprehension, and conversation, making it a...

1 Review

Downloads: 15 This Week

Last Update: 2026-03-05
See Project
17

Unsloth Studio

Unified web UI for training and running open models locally

Unsloth Studio is a web-based interface for running and training AI models locally with a unified and user-friendly experience. It allows users to work with a wide range of models for text, audio, vision, embeddings, and more without relying heavily on cloud infrastructure. Built on top of the Unsloth framework, it focuses on high-performance training with reduced VRAM usage and faster speeds compared to traditional methods. The platform supports fine-tuning, pretraining, and reinforcement...

Downloads: 15 This Week

Last Update: 2 days ago
See Project
18

node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama

...The system automatically detects the available hardware on a machine and selects the most appropriate compute backend, including CPU or GPU acceleration. Developers can use the library to perform tasks such as text generation, conversational chat, embedding generation, and structured output generation. Because it runs models locally, the platform is particularly useful for privacy-sensitive environments or offline AI deployments.

Downloads: 2 This Week

Last Update: 2026-03-17
See Project
19

Face Alignment

2D and 3D face alignment library built using PyTorch

...By default, the package will use the SFD face detector. However, users can alternatively use dlib, BlazeFace, or pre-existing ground truth bounding boxes. While not required, for optimal performance (especially for the detector) it is highly recommended to run the code on a CUDA-enabled GPU. While the work is presented here as a black box, if you want to know more about the intricacies of the method, please check the original paper either on arXiv or my webpage.

Downloads: 2 This Week

Last Update: 2026-04-06
See Project
20

CUDA Agent

Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

CUDA Agent is a research-driven agentic reinforcement learning system designed to automatically generate and optimize high-performance CUDA kernels for GPU workloads. The project addresses the long-standing challenge that efficient CUDA programming typically requires deep hardware expertise by training an autonomous coding agent capable of iterative improvement through execution feedback. Its architecture combines large-scale data synthesis, a skill-augmented CUDA development environment, and long-horizon reinforcement learning to build intrinsic optimization capability rather than relying on simple post-hoc tuning. ...

Downloads: 0 This Week

Last Update: 2026-03-03
See Project
21

Handy STT

A free, open source, and extensible speech-to-text application

Handy is a free, open-source, offline speech-to-text application built for privacy, accessibility, and extensibility. Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active...

Downloads: 62 This Week

Last Update: 2026-04-27
See Project
22

shimmy

Python-free Rust inference server

...This compatibility enables developers to replace remote AI services with locally hosted models while keeping their existing software architecture intact. Shimmy focuses on performance and simplicity, using efficient runtime components to minimize memory usage and startup time compared to heavier inference frameworks. It supports modern model formats such as GGUF and SafeTensors and can automatically discover models stored locally or in common directories used by other AI tools. Advanced capabilities include CPU offloading for Mixture-of-Experts models and GPU acceleration, enabling large models to run on consumer hardware with limited VRAM.

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
23

Audiblez

Generate audiobooks from e-books

Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained...

Downloads: 5 This Week

Last Update: 2025-11-30
See Project
24

JAX Toolbox

Public CI, Docker images for popular JAX libraries

JAX Toolbox is a development toolkit designed to streamline and optimize the use of JAX for machine learning and high-performance computing on NVIDIA GPUs. It provides prebuilt Docker images, continuous integration pipelines, and optimized example implementations that help developers quickly set up and run JAX workloads without complex configuration. The project supports popular JAX-based frameworks and models, including architectures used for large-scale pretraining such as GPT and LLaMA...

Downloads: 2 This Week

Last Update: 2026-04-30
See Project
25

Humanoid-Gym

Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real

Humanoid-Gym is a reinforcement learning framework designed to train locomotion and control policies for humanoid robots using high-performance simulation environments. The system is built on top of NVIDIA Isaac Gym, which allows large-scale parallel simulation of robotic environments directly on GPU hardware. Its primary goal is to enable efficient training of humanoid robots in simulation while enabling policies to transfer effectively to real-world hardware without additional training. ...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project


© 2026 Slashdot Media. All Rights Reserved.
