Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "gpu processing" - Page 3

x

Sort By:

Relevance

OS

Windows 168
Linux 163
Mac 146
More...
BSD 49
ChromeOS 46
Mobile Operating Systems 9
Desktop Operating Systems 3
Server Operating Systems 2

Category

Artificial Intelligence 82
Multimedia 55
Software Development 41
Scientific/Engineering 24
Business 14
System 14
Games 6
Education 4
Security 3
Blockchain 2
Internet 2
Database 1

License

OSI-Approved Open Source 144
Other License 2
Creative Commons Attribution License 1
Public Domain 1

Translations

English 18
French 3
Italian 3
Russian 3
More...
Chinese (Simplified) 2
German 2
Polish 2
Portuguese 2
Ukrainian 2
Belarusian 1
Chinese (Traditional) 1
Dutch 1
Indonesian 1
Japanese 1
Spanish 1

Programming Language

Python 60
C++ 54
C 20
Julia 7
More...
Java 6
Rust 5
C# 4
ActionScript 3
JavaScript 3
TypeScript 3
Unix Shell 3
GLSL (OpenGL Shading Language) 2
Go 2
Lua 2
Objective C 2
Swift 2
VHDL/Verilog 2
Delphi/Kylix 1
Free Pascal 1
Lazarus 1
Objective-C 2.0 1
PHP 1
S/R 1
Yacc 1

Status

Production/Stable 19
Beta 14
Pre-Alpha 6
Planning 5
More...
Alpha 5

Showing 194 open source projects for "gpu processing"

View related business solutions

Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
1

Halide

A language for fast, portable data-parallel computation

Halide is a programming language for fast, portable data-parallel computation. It was designed to make writing high-performance image and array processing code much easier on modern machines. It works on all major operating systems and with several CPU architectures (X86, ARM, MIPS, Hexagon, PowerPC) and GPU Compute APIs (CUDA, OpenCL, OpenGL, among others). It isn't a standalone programming language however; rather it is embedded in C++ which means that you write C++ code, building an in-memory representation of a Halide pipeline using Halide's C++ API. ...

Downloads: 2 This Week

Last Update: 2025-09-17
See Project
2

Handy STT

A free, open source, and extensible speech-to-text application

Handy is a free, open-source, offline speech-to-text application built for privacy, accessibility, and extensibility. Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active...

Downloads: 51 This Week

Last Update: 2026-04-27
See Project
3

LiveAvatar

Streaming Real-time Audio-Driven Avatar Generation

...It implements techniques from state-of-the-art diffusion-based avatar modeling to support infinite-length continuous video generation with low latency, enabling interactive AI avatars that maintain continuity and realism over extended sessions. The project co-designs algorithms and system optimizations, such as block-wise autoregressive processing and fast sampling strategies, to deliver real-time frame rates (e.g., ~45 FPS on appropriate GPU clusters) while handling non-stop generation without quality degradation. LiveAvatar focuses on delivering not just high-quality visuals but also the responsiveness necessary for immersive conversational experiences, making it suitable for advanced AI agents, virtual assistants, and interactive streaming contexts.

Downloads: 0 This Week

Last Update: 2026-04-08
See Project
4

TensorRT Node for ComfyUI

Enables the best performance on NVIDIA RTX Graphics Cards

...It bridges the gap between ComfyUI’s flexible, node-based workflows and TensorRT’s highly optimized engine format. The result is that complex diffusion or image-processing graphs can be accelerated without the user having to rewrite the pipeline. The repo typically includes instructions for converting models to TensorRT engines and for wiring those engines into ComfyUI nodes. This is particularly attractive for power users who run many generations or who host ComfyUI on dedicated hardware and want to squeeze out every bit of GPU performance. ...

Downloads: 0 This Week

Last Update: 2025-10-30
See Project
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
5

Anime4KCPP

A high performance anime upscaler

Anime4KCPP provides an optimized bloc97's Anime4K algorithm version 0.9, and it also provides its own CNN algorithm ACNet, it provides a variety of way to use, including preprocessing and real-time playback, it aims to be a high-performance tool to process both image and video. This project is for learning and the exploration task of the algorithm course in SWJTU. Anime4K is a simple high-quality anime upscale algorithm. Version 0.9 does not use any machine learning approaches and can be...

Downloads: 41 This Week

Last Update: 2 days ago
See Project
6

Depth Pro

Sharp Monocular Metric Depth in Less Than a Second

Depth Pro is a foundation model for zero-shot metric monocular depth estimation, producing sharp, high-frequency depth maps with absolute scale from a single image. Unlike many prior approaches, it does not require camera intrinsics or extra metadata, yet still outputs metric depth suitable for downstream 3D tasks. Apple highlights both accuracy and speed: the model can synthesize a ~2.25-megapixel depth map in around 0.3 seconds on a standard GPU, enabling near real-time applications. The...

Downloads: 1 This Week

Last Update: 2025-10-08
See Project
7

Nuclio

High-Performance Serverless event and data processing platform

Nuclio is an open source and managed serverless platform used to minimize development and maintenance overhead and automate the deployment of data-science-based applications. Real-time performance running up to 400,000 function invocations per second. Portable across low laptops, edge, on-prem and multi-cloud deployments. The first serverless platform supporting GPUs for optimized utilization and sharing. Automated deployment to production in a few clicks from Jupyter notebook. Deploy one of...

Downloads: 5 This Week

Last Update: 2026-04-16
See Project
8

PostgresML

The GPU-powered AI application database

PostgresML is a complete platform in a PostgreSQL extension. Build simpler, faster, and more scalable models right inside your database. Explore the SDK and test open source models in our hosted database. Combine and automate the entire workflow from embedding generation to indexing and querying for the simplest (and fastest) knowledge-based chatbot implementation. Leverage multiple types of natural language processing and machine learning models such as vector search and personalization...

Downloads: 4 This Week

Last Update: 2025-01-16
See Project
9

VisPy

Main repository for Vispy

Vispy is an open-source, high-performance interactive visualization library in Python, designed for creating scientific visualizations and interactive plots. It leverages the power of modern Graphics Processing Units (GPUs) through OpenGL to render large datasets efficiently. Vispy supports a wide range of visualization types, including 2D plots, 3D visualizations, volume rendering, and more, making it suitable for scientific research, data analysis, and educational purposes.

Downloads: 0 This Week

Last Update: 2026-01-07
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

Open-LLM-VTuber

Open source AI VTuber platform with voice chat and Live2D avatars

Open-LLM-VTuber is an open source platform designed to create AI-powered VTuber characters that can interact with users through voice and animated avatars. It enables hands-free conversations with large language models by combining speech recognition, language processing, and text-to-speech synthesis into a single system. Users can speak directly to the AI character, and the system can respond with a generated voice while animating a Live2D avatar to simulate a talking virtual personality....

Downloads: 26 This Week

Last Update: 2026-03-17
See Project
11

CUDA-QX

Accelerated libraries for quantum-classical computing built on CUDA-Q

CUDA-QX is a collection of accelerated libraries built on top of the CUDA-Q platform, designed to enable rapid development of hybrid quantum-classical applications. It extends the CUDA-Q programming model by providing optimized implementations of domain-specific quantum computing primitives and workflows. The libraries are intended to help researchers and developers leverage GPUs, CPUs, and quantum processing units together in a unified computational model. CUDA-QX focuses on key areas such...

Downloads: 5 This Week

Last Update: 2026-04-10
See Project
12

OGRE-Next 3D

aka ogre v2 - scene-oriented, flexible 3D C++ engine

OGRE-Next is the next-generation iteration of the OGRE (Object-Oriented Graphics Rendering Engine), a powerful open-source 3D rendering engine designed for real-time applications, games, simulations, and visualizations. It focuses on high-performance rendering pipelines, especially Vulkan and modern OpenGL, offering tools for photorealistic and stylized rendering. OGRE-Next is modular and flexible, providing a developer-friendly environment with scene management, lighting, shadowing, and...

Downloads: 5 This Week

Last Update: 2025-03-21
See Project
13

Qwen

The official repo of Qwen chat & pretrained large language model

Qwen is a series of large language models developed by Alibaba Cloud, consisting of various pretrained versions like Qwen-1.8B, Qwen-7B, Qwen-14B, and Qwen-72B. These models, which range from smaller to larger configurations, are designed for a wide range of natural language processing tasks. They are openly available for research and commercial use, with Qwen's code and model weights shared on GitHub. Qwen's capabilities include text generation, comprehension, and conversation, making it a...

1 Review

Downloads: 16 This Week

Last Update: 2026-03-05
See Project
14

rspirv

Rust implementation of SPIR-V module processing functionalities

rspirv is a Rust-based parser, builder, and disassembler for SPIR-V, the intermediate binary format used in Vulkan and OpenCL for shaders and compute kernels. It’s part of the gfx-rs ecosystem, a suite of graphics tools aiming to provide cross-platform rendering capabilities in Rust. rspirv enables manipulation and inspection of SPIR-V modules, which is useful in shader compilers, graphics drivers, and development tools for low-level GPU programming. The library strictly follows the SPIR-V...

Downloads: 2 This Week

Last Update: 2026-03-13
See Project
15

ApraPipes

A pipeline framework for developing video and image processing apps

ApraPipes is a C++ multimedia processing framework designed for building high-performance video/audio processing pipelines with GPU acceleration. It provides a modular, declarative architecture for creating complex media processing workflows that span camera capture, encoding/decoding, computer vision, AI operations, and output to files, streams, or displays.

Downloads: 0 This Week

Last Update: 4 hours ago
See Project
16

PennyLane

A cross-platform Python library for differentiable programming

...Train a quantum computer the same way as a neural network. Built-in automatic differentiation of quantum circuits, using the near-term quantum devices directly. You can combine multiple quantum devices with classical processing arbitrarily! Support for hybrid quantum and classical models, and compatible with existing machine learning libraries. Quantum circuits can be set up to interface with either NumPy, PyTorch, JAX, or TensorFlow, allowing hybrid CPU-GPU-QPU computations. The same quantum circuit model can be run on different devices. Install plugins to run your computational circuits on more devices, including Strawberry Fields, Amazon Braket, Qiskit and IBM Q, Google Cirq, Rigetti Forest, and the Microsoft QDK.

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
17

Ray

A unified framework for scalable computing

Modern workloads like deep learning and hyperparameter tuning are compute-intensive and require distributed or parallel execution. Ray makes it effortless to parallelize single machine code — go from a single CPU to multi-core, multi-GPU or multi-node with minimal code changes. Accelerate your PyTorch and Tensorflow workload with a more resource-efficient and flexible distributed execution framework powered by Ray. Accelerate your hyperparameter search workloads with Ray Tune. Find the best...

Downloads: 2 This Week

Last Update: 2026-04-19
See Project
18

HunyuanDiT

Diffusion Transformer with Fine-Grained Chinese Understanding

HunyuanDiT is a high-capability text-to-image diffusion transformer with bilingual (Chinese/English) understanding and multi-turn dialogue capability. It trains a diffusion model in latent space using a transformer backbone and integrates a Multimodal Large Language Model (MLLM) to refine captions and support conversational image generation. It supports adapters like ControlNet, IP-Adapter, LoRA, and can run under constrained VRAM via distillation versions. LoRA, ControlNet (pose, depth,...

Downloads: 0 This Week

Last Update: 2025-11-27
See Project
19

IMS Toucan

Controllable and fast Text-to-Speech for over 7000 languages

...IMS-Toucan ships with several ready-to-run scripts, including GUIs for interactive demos, prosody override tools, zero-shot language embedding injection, and text-to-audio file generation. Pretrained models are automatically downloaded when needed, and there is an online demo instance hosted on GPU that anyone can try.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
20

PyTorch Geometric Temporal

Spatiotemporal Signal Processing with Neural Machine Learning Models

The library consists of various dynamic and temporal geometric deep learning, embedding, and Spatio-temporal regression methods from a variety of published research papers. Moreover, it comes with an easy-to-use dataset loader, train-test splitter and temporal snaphot iterator for dynamic and temporal graphs. The framework naturally provides GPU support. It also comes with a number of benchmark datasets from the epidemiological forecasting, sharing economy, energy production and web traffic...

Downloads: 0 This Week

Last Update: 2025-03-28
See Project
21

fastai

Deep learning library

fastai is a deep learning library which provides practitioners with high-level components that can quickly and easily provide state-of-the-art results in standard deep learning domains, and provides researchers with low-level components that can be mixed and matched to build new approaches. It aims to do both things without substantial compromises in ease of use, flexibility, or performance. This is possible thanks to a carefully layered architecture, which expresses common underlying...

Downloads: 0 This Week

Last Update: 2026-02-14
See Project
22

MegEngine

Easy-to-use deep learning framework with 3 key features

MegEngine is a fast, scalable and easy-to-use deep learning framework with 3 key features. You can represent quantization/dynamic shape/image pre-processing and even derivation in one model. After training, just put everything into your model and inference it on any platform at ease. Speed and precision problems won't bother you anymore due to the same core inside. In training, GPU memory usage could go down to one-third at the cost of only one additional line, which enables the DTR algorithm. ...

Downloads: 3 This Week

Last Update: 2024-04-30
See Project
23

Waifu2x-Extension-GUI

Photo/Video/GIF enlargement using machine learning

Image & GIF & Video Super-Resolution using Deep Convolutional Neural Networks. Built-in image processing algorithm: Waifu2x / SRMD / RealSR / Anime4K / ACNet Built-in image processing engine: Waifu2x-caffe / Waifu2x-converter / Waifu2x-ncnn-vulkan / SRMD-ncnn-vulkan / RealSR-ncnn-vulkan / Anime4KCPP Github: https://github.com/AaronFeng753/Waifu2x-Extension-GUI

Downloads: 717 This Week

Last Update: 2026-05-02
See Project
24

VCClient

Software that uses AI to perform real-time voice conversion

VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...

Downloads: 25 This Week

Last Update: 2026-03-23
See Project
25

GrOWin

Gromacs on Windows

...Cross-Platform Compatibility: Growin extends the reach of GROMACS by introducing a dedicated Windows version, allowing users on this platform to harness the power of GROMACS for their MD simulations. 2. Optimized Performance: Experience enhanced performance with Growin's dedicated CPU and GPU versions of the software. Whether you're utilizing the raw processing power of your CPU or leveraging the parallel computing capabilities of your GPU. 3. User-Friendly Interface: A simple command line interface on Windows for seamless navigation and efficient utilization of GROMACS functionalities. Discover the next level of MD simulations with Growin opening new possibilities

10 Reviews

Downloads: 23 This Week

Last Update: 2025-10-07
See Project

Previous
1
2
You're on page 3
4
5
6
7
8
Next

Related Searches

speech

handy

transcribe audio to srt

depth map creator

waifu2x gui

cuda

transcribe

speech to text

anime4kcpp

qwen

Related Categories

Artificial Intelligence

Multimedia

Software Development

Scientific/Engineering

Business

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise