Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence Software
Search Results

Search Results for "gpu max performance" - Page 4

x

Sort By:

Relevance

Clear All Filters

OS

Linux 90
Windows 88
Mac 84
More...
ChromeOS 35
BSD 34
Mobile Operating Systems 3

Category

Artificial Intelligence 100
Software Development 9
Multimedia 3
System 2
Games 1
Scientific/Engineering 1

License

OSI-Approved Open Source 89
Creative Commons Attribution License 1

Translations

English 5
Bengali 1

Programming Language

Python 100
C++ 5
Unix Shell 4
JavaScript 2
Go 1
More...
Java 1
Julia 1
Rust 1

Status

Production/Stable 3
Beta 1

Showing 100 open source projects for "gpu max performance"

View related business solutions

Artificial Intelligence Python Clear Filters & Widen Search

$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
Earn up to 16% annual interest with Nexo.
Let your crypto work for you

Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.

Get started with Nexo.
1

PC_Workman_HCK

AI-powered PC monitoring that explains. Not shows numbers/spikes.

PC_Workman is what 680 hours of coding after warehouse shifts looks like. Built on a laptop hitting 94°C, this AI-powered monitoring tool does what Task Manager can't: it understands your system, not just measures it. Features: - Time travel monitoring - debug issues from hours ago - AI diagnostics with HCK_GPT - Custom fan curves with profiles - Floating always-on-top widget - 2D system map - Cross-GPU support (NVIDIA/AMD/Intel) Four complete rebuilds. 29 features killed....

3 Reviews

Downloads: 6 This Week

Last Update: 2026-04-27
See Project
2

Xfl

An Efficient and Easy-to-use Federated Learning Framework

XFL is a lightweight, high-performance federated learning framework supporting both horizontal and vertical FL. It integrates homomorphic encryption, DP, secure MPC, and optimizes network resilience. Compatible with major ML libraries and deployable via Docker or Conda.

Downloads: 0 This Week

Last Update: 2025-07-15
See Project
3

SSD in PyTorch 1.0

High quality, fast, modular reference implementation of SSD in PyTorch

This repository implements SSD (Single Shot MultiBox Detector). The implementation is heavily influenced by the projects ssd.pytorch, pytorch-ssd and maskrcnn-benchmark. This repository aims to be the code base for research based on SSD. Multi-GPU training and inference: We use DistributedDataParallel, you can train or test with arbitrary GPU(s), the training schema will change accordingly. Add your own modules without pain. We abstract backbone, Detector, BoxHead, BoxPredictor, etc. You can...

Downloads: 0 This Week

Last Update: 2024-01-13
See Project
4

Aviary

Ray Aviary - evaluate multiple LLMs easily

Aviary is an LLM serving solution that makes it easy to deploy and manage a variety of open source LLMs. Providing an extensive suite of pre-configured open source LLMs, with defaults that work out of the box. Supporting Transformer models hosted on Hugging Face Hub or present on local disk. Aviary has native support for autoscaling and multi-node deployments thanks to Ray and Ray Serve. Aviary can scale to zero and create new model replicas (each composed of multiple GPU workers) in...

Downloads: 0 This Week

Last Update: 2024-01-18
See Project
Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
5

Chinese-LLaMA-Alpaca 2

Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project

This project is developed based on the commercially available large model Llama-2 released by Meta. It is the second phase of the Chinese LLaMA&Alpaca large model project. The Chinese LLaMA-2 base model and the Alpaca-2 instruction fine-tuning large model are open-sourced. These models expand and optimize the Chinese vocabulary on the basis of the original Llama-2, use large-scale Chinese data for incremental pre-training, and further improve the basic semantics and command understanding of...

Downloads: 0 This Week

Last Update: 2024-01-23
See Project
6

pipeless

A computer vision framework to create and deploy apps in minutes

...You can easily use industry-standard models, such as YOLO, or load your custom model in one of the supported inference runtimes. Pipeless ships some of the most popular inference runtimes, such as the ONNX Runtime, allowing you to run inference with high performance on CPU or GPU out-of-the-box. You can deploy your Pipeless application with a single command to edge and IoT devices or the cloud.

Downloads: 0 This Week

Last Update: 2024-02-23
See Project
7

FEDML Open Source

The unified and scalable ML library for large-scale training

...Highly integrated with TensorOpera open source library, TensorOpera AI provides holistic support of three interconnected AI infrastructure layers: user-friendly MLOps, a well-managed scheduler, and high-performance ML libraries for running any AI jobs across GPU Clouds. A typical workflow is shown in the figure above. When a developer wants to run a pre-built job in Studio or Job Store, TensorOperaLaunch swiftly pairs AI jobs with the most economical GPU resources, and auto-provisions, and effortlessly runs the job, eliminating complex environment setup and management.

Downloads: 0 This Week

Last Update: 2024-08-05
See Project
8

ReplitLM

Inference code and configs for the ReplitLM model family

...Developers can fine-tune the models using instruction-tuning techniques to adapt them for specific programming tasks or domains. The models were trained using modern deep learning techniques and large-scale GPU infrastructure to achieve strong performance in code completion and generation tasks.

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
9

Chinese-LLaMA-Alpaca-2 v2.0

Chinese LLaMA & Alpaca large language model + local CPU/GPU training

This project has open-sourced the Chinese LLaMA model and the Alpaca large model with instruction fine-tuning to further promote the open research of large models in the Chinese NLP community. Based on the original LLaMA , these models expand the Chinese vocabulary and use Chinese data for secondary pre-training, which further improves the basic semantic understanding of Chinese. At the same time, the Chinese Alpaca model further uses Chinese instruction data for fine-tuning, which...

Downloads: 0 This Week

Last Update: 2023-08-21
See Project
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
10

OptiMate

Libraries for optimizing AI models, inference speed, and GPU usage

...One of the core components, Speedster, focuses on accelerating model inference by applying state of the art optimization techniques to increase performance while lowering operational costs. Another component, Nos, targets infrastructure optimization by improving GPU utilization in Kubernetes clusters through dynamic partitioning and elastic resource quotas.

Downloads: 2 This Week

Last Update: 7 hours ago
See Project
11

MLPACK C++ machine learning library

MLPACK is a C++ machine learning library with emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and flexibility for expert users. * More info + downloads: https://mlpack.org * Git repo: https://github.com/mlpack/mlpack

Downloads: 0 This Week

Last Update: 2023-06-28
See Project
12

ASRT Speech Recognition

A Deep-Learning-Based Chinese Speech Recognition System

ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.

Downloads: 0 This Week

Last Update: 2025-07-03
See Project
13

Big Sleep

A simple command line tool for text to image generation

...You can set the number of classes that you wish to restrict Big Sleep to use for the Big GAN with the --max-classes flag as follows (ex. 15 classes). This may lead to extra stability during training, at the cost of lost expressivity.

Downloads: 0 This Week

Last Update: 2022-08-09
See Project
14

MoCo v3

PyTorch implementation of MoCo v3

MoCo v3 is a PyTorch reimplementation of Momentum Contrast v3 (MoCo v3), Facebook Research’s state-of-the-art self-supervised learning framework for visual representation learning using ResNet and Vision Transformer (ViT) backbones. Originally developed in TensorFlow for TPUs, this version faithfully reproduces the paper’s results on GPUs while offering an accessible and scalable PyTorch interface. MoCo v3 introduces improvements for training self-supervised ViTs by combining contrastive...

Downloads: 0 This Week

Last Update: 5 days ago
See Project
15

DeepMosaics

Automatically remove the mosaics in images and videos, or add mosaics

...This project is based on "semantic segmentation" and "Image-to-Image Translation". You can either run DeepMosaics via a pre-built binary package, or from source. Run time depends on the computer's performance (GPU version has better performance but requires CUDA to be installed). Different pre-trained models are suitable for different effects.[Introduction to pre-trained models].

Downloads: 88 This Week

Last Update: 2022-08-17
See Project
16

YOLOv4-large

Scaled-YOLOv4: Scaling Cross Stage Partial Network

YOLOv4-large is an open-source implementation of the Scaled-YOLOv4 object detection architecture, designed to improve both the accuracy and scalability of real-time computer vision models. The project provides a PyTorch implementation of the Scaled-YOLOv4 framework, which extends the original YOLOv4 architecture using Cross Stage Partial (CSP) networks and new scaling techniques. Unlike earlier object detection systems that only scale depth or width, this architecture scales multiple aspects...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
17

TensorLayer

Deep learning and reinforcement learning library for scientists

TensorLayer is a novel TensorFlow-based deep learning and reinforcement learning library designed for researchers and engineers. It provides an extensive collection of customizable neural layers to build advanced AI models quickly, based on this, the community open-sourced mass tutorials and applications. TensorLayer is awarded the 2017 Best Open Source Software by the ACM Multimedia Society. This project can also be found at OpenI and Gitee. 3.0.0 has been pre-released, the current version...

Downloads: 0 This Week

Last Update: 2022-01-17
See Project
18

BytePS

A high performance and generic framework for distributed DNN training

BytePS is a high-performance and generally distributed training framework. It supports TensorFlow, Keras, PyTorch, and MXNet, and can run on either TCP or RDMA networks. BytePS outperforms existing open-sourced distributed training frameworks by a large margin. For example, on BERT-large training, BytePS can achieve ~90% scaling efficiency with 256 GPUs (see below), which is much higher than Horovod+NCCL.

Downloads: 0 This Week

Last Update: 2022-08-04
See Project
19

Bangla TTS

Bangla text to speech synthesis in python

Bangla text to speech Multilingual (Bangla, English) real-time ([almost] in a GPU) speech synthesis library. Installation -------------------------------------- * Install Anaconda * conda create -n new_virtual_env python==3.6.8 * conda activate new_virtual_env * pip install -r requirements.txt * While running for the first time, keep your internet connection on to download the weights of the speech synthesis models (>500 MB) * For...

Downloads: 1 This Week

Last Update: 2020-09-03
See Project
20

imgaug

Image augmentation for machine learning experiments

imgaug is a library for image augmentation in machine learning experiments. It supports a wide range of augmentation techniques, allows to easily combine these and to execute them in random order or on multiple CPU cores, has a simple yet powerful stochastic interface and can not only augment images but also key points/landmarks, bounding boxes, heatmaps and segmentation maps. Affine transformations, perspective transformations, contrast changes, gaussian noise, dropout of regions,...

Downloads: 0 This Week

Last Update: 2022-07-29
See Project
21

Torchreid

Deep learning person re-identification in PyTorch

Torchreid is a library for deep-learning person re-identification, written in PyTorch and developed for our ICCV’19 project, Omni-Scale Feature Learning for Person Re-Identification. In "deep-person-reid/scripts/", we provide a unified interface to train and test a model. See "scripts/main.py" and "scripts/default_config.py" for more details. The folder "configs/" contains some predefined configs which you can use as a starting point. The code will automatically (download and) load the...

Downloads: 1 This Week

Last Update: 2022-08-05
See Project
22

maskrcnn-benchmark

Fast, modular reference implementation of Instance Segmentation

...It supports multi-GPU distributed training, mixed precision, and custom data loaders for new datasets. Built as a reference implementation, it became a foundation for the next-generation Detectron2, yet remains widely used for research needing a stable, reproducible environment. Visualization tools, model zoo checkpoints, and benchmark scripts make it easy to replicate state-of-the-art results or fine-tune models for custom tasks.

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
23

Tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed

...On common CNNs, it runs training 1.2~5x faster than the equivalent Keras code. Your training can probably gets faster if written with Tensorpack. Scalable data-parallel multi-GPU / distributed training strategy is off-the-shelf to use. Squeeze the best data loading performance of Python with tensorpack.dataflow. Symbolic programming (e.g. tf.data) does not offer the data processing flexibility needed in research. Tensorpack squeezes the most performance out of pure Python with various auto parallelization strategies. ...

Downloads: 0 This Week

Last Update: 2022-08-01
See Project
24

DIGITS

Deep Learning GPU training system

The NVIDIA Deep Learning GPU Training System (DIGITS) puts the power of deep learning into the hands of engineers and data scientists. DIGITS can be used to rapidly train the highly accurate deep neural network (DNNs) for image classification, segmentation and object detection tasks. DIGITS simplifies common deep learning tasks such as managing data, designing and training neural networks on multi-GPU systems, monitoring performance in real-time with advanced visualizations, and selecting the best performing model from the results browser for deployment. ...

Downloads: 1 This Week

Last Update: 2022-01-31
See Project
25

Intel neon

Intel® Nervana™ reference deep learning framework

neon is Intel's reference deep learning framework committed to best performance on all hardware. Designed for ease of use and extensibility. See the new features in our latest release. We want to highlight that neon v2.0.0+ has been optimized for much better performance on CPUs by enabling Intel Math Kernel Library (MKL). The DNN (Deep Neural Networks) component of MKL that is used by neon is provided free of charge and downloaded automatically as part of the neon installation. ...

Downloads: 0 This Week

Last Update: 2022-02-16
See Project

Previous
1
2
3
You're on page 4
Next

Related Searches

pc workman

ssd health check

chinese

k nearest neighbor

image generator

video editor

deepmosaics

self-learning ai

bangla text to speech

image segmentation softwares

Related Categories

Artificial Intelligence

Software Development

Multimedia

System

Games

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise