Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Machine Learning Software
Search Results

Search Results for "throughput"

x

Sort By:

Relevance

Clear All Filters

OS

Windows 14
Linux 13
Mac 12
More...
BSD 3
ChromeOS 3

Category

Artificial Intelligence 15
Software Development 5
Scientific/Engineering 3
Business 2
Blockchain 1
Internet 1
System 1

License

OSI-Approved Open Source 14

Programming Language

C++ 4
Python 4
Java 2
MATLAB 1
More...
R 1
Rust 1
Scala 1

Status

Planning 1
Beta 1

Showing 15 open source projects for "throughput"

View related business solutions

Machine Learning Clear Filters & Widen Search

Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

FlexLLMGen

Running large language models on a single GPU

...This design allows organizations to deploy powerful language models for high-volume tasks without the infrastructure costs typically associated with large-scale AI systems. The project is particularly useful for workloads that prioritize throughput over latency, including benchmarking experiments and large corpus analysis.

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
2

TensorRT

C++ library for high performance inference on NVIDIA GPUs

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference. It includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for deep learning inference applications. TensorRT-based applications perform up to 40X faster than CPU-only platforms during inference. With TensorRT, you can optimize neural network models trained in all major frameworks, calibrate for lower precision with high accuracy, and deploy to hyperscale data centers, embedded, or automotive product platforms. ...

Downloads: 19 This Week

Last Update: 2026-03-25
See Project
3

FlashAttention

Fast and memory-efficient exact attention

...The project provides implementations of FlashAttention, FlashAttention-2, and newer iterations optimized for modern GPU architectures such as NVIDIA Hopper and AMD accelerators. By improving both forward and backward pass efficiency, it enables training and inference of large language models with longer sequence lengths and higher throughput. The library integrates with PyTorch and supports various attention configurations, including causal masking, multi-query attention, and rotary embeddings.

Downloads: 69 This Week

Last Update: 2026-03-18
See Project
4

OpenMLDB

OpenMLDB is an open-source machine learning database

...However, a feature engineering script developed by data scientists (Python scripts in most cases) cannot be directly deployed into production for online inference because it usually cannot meet the engineering requirements, such as low latency, high throughput and high availability.

Downloads: 0 This Week

Last Update: 2025-02-21
See Project
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
5

DALI

A GPU-accelerated library containing highly optimized building blocks

...DALI addresses the problem of the CPU bottleneck by offloading data preprocessing to the GPU. Additionally, DALI relies on its own execution engine, built to maximize the throughput of the input pipeline.

Downloads: 0 This Week

Last Update: 2026-04-16
See Project
6

Synapse Machine Learning

Simple and distributed Machine Learning

...With the HTTP on Spark project, users can embed any web service into their SparkML models. For production-grade deployment, the Spark Serving project enables high throughput, sub-millisecond latency web services, backed by your Spark cluster.

Downloads: 0 This Week

Last Update: 2026-04-04
See Project
7

OnnxStream

Lightweight inference library for ONNX files, written in C++

...The recommended minimum RAM/VRAM for Stable Diffusion 1.5 is typically 8GB. Generally, major machine learning frameworks and libraries are focused on minimizing inference latency and/or maximizing throughput, all of which at the cost of RAM usage. So I decided to write a super small and hackable inference library specifically focused on minimizing memory consumption: OnnxStream. OnnxStream is based on the idea of decoupling the inference engine from the component responsible for providing the model weights, which is a class derived from WeightsProvider. ...

Downloads: 24 This Week

Last Update: 2024-08-14
See Project
8

FFCV

Fast Forward Computer Vision (and other ML workloads!)

ffcv is a drop-in data loading system that dramatically increases data throughput in model training. From gridding to benchmarking to fast research iteration, there are many reasons to want faster model training. Below we present premade codebases for training on ImageNet and CIFAR, including both (a) extensible codebases and (b) numerous premade training configurations.

Downloads: 0 This Week

Last Update: 2024-08-07
See Project
9

OmicSelector

Feature selection and deep learning modeling for omic biomarker study

OmicSelector is an environment, Docker-based web application, and R package for biomarker signature selection (feature selection) from high-throughput experiments and others. It was initially developed for miRNA-seq (small RNA, smRNA-seq; hence the name was miRNAselector), RNA-seq and qPCR, but can be applied for every problem where numeric features should be selected to counteract overfitting of the models. Using our tool, you can choose features, like miRNAs, with the most significant diagnostic potential (based on the results of miRNA-seq, for validation in qPCR experiments).

1 Review

Downloads: 0 This Week

Last Update: 2024-04-05
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

hora

Efficient approximate nearest neighbor search algorithm collections

...The library is written in Rust and emphasizes performance, safety, and efficient memory management, making it suitable for production-grade applications requiring low latency and high throughput.

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
11

exchange-core

Ultra-fast matching engine written in Java based on LMAX Disruptor

Exchange-core is an open-source market exchange core based on LMAX Disruptor, Eclipse Collections (ex. Goldman Sachs GS Collections), Real Logic Agrona, OpenHFT Chronicle-Wire, LZ4 Java, and Adaptive Radix Trees. Designed for high scalability and pauseless 24/7 operation under high-load conditions and providing low-latency responses. Single order book configuration is capable to process 5M operations per second on 10-years old hardware (Intel® Xeon® X5690) with moderate latency degradation....

Downloads: 0 This Week

Last Update: 2022-04-15
See Project
12

X-DeepLearning

An industrial deep learning framework for high-dimension sparse data

...Complete streaming training features including feature admission, feature elimination, model incremental export, feature counting statistics, etc. Background: XDL1.0 focuses on throughput optimization and adopts the one request per thread processing model, which can significantly improve the limit throughput under ultra-high concurrency.

Downloads: 0 This Week

Last Update: 2022-02-02
See Project
13

Root Phenotyping Suite

Three different software tools for phenotyping plant root images

RootAnalyzer is a fully automated tool, for efficiently extracting and analyzing anatomical traits from root-cross section images. RootAnalyzer segments the plant root from the image's background, classifies and characterizes the cortex, stele, endodermis and metaxylem, and produces statistics about the morphological properties of the root cells and tissues. RTipC is a system for the fully automated detection and classification of root tips in root images obtained either by 2d flat bed...

Downloads: 0 This Week

Last Update: 2018-10-23
See Project
14

JAABA

The Janelia Automated Animal Behavior Annotator

...JAABA uses machine learning techniques to convert these manual labels into behavior detectors that can then be used to automatically classify the behaviors of animals in large data sets with high throughput. JAABA combines an intuitive graphical user interface, a fast and powerful machine learning algorithm, and visualizations of the classifier into an interactive, usable system for creating automatic behavior detectors. Documentation is available at: http://jaaba.sourceforge.net/

1 Review

Downloads: 6 This Week

Last Update: 2015-09-08
See Project
15

SOPF

Extensible framework that enables productive design, analysis, and execution of arbitrary-sized neural-networks or system on a distributed, scalable, high-throughput runtime platform. Enables synapse-oriented-programming.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-23
See Project

Previous
You're on page 1
Next

Related Searches

root

nvidia

machine learning

artificial intelligence stock market

offline artificial intelligence\

nvidia deepstream sdk

neural network

network performance

intel

crypto exchange

Related Categories

Artificial Intelligence

Software Development

Scientific/Engineering

Business

Blockchain

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise