Showing 350 open source projects for "throughput"

  • 1
    Torrra

    A Python tool that lets you search and download torrents

    ...Users can add, pause, prioritize, and remove torrents dynamically, while the client intelligently manages piece selection, peer discovery, and bandwidth allocation to optimize download throughput. Torrra also includes hooks for event monitoring and status reporting, enabling third-party tools to visualize torrent progress or integrate with home automation dashboards.
    Downloads: 6 This Week
  • 2
    Cortex

    A horizontally scalable, multi-tenant, long term Prometheus

    ...Cortex gives you a global view of Prometheus time series data that includes data in long-term storage, greatly expanding the usefulness of PromQL for analytical purposes. Cortex runs across multiple machines in a cluster, exceeding the throughput and storage of a single machine. This enables you to send the metrics from multiple Prometheus servers to a single Cortex cluster.
    Downloads: 1 This Week
  • 3
    Pingora

    A library for building fast, reliable and evolvable network services

    Pingora is a Rust-based, high-performance HTTP proxy and networking framework that Cloudflare developed to replace NGINX in its own infrastructure. Designed for reliability and memory-safe concurrency, it handles over 40 million requests per second in production and significantly reduces CPU and memory usage compared to traditional web servers.
    Downloads: 0 This Week
  • 4
    Fuel Core

    Rust full node implementation of the Fuel v2 protocol

    Fuel-Core is the Rust implementation of a full Fuel blockchain node, supporting chain execution, the mempool, consensus, a GraphQL API, and state management. It powers the Fuel network and lets developers run local or production nodes with high throughput and low latency.
    Downloads: 0 This Week
  • 5
    Fluentd

    Fluentd: Unified Logging Layer (project under CNCF)

    Fluentd is a CNCF-graduated open-source data collector that unifies log collection and consumption across diverse systems. It offers reliable buffering, an extensible plugin architecture, and real-time log routing, and serves as a unified logging layer for processing structured and unstructured data.
    Downloads: 0 This Week
  • 6
    SPQR

    Stateless Postgres Query Router

    SPQR is a sharding proxy for PostgreSQL that enables horizontal scalability by distributing queries across multiple PostgreSQL instances. Designed for large-scale, multi-tenant architectures, SPQR routes, rewrites, and manages SQL queries to ensure they are executed efficiently across shards. It provides a PostgreSQL-compatible interface so applications can scale transparently without code changes.
    Downloads: 0 This Week
  • 7
    Text Generation Inference

    Large Language Model Text Generation Inference

    Text Generation Inference is a high-performance inference server for text generation models, optimized for Hugging Face's Transformers. It is designed to serve large language models efficiently with optimizations for performance and scalability.
    Downloads: 0 This Week
  • 8
    kallisto

    Near-optimal RNA-Seq quantification

    kallisto is a program for near-optimal quantification of transcript abundances from RNA-Seq data, or more generally of target sequences using high-throughput sequencing reads. It is based on pseudoalignment, which quickly determines the compatibility of reads with targets without the need for alignment. According to benchmarks run on a Mac desktop computer, kallisto can quantify 30 million human bulk RNA-seq reads in less than 3 minutes using only the read sequences and a transcriptome index, which itself can take more than 10 minutes to build. ...
    Downloads: 3 This Week
  • 9
    AIClient-2-API

    Simulates Gemini CLI, Antigravity, Qwen Code, and Kiro client

    ...It enables developers to connect various AI tools and clients, such as Gemini CLI, Qwen Code, or other proprietary interfaces, into a unified backend without needing to rewrite integrations. The system is designed to support high-throughput usage, handling large volumes of requests while maintaining compatibility across different AI ecosystems. It also provides access to alternative models, including built-in support for certain Claude-based functionalities, expanding flexibility for developers working with multiple providers. By abstracting away client-specific differences, it simplifies AI application development and reduces integration overhead.
    Downloads: 8 This Week
  • 10
    Tencent-Hunyuan-Large

    Open-source large language model family from Tencent Hunyuan

    Tencent-Hunyuan-Large is the flagship open-source large language model family from Tencent Hunyuan, offering both pre-trained and instruct (fine-tuned) variants. It is designed with long-context capabilities, quantization support, and high performance on benchmarks across general reasoning, mathematics, language understanding, and Chinese / multilingual tasks. It aims to provide competitive capability with efficient deployment and inference. FP8 quantization support to reduce memory usage...
    Downloads: 7 This Week
  • 11
    DFlash

    Block Diffusion for Ultra-Fast Speculative Decoding

    DFlash is an open-source framework for ultra-fast speculative decoding using a lightweight block diffusion model to draft text in parallel with a target large language model, dramatically improving inference speed without sacrificing generation quality. It acts as a “drafter” that proposes likely continuations which the main model then verifies, enabling significant throughput gains compared to traditional autoregressive decoding methods that generate token by token. This approach has been shown to deliver lossless acceleration on models like Qwen3-8B by combining block diffusion techniques with efficient batching, making it ideal for applications where latency and cost matter. The project includes support for multiple draft models, example integration code, and scripts to benchmark performance, and it is structured to work with popular model serving stacks like SGLang and the Hugging Face Transformers ecosystem.
    Downloads: 4 This Week
  • 12
    Decred

    Decred daemon in Go (golang)

    dcrd is the full-node implementation of Decred, a hybrid proof-of-work/proof-of-stake cryptocurrency focused on decentralized governance, long-term sustainability, and robust network security. Written in Go, dcrd provides all the core consensus, networking, and validation logic necessary for Decred nodes to participate in the network. It plays a critical role in enforcing consensus rules, managing peer connections, and enabling advanced features like on-chain voting and self-funding via a...
    Downloads: 0 This Week
  • 13
    MagicAPI AI Gateway

    Built for demanding AI workflows

    The world's fastest AI Gateway proxy, written in Rust and optimized for maximum performance. This high-performance API gateway routes requests to various AI providers (OpenAI, GROQ) with streaming support, making it perfect for developers who need reliable and blazing-fast AI API access.
    Downloads: 0 This Week
  • 14
    HyperExpress

    High performance Node.js webserver with a simple-to-use API

    HyperExpress aims to be a simple yet performant HTTP and WebSocket server. It is built on uWebSockets.js, a Node.js binding of uSockets written in C++, which lets developers get higher throughput from their web applications on existing hardware. Many web applications can therefore serve data from optimized endpoints much faster without scaling hardware.
    Downloads: 0 This Week
  • 15
    fluentbit

    Fast and Lightweight Logs and Metrics processor for Linux, BSD, OSX

    Fluent Bit is a super-fast, lightweight, and highly scalable logging and metrics processor and forwarder. It is the preferred choice for cloud and containerized environments. A robust, lightweight, and portable architecture for high throughput with low CPU and memory usage from any data source to any destination. Proven across distributed cloud and container environments. Highly available with I/O handlers to store data for disaster recovery. Granular management of data parsing and routing. Filtering and enrichment to optimize security and minimize cost. The lightweight, asynchronous design optimizes resource usage: CPU, memory, disk I/O, network. ...
    Downloads: 4 This Week
  • 16
    Async MySQL Connector .NET and .NET Core

    Async MySQL Connector for .NET and .NET Core

    ...This library implements true asynchronous I/O for database operations, without blocking (or using Task.Run to run synchronous methods on a background thread). This greatly improves the throughput of a web server that performs database operations. This library outperforms MySQL Connector/NET (MySql.Data) on benchmarks. This library is MIT-licensed and may be freely distributed with commercial software. Commercial software that uses Connector/NET may have to purchase a commercial license from Oracle. Fixes dozens of open bugs in Oracle’s Connector/NET; passes all ADO.NET Specification Tests. ...
    Downloads: 1 This Week
  • 17
    GuitarPedal

    Linus learns analog circuits

    GuitarPedal is an experimental repository exploring a digital guitar-effects signal chain implemented with lean, low-level code. The project demonstrates how to read audio input, process it through simple transformations, and write the result out in real time with minimal latency. It emphasizes straightforward, inspectable DSP so developers can follow the math and tweak parameters without a giant framework in the way. The codebase favors portability and simplicity, focusing on a handful of...
    Downloads: 2 This Week
  • 18
    EnvPool

    C++-based high-performance parallel environment execution engine

    EnvPool is a fast, asynchronous, parallel RL environment library designed for scaling reinforcement learning experiments. Developed by Sea AI Lab (SAIL) in Singapore, it pairs a C++ backend with a Python frontend for high-speed environment interaction, supporting thousands of environments running in parallel on a single machine. It is compatible with the Gymnasium API and RLlib, making it suitable for scalable training pipelines.
    Downloads: 0 This Week
  • 19
    Dragonfly

    A modern replacement for Redis and Memcached

    ...Designed to fully utilize the power of modern cloud hardware and deliver on the data demands of modern applications, Dragonfly frees developers from the limits of traditional in-memory data stores. The power of modern cloud hardware can never be realized with legacy software. Dragonfly is optimized for modern cloud computing, delivering 25x more throughput and 12x lower snapshotting latency when compared to legacy in-memory data stores like Redis, making it easy to deliver the real-time experience your customers expect. Scaling Redis workloads is expensive due to their inefficient, single-threaded model. Dragonfly is far more compute and memory efficient, resulting in up to 80% lower infrastructure costs. ...
    Downloads: 3 This Week
  • 20
    Supertonic

    Lightning-fast, on-device TTS, running natively via ONNX

    ...The core model is highly compact at around 66 million parameters, yet benchmarks show it can generate speech up to 167× faster than real time on modern consumer hardware and significantly outpace popular cloud TTS APIs in throughput and real-time factor. Supertonic is designed to handle real-world text gracefully, including numbers, dates, currency symbols, abbreviations, and technical units, without requiring heavy pre-processing or custom text normalization. The repository provides complete reference implementations across many programming ecosystems—Python, Node.js, browser (WebGPU/WASM), Java, C++, C#, Go, Swift, iOS, Rust, and Flutter.
    Downloads: 6 This Week
  • 21
    Modular Platform

    The Modular Platform (includes MAX & Mojo)

    ...It is closely associated with the Mojo programming language and related tooling that aims to combine Python usability with systems-level performance. Modular’s ecosystem is designed to simplify deployment of AI workloads across heterogeneous hardware while maximizing throughput. The repository reflects an effort to modernize the AI development pipeline from compilation to runtime execution. Overall, Modular represents an ambitious attempt to unify performance engineering and developer ergonomics for large-scale AI systems.
    Downloads: 0 This Week
  • 22
    Orpheus TTS

    Towards Human-Sounding Speech

    ...The project ships both pretrained and finetuned English models, as well as a family of multilingual models released as a research preview, and includes data-processing scripts so users can train or finetune their own variants. Inference is provided through a Python package that uses vLLM under the hood for high-throughput, low-latency generation, including streaming examples that show how to generate audio chunks in real time. The maintainers provide Colab notebooks, a standardized prompting format, and one-click deployment via Baseten for production-grade, FP8/FP16 optimized inference with ~200 ms streaming latency.
    Downloads: 5 This Week
  • 23
    Qwen-Agent

    Agent framework and applications built upon Qwen>=3.0

    Qwen-Agent is a framework for building applications and agents on Qwen models (version 3.0+). It provides components for instruction following, tool use (function calling), planning, memory, RAG (retrieval-augmented generation), and a code interpreter, among others. It ships with example applications (Browser Assistant, Code Interpreter, Custom Assistant) and supports GUI front-ends, backends, and server setups. Agent workflows can maintain context and memory to carry out multi-turn or more complex logic over time....
    Downloads: 5 This Week
  • 24
    Postgres Operator

    Postgres operator creates and manages PostgreSQL clusters

    The Postgres Operator delivers easy-to-run, highly available PostgreSQL clusters on Kubernetes (K8s), powered by Patroni. It is configured only through Postgres manifests (CRDs), which eases integration into automated CI/CD pipelines that have no direct access to the Kubernetes API and promotes infrastructure as code over manual operations.
    Downloads: 0 This Week
  • 25
    TensorRT LLM

    TensorRT LLM provides users with an easy-to-use Python API

    ...It provides a Python-based API built on top of PyTorch that allows developers to define, customize, and deploy LLMs efficiently across a variety of hardware configurations, from single GPUs to large multi-node clusters. The library focuses on maximizing throughput and minimizing latency through advanced techniques such as quantization, custom attention kernels, and optimized memory management strategies. It includes support for cutting-edge inference methods like speculative decoding and inflight batching, enabling real-time and large-scale AI applications. TensorRT-LLM integrates seamlessly with NVIDIA’s broader inference ecosystem, including Triton Inference Server and distributed deployment frameworks, making it suitable for production environments.
    Downloads: 1 This Week