Showing 350 open source projects for "throughput"

  • 1
    Ring

    Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI

    Ring is a reasoning Mixture-of-Experts (MoE) large language model (LLM) developed by inclusionAI. It is built from or derived from Ling. Its design emphasizes reasoning, efficiency, and modular expert activation. In its “flash” variant (Ring-flash-2.0), it optimizes inference by activating only a subset of experts. It applies reinforcement learning/reasoning optimization techniques. Its architectures and training approaches are tuned to enable efficient and capable reasoning performance....
    Downloads: 0 This Week
  • 2
    Linera Protocol

    Main repository for the Linera protocol

    ...The protocol supports dynamic reconfiguration: shards can split, merge, or reassign workloads while preserving correctness. By decoupling domain logic from global consensus, Linera can support modular upgrades, domain-specific smart contracts, and high throughput across partitions.
    Downloads: 0 This Week
  • 3
    Fastify

    Fast and low overhead web framework, for Node.js

    ...Fastify ships with thoughtful defaults like structured logging via Pino and AJV for schema validation, and it has first-class TypeScript types. The result is a server stack that scales to high throughput while remaining easy to reason about in production.
    Downloads: 0 This Week
  • 4
    Isaac ROS Visual SLAM

    Visual SLAM/odometry package based on NVIDIA-accelerated cuVSLAM

    Discover a faster, easier way to build advanced AI robotics applications with the NVIDIA Isaac™ ROS collection of accelerated computing packages and AI models, bringing NVIDIA acceleration to ROS developers everywhere. Isaac ROS Visual SLAM provides a high-performance, best-in-class ROS 2 package for VSLAM (visual simultaneous localization and mapping). This package uses one or more stereo cameras and optionally an IMU to estimate odometry as an input to navigation. It is GPU-accelerated to...
    Downloads: 0 This Week
  • 5
    ParallelStencil.jl

    Package for writing high-level code for parallel stencil computations

    ...For example, a 2-D shallow ice solver presented at JuliaCon 2020 [1] achieved nearly 20 times better performance than a corresponding GPU array programming implementation; in absolute terms, it reached 70% of the theoretical upper performance bound of the Nvidia P100 GPU used, as defined by the effective throughput metric, T_eff. ParallelStencil relies on the native kernel programming capabilities of CUDA.jl and AMDGPU.jl and on Base.Threads for high-performance computations on GPUs and CPUs, respectively. It is seamlessly interoperable with ImplicitGlobalGrid.jl, which handles the distributed parallelization of stencil-based GPU and CPU apps.
    Downloads: 0 This Week
  • 6
    Devtron

    Tool integration platform for Kubernetes

    ...Provides deployment metrics such as deployment frequency, lead time, change failure rate, and mean time to recovery. Integrates with Grafana to show continuous application metrics, including CPU and memory usage, status codes, throughput, and latency, on the dashboard.
    Downloads: 0 This Week
  • 7
    DALI

    A GPU-accelerated library containing highly optimized building blocks

    ...DALI addresses the problem of the CPU bottleneck by offloading data preprocessing to the GPU. Additionally, DALI relies on its own execution engine, built to maximize the throughput of the input pipeline.
    Downloads: 0 This Week
  • 8
    ElasticJob

    Distributed scheduled job framework

    ...It uses a unified job API for each project. Developers only need to write code once and can deploy at will. It supports job sharding and high availability in distributed systems. Scaling out improves throughput and efficiency, and job processing capacity grows flexibly with the resources allocated. Jobs execute at a suitable time on assigned resources, identical jobs are aggregated on the same job executor, and resources are appended to newly assigned jobs dynamically. With ElasticJob, developers no longer need to worry about non-functional requirements such as job scale-out, so they can focus on business code.
    Downloads: 0 This Week
  • 9
    Workerman

    An asynchronous, event-driven PHP socket framework

    ...It has many high-performance components, such as asynchronous MySQL, asynchronous Redis, asynchronous HTTP, an MQTT Internet of Things client, and an asynchronous message queue. Built on the high-performance epoll event network library, a single machine can support millions of concurrent connections, with a long-connection throughput of up to 360,000 per second (listed as 36W/S). PHP objects such as database connections can stay resident in memory, reducing parsing, compilation, and network overhead.
    Downloads: 0 This Week
  • 10
    qvac-fabric-llm.cpp

    QVAC Fabric: cross-platform LLM inference and fine-tuning

    qvac-fabric-llm.cpp is a cross-platform large language model inference and fine-tuning engine built as an advanced fork of llama.cpp, designed to run efficiently across desktops, mobile devices, and heterogeneous GPU environments. The project focuses on removing hardware limitations traditionally associated with LLM deployment by enabling support for a wide range of backends, including Vulkan, Metal, CUDA, and CPU, making it accessible on devices ranging from smartphones to enterprise...
    Downloads: 0 This Week
  • 11
    Mooncake

    Mooncake is the serving platform for Kimi

    ...Mooncake also introduces distributed key-value cache storage that allows inference systems to reuse previously computed attention states, significantly improving throughput in large-scale deployments. The system supports advanced networking technologies such as RDMA and NVMe over Fabric, enabling high-speed communication across clusters.
    Downloads: 0 This Week
  • 12
    XLS

    XLS: Accelerated HW Synthesis

    ...XLS also provides tooling for pipelining, state insertion, and formal equivalence checks between different stages, giving developers confidence as designs evolve. The result is a productive path from high-level specification to real hardware, without giving up control over latency, throughput, and area trade-offs.
    Downloads: 0 This Week
  • 13
    Netty

    An event-driven asynchronous network application framework

    ...It wraps Java NIO with a channel/handler pipeline model, enabling clean composition of codecs, TLS, and application logic with minimal overhead. Its ByteBuf abstractions, buffer pooling, and zero-copy techniques help reduce GC pressure and improve throughput under heavy I/O. Support for common protocols—HTTP/1.1, HTTP/2, WebSocket, and raw TCP/UDP—makes it a foundation for RPC frameworks, proxies, and messaging systems. Pluggable transports and native integrations (like epoll or kqueue where available) allow fine-tuning for different operating systems. With robust resource management, leak detection, and extensive testing, Netty is widely chosen when low latency, scalability, and control over the I/O stack are required.
    Downloads: 0 This Week
  • 14
    Lapis

    A web framework for Lua and OpenResty written in MoonScript

    ...Have a look at Web Framework Benchmarks to see how OpenResty stacks up against other platforms. Using Lua coroutines, you can write clean code that looks synchronous but achieves high throughput by running asynchronously without blocking. Networking operations such as database queries and HTTP requests yield automatically so that concurrent requests can be handled, without the callback spaghetti seen in other asynchronous platforms.
    Downloads: 0 This Week
  • 15
    RESTinio

    HTTP/WebSocket server C++14 library

    RESTinio is a header-only C++14 library that gives you an embedded HTTP/WebSocket server. It is based on the standalone version of ASIO and is targeted primarily at asynchronous processing of HTTP requests. Since v.0.4.1, Boost::ASIO (1.66 or higher) is also supported. Consider the task of writing a C++ application that must support some REST API: RESTinio is our solution for that task. Server runs on the main thread, and responds to all requests with a hello-world message. Of...
    Downloads: 0 This Week
  • 16
    Fluent Bit Plugin for CloudWatch Logs

    A Fluent Bit output plugin for CloudWatch Logs

    ...To access sub-values in the map use the form $(variable['subkey']). Also, it can be replaced with special values to insert the tag, ECS metadata, or a random string in the name. It can achieve higher throughput and will consume less CPU and memory.
    Downloads: 0 This Week
  • 17
    Puma

    A Ruby/Rack web server built for concurrency

    ...On MRI, there is a Global VM Lock (GVL) that ensures only one thread can run Ruby code at a time. But if you're doing a lot of blocking IO (such as HTTP calls to external APIs like Twitter), Puma still improves MRI's throughput by allowing IO waiting to be done in parallel.
    Downloads: 0 This Week
  • 18
    Apache brpc

    Industrial-grade RPC framework used throughout Baidu

    Apache brpc is an industrial-grade RPC framework for building reliable and high-performance services. Apache brpc (incubating) is an effort undergoing Incubation at The Apache Software Foundation (ASF), sponsored by the Incubator. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not...
    Downloads: 0 This Week
  • 19
    Matrix

    Multi-Agent daTa geneRation Infra and eXperimentation framework

    Matrix is a distributed, large-scale engine for multi-agent synthetic data generation and experiments: it provides the infrastructure to run thousands of “agentic” workflows concurrently (e.g. multiple LLMs interacting, reasoning, generating content, data-processing pipelines) by leveraging distributed computing (like Ray + cluster management). The idea is to treat data generation as a “data-to-data” transformation: each input item defines a task, and the runtime orchestrates asynchronous,...
    Downloads: 0 This Week
  • 20
    FerrumC

    A reimplementation of the Minecraft server in Rust

    ...By using Rust’s concurrency and safety guarantees, FerrumC runs fully multithreaded, allowing it to utilize all available CPU cores — which helps deliver smoother, low-latency, high-throughput multiplayer experiences. The project aims both at providing a fast, lean server for typical use (friends or small communities) and offering a foundation that could support large-scale servers, mod/plugin systems, and custom server configurations. FerrumC supports importing existing Minecraft worlds, making it easy for users to migrate from standard servers without losing progress. ...
    Downloads: 0 This Week
  • 21
    marqo

    Tensor search for humans

    A tensor-based search and analytics engine that seamlessly integrates with your applications, websites, and workflows. Marqo is a versatile and robust search and analytics engine that can be integrated into any website or application. Due to horizontal scalability, Marqo provides lightning-fast query times, even with millions of documents. Marqo helps you configure deep-learning models like CLIP to pull semantic meaning from images. It can seamlessly handle image-to-image, image-to-text and...
    Downloads: 0 This Week
  • 22
    JCTools

    Java Concurrency Tools for the JVM

    Java Concurrency Tools for the JVM. This project aims to offer some concurrent data structures currently missing from the JDK. There’s more to come and contributions/suggestions are most welcome. JCTools has enjoyed support from the community and contributions in the form of issues/tests/documentation/code have helped it grow. JCTools offers excellent performance at a reasonable price (FREE! under the Apache 2.0 License). It’s stable and in use by such distinguished frameworks as Netty,...
    Downloads: 0 This Week
  • 23
    Synapse Machine Learning

    Simple and distributed Machine Learning

    ...With the HTTP on Spark project, users can embed any web service into their SparkML models. For production-grade deployment, the Spark Serving project enables high throughput, sub-millisecond latency web services, backed by your Spark cluster.
    Downloads: 0 This Week
  • 24
    Nebula Graph

    A distributed, fast open-source graph database

    The graph database built for super large-scale graphs with milliseconds of latency. Optimized SUBGRAPH and FIND PATH for better performance. Optimized query paths to reduce redundant paths and time complexity. Optimized the method to get properties for better performance of MATCH statements. Nebula Graph adopts the Apache 2.0 license, one of the most permissive free software licenses in the world. Free as in freedom, because, under the Apache 2.0 license, you can use, copy, modify and...
    Downloads: 0 This Week
  • 25
    MiniMax-01

    Large-language-model & vision-language-model based on Linear Attention

    ...MiniMax-Text-01 uses a hybrid attention architecture that blends Lightning Attention, standard softmax attention, and Mixture-of-Experts (MoE) routing to achieve both high throughput and long-context reasoning. It has 456 billion total parameters with 45.9 billion activated per token and is trained with advanced parallel strategies such as LASP+, varlen ring attention, and Expert Tensor Parallelism, enabling a training context of 1 million tokens and up to 4 million tokens at inference. MiniMax-VL-01 extends this core by adding a 303M-parameter Vision Transformer and a two-layer MLP projector in a ViT–MLP–LLM framework, allowing the model to process images at dynamic resolutions up to 2016×2016.
    Downloads: 0 This Week