routing free download

Showing 74 open source projects for "routing"

View related business solutions

Python Clear Filters & Widen Search

Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.

Start Free
1

The Falcon Web Framework

The no-nonsense REST API and microservices framework

...Idiomatic HTTP error responses. Straightforward exception handling. Snappy testing with WSGI/ASGI helpers and mocks. CPython 3.5+ and PyPy 3.5+ support. No reliance on magic globals for routing and state management. Stable interfaces with an emphasis on backward compatibility. Simple API modeling through centralized RESTful routing. Highly-optimized, extensible code base.

Downloads: 6 This Week

Last Update: 2025-11-10
See Project
2

Tribler

Privacy enhanced BitTorrent client with P2P content discovery

Tribler is a decentralized, privacy-enhanced BitTorrent client developed by researchers at Delft University of Technology. It introduces built-in anonymity using a Tor-like onion routing network and integrates its own blockchain for economic incentives and trust management. Tribler supports standard torrenting features along with distributed search, self-contained channels, and peer reputation. Its goal is to provide a fully autonomous file-sharing network without relying on external servers, search engines, or trackers.

Downloads: 39 This Week

Last Update: 2026-04-02
See Project
3

Bottle

bottle.py is a fast and simple micro-framework for python applications

...It is distributed as a single file with no external dependencies, making it perfect for rapid development, prototyping, or embedded use. Despite its small size, Bottle supports routing, templates, request handling, and plugin support, offering a full-featured toolkit in an extremely compact package.

Downloads: 5 This Week

Last Update: 2025-07-02
See Project
4

caveman

Why use many token when few token do trick

...The project often serves as a foundation for developers who want to build applications quickly without being constrained by heavy conventions or extensive configuration. It may include utilities for routing, state handling, or simple server logic, depending on its implementation scope. Caveman embraces a philosophy of “less is more,” encouraging developers to focus on core functionality rather than framework overhead. Its design makes it particularly useful for experimentation, small tools, or proof-of-concept applications.

Downloads: 11 This Week

Last Update: 1 day ago
See Project
Auth0 B2B Essentials: SSO, MFA, and RBAC Built In
Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.

Sign Up Free
5

OpenMythos

A theoretical reconstruction of the Claude Mythos architecture

...It divides computation into three main stages, including a pre-processing phase, a looped recurrent reasoning block, and a final output refinement stage, creating a structured pipeline for inference. The architecture incorporates advanced techniques such as mixture-of-experts routing, adaptive computation time, and multiple attention mechanisms to dynamically allocate compute where needed. It is highly configurable through a centralized configuration system, allowing experimentation with different architectural parameters such as loop depth, attention type.

Downloads: 17 This Week

Last Update: 2026-04-27
See Project
6

Claude Cognitive

Persistent context and multi-instance coordination

...It introduces an attention-based context router that prioritizes files and content relevant to the current development discussion — tagging them as HOT, WARM, or COLD based on recency and keyword activation — so Claude Code doesn’t waste token budget rereading irrelevant code. This context routing dramatically reduces redundant token usage and accelerates large codebase interactions by focusing only on what truly matters to the current task. Additionally, Claude-Cognitive includes a pool coordinator to share state across multiple Claude Code instances, preserving what’s been learned or completed and preventing repetitive debugging or redundant exploration.

Downloads: 7 This Week

Last Update: 2026-01-28
See Project
7

NVIDIA cuOpt

GPU accelerated decision optimization

NVIDIA cuOpt is a GPU-accelerated optimization engine designed to solve complex mathematical optimization problems at large scale. It supports a range of optimization models including linear programming (LP), mixed integer linear programming (MILP), quadratic programming (QP), and vehicle routing problems (VRP). Built primarily in C++, cuOpt leverages NVIDIA GPUs to deliver near real-time solutions for optimization tasks involving millions of variables and constraints. The platform provides multiple interfaces, including C, Python, and server APIs, allowing developers to integrate optimization capabilities into applications and services. cuOpt is designed for high-performance environments and can be deployed across cloud, hybrid, or on-premise infrastructures. ...

Downloads: 3 This Week

Last Update: 2026-04-09
See Project
8

LoLLMs Hub Fortress

A proxy server for multiple ollama instances with Key security

...This design allows organizations to scale horizontally, combining local hardware, cloud resources, and specialized inference servers into a unified infrastructure. LoLLMs Hub also introduces intelligent routing mechanisms that automatically select the most appropriate model based on rules such as priority, load balancing, or availability, improving efficiency and reliability.

Downloads: 1 This Week

Last Update: 2026-04-20
See Project
9

FastHX

FastAPI server-side rendering with built-in HTMX support.

FastHX is a high-performance HTTP and WebSocket server framework designed for Haxe, enabling fast and scalable web application development.

Downloads: 2 This Week

Last Update: 2025-12-31
See Project
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
10

socketify.py

Bringing Http/Https and WebSockets High Performance servers for PyPy3

Socketify.py is a reliable, high-performance Python web framework for building large-scale app backends and microservices. With no precedents websocket performance and a really fast HTTP server that can delivery encrypted TLS 1.3 quicker than most alternative servers can do even unencrypted, cleartext messaging.

Downloads: 5 This Week

Last Update: 2024-10-29
See Project
11

Django

The Web framework for perfectionists with deadlines

Django is a high-level, free and open-source Python web framework founded on the Model–Template–View (MTV) pattern, designed to facilitate rapid development of secure, maintainable, and scalable database-driven websites. First, read docs/intro/install.txt for instructions on installing Django. Next, work through the tutorials in order (docs/intro/tutorial01.txt, docs/intro/tutorial02.txt, etc.). If you want to set up an actual deployment server, read docs/howto/deployment/index.txt for...

Downloads: 43 This Week

Last Update: 2026-04-07
See Project
12

OpenAI Forward

An efficient forwarding service designed for LLMs

...Its main purpose is to make model access more manageable and efficient by adding operational controls such as request rate limiting, token rate limiting, caching, logging, routing, and key management around existing LLM endpoints. The project can proxy both local and cloud-hosted language model services, which makes it useful for teams that want a single control layer regardless of whether they are using something like LocalAI or a hosted provider compatible with OpenAI-style APIs. A major emphasis of the repository is asynchronous performance, using tools such as uvicorn, aiohttp, and asyncio to support high-throughput forwarding workloads.

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
13

MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

...Instead of forcing each token to attend to every other token in the sequence, MoBA divides the context into blocks and dynamically routes queries to only the most relevant segments of information. This routing strategy reduces the computational cost associated with traditional attention while preserving performance on reasoning and long-context tasks. The approach allows language models to scale to significantly longer input contexts without the quadratic computational cost normally associated with transformer attention mechanisms.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
14

DeepEP

DeepEP: an efficient expert-parallel communication library

...Its core role is to implement high-throughput, low-latency all-to-all GPU communication kernels, which handle the dispatching of tokens to different experts (or shards) and then combining expert outputs back into the main data flow. Because MoE architectures require routing inputs to different experts, communication overhead can become a bottleneck — DeepEP addresses that by providing optimized GPU kernels and efficient dispatch/combining logic. The library also supports low-precision operations (such as FP8) to reduce memory and bandwidth usage during communication. DeepEP is aimed at large-scale model inference or training systems where expert parallelism is used to scale model capacity without replicating entire networks.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
15

Python Web

Course to learn frontend web development

This repository is a beginner-friendly template for creating Python web applications using Flask. Designed by @mouredev for learning and practice, it provides a simple, minimalistic structure for serving HTML pages and static content. Ideal for educational purposes and small-scale web projects, it also includes preconfigured files to simplify deployment and local development.

Downloads: 3 This Week

Last Update: 2025-06-04
See Project
16

AIOHTTP

Asynchronous HTTP client/server framework for asyncio and Python

Asynchronous HTTP Client/Server for asyncio and Python. AIOHTTP supports both client and server side of HTTP protocol. A long awaited new feature is tracing client request life cycle to figure out when and why client request spends a time waiting for connection establishment, getting server response headers etc. Now it is possible by registering special signal handlers on every request processing stage. The main change is dropping yield from support and using async/await everywhere....

1 Review

Downloads: 27 This Week

Last Update: 2026-03-31
See Project
17

VT.ai

Multimodal AI chat app with dynamic conversation routing

VT.ai is a minimal multimodal AI chat application designed to provide a simple and efficient interface for interacting with large language models. It focuses on delivering a clean user experience with support for both text and multimodal inputs, allowing users to engage with models in a flexible way. The application is lightweight and designed to run with minimal overhead, making it suitable for quick interactions and experimentation. It integrates with local model providers such as Ollama,...

Downloads: 4 This Week

Last Update: 2026-04-25
See Project
18

DreamO

A Unified Framework for Image Customization

...Built on a diffusion-transformer (DiT) backbone, it supports a diverse set of tasks — including identity preservation, virtual “try-on” (e.g. clothing, accessories), style transfer, IP adaptation (objects/characters), and layout/condition-aware customizations — all handled within the same unified architecture. DreamO’s design introduces a feature routing constraint that helps disentangle different control conditions (like identity, style, clothing) when more than one is specified, which significantly reduces conflicts and artifacts when combining controls. It also uses a “placeholder strategy” to precisely align conditional inputs (e.g. where to place clothing or objects) in generated images, giving users fine-grained control over composition.

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
19

Raccoon

High-performance reconnaissance and vulnerability scanning tool

Raccoon is a high-performance offensive security tool designed to assist with reconnaissance and vulnerability scanning during penetration testing and security assessments. It automates several common reconnaissance tasks, allowing security professionals to quickly gather information about a target system or web application. The tool combines multiple scanning techniques into a single workflow, helping users identify potential weaknesses, exposed services, and accessible resources on a...

Downloads: 9 This Week

Last Update: 2 days ago
See Project
20

BlackSheep

Fast ASGI web framework for Python

BlackSheep is an asynchronous web framework to build event-based web applications with Python. A rich code API, based on dependency injection and inspired by Flask and ASP.NET Core. A typing-friendly codebase, which enables a comfortable development experience thanks to hints when coding with IDEs. Built-in generation of OpenAPI Documentation, supporting version 3, YAML, and JSON. A cross-platform framework, using the most modern versions of Python. BlackSheep supports automatic binding of...

Downloads: 2 This Week

Last Update: 2026-02-25
See Project
21

Pruna AI

Pruna is a model optimization framework built for developers

Pruna is an open-source, self-hostable AI inference engine designed to help teams deploy and manage large language models (LLMs) efficiently across private or hybrid infrastructures. Built with performance and developer ergonomics in mind, Pruna simplifies inference workflows by enabling multi-model orchestration, autoscaling, GPU resource allocation, and compatibility with popular open-source models. It is ideal for companies or teams looking to reduce reliance on external APIs while...

Downloads: 5 This Week

Last Update: 2026-04-22
See Project
22

Robin

AI-powered tool for dark web OSINT search and investigation

...Robin also performs scraping of discovered pages through Tor sessions, allowing users to gather additional context from dark web sites while maintaining the required network routing. By integrating AI models, the platform can interpret results, highlight key information, and produce summaries that help analysts understand findings faster. The project provides a modular architecture separating search, scraping, and AI processing components so it can be extended with new data sources.

Downloads: 14 This Week

Last Update: 2026-03-31
See Project
23

Semantic Router

Superfast AI decision making and processing of multi-modal data

Semantic Router is a superfast decision-making layer for your LLMs and agents. Rather than waiting for slow, unreliable LLM generations to make tool-use or safety decisions, we use the magic of semantic vector space — routing our requests using semantic meaning. Combining LLMs with deterministic rules means we can be confident that our AI systems behave as intended. Cramming agent tools into the limited context window is expensive, slow, and fundamentally limited. Semantic Router enables lightning-fast and cheap tool usage that can scale to many thousands of tools. ...

Downloads: 6 This Week

Last Update: 2025-11-18
See Project
24

Whoogle Search

A self-hosted, ad-free, privacy-respecting metasearch engine

...<tag> <query>) searches. Optional location-based searching (i.e. results near <city>). Optional NoJS mode to view search results in a separate window with JavaScript blocked. If routing your request through Tor you will need to make the following adjustments. Due to the nature of interacting with Google through Tor we will need to be able to send signals to Tor.

Downloads: 6 This Week

Last Update: 2026-04-15
See Project
25

KServe

Standardized Serverless ML Inference Platform on Kubernetes

KServe provides a Kubernetes Custom Resource Definition for serving machine learning (ML) models on arbitrary frameworks. It aims to solve production model serving use cases by providing performant, high abstraction interfaces for common ML frameworks like Tensorflow, XGBoost, ScikitLearn, PyTorch, and ONNX. It encapsulates the complexity of autoscaling, networking, health checking, and server configuration to bring cutting edge serving features like GPU Autoscaling, Scale to Zero, and...

Downloads: 8 This Week

Last Update: 2026-04-29
See Project