server performance free download

Showing 5 open source projects for "server performance"

View related business solutions

Artificial Intelligence Unix Shell Clear Filters & Widen Search

Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
1

Text Embeddings Inference

High-performance inference server for text embeddings models API layer

Text Embeddings Inference is a high-performance server designed to serve text embedding models efficiently in production environments. It focuses on delivering fast and scalable embedding generation by leveraging optimized inference techniques and modern hardware acceleration. It is built to support transformer-based embedding models, making it suitable for tasks such as semantic search, clustering, and retrieval-augmented systems.

Downloads: 2 This Week

Last Update: 2026-03-23
See Project
2

MCP Toolbox for Databases

Open source MCP server that exposes database tools for AI agents

GenAI Toolbox, also known as MCP Toolbox for Databases, is an open source server designed to simplify how generative AI applications interact with databases. It provides a central service that exposes database operations as reusable tools that can be consumed by AI agents and developer workflows. It handles common infrastructure concerns such as authentication, connection pooling, and performance optimization so developers do not have to implement them individually in each application. ...

Downloads: 0 This Week

Last Update: 1 day ago
See Project
3

Devon

Open source AI pair programmer for coding, debugging, automation

...It operates as an agent-based system that can explore codebases, edit files, and execute development workflows with minimal manual intervention. Devon uses a client-server architecture with a Python backend and multiple user interfaces, including a terminal interface and an Electron-based desktop application. Devon integrates with multiple large language models, allowing users to choose between different providers for performance, cost, and latency considerations. It is capable of performing tasks such as debugging, writing tests, analyzing code structure, and navigating complex repositories. ...

Downloads: 0 This Week

Last Update: 14 hours ago
See Project
4

Preswald

Python tool for browser-based interactive data apps in one file

Preswald is an open source Python-based framework and static-site generator designed for building interactive data applications that run entirely in the browser. It packages application logic, data processing, and user interface components into a single self-contained output, enabling easy sharing and deployment without requiring local dependencies. Preswald leverages a WebAssembly runtime along with technologies like Pyodide and DuckDB to execute Python code directly in the browser...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
Stop Storing Third-Party Tokens in Your Database
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.

Try Auth0 for Free
5

GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

...It is designed for large-scale inference and supports both left-to-right generation and blank filling, making it versatile across NLP tasks. Trained on over 400 billion tokens (200B English, 200B Chinese), it achieves performance surpassing GPT-3 175B, OPT-175B, and BLOOM-176B on multiple benchmarks, while also showing significant improvements on Chinese datasets compared to other large models. The model supports efficient inference via INT8 and INT4 quantization, reducing hardware requirements from 8× A100 GPUs to as little as a single server with 4× RTX 3090s. ...

Downloads: 1 This Week

Last Update: 5 days ago
See Project