performance free download

llmfit

157 models, 30 providers, one command to find what runs on your hardware

...By presenting clear performance estimates and compatibility guidance, the project reduces the trial-and-error typically involved in local LLM experimentation. Overall, llmfit serves as a practical decision assistant for developers who want to run language models efficiently on their own machines.

Downloads: 19 This Week

Last Update: 4 hours ago

uzu

A high-performance inference engine for AI models

uzu is a high-performance inference engine designed to run artificial intelligence models efficiently on Apple Silicon hardware. Written primarily in Rust and leveraging Apple’s Metal framework, the project focuses on maximizing performance when executing large language models and other AI workloads on devices such as Mac computers with M-series chips.

Downloads: 0 This Week

Last Update: 2026-06-08

mistral.rs

Fast, flexible LLM inference

mistral.rs is a fast and flexible LLM inference engine implemented in Rust, designed to run and serve modern language models with an emphasis on performance and practical deployment. It provides multiple entry points for developers, including a CLI for running models locally and an HTTP server that exposes an OpenAI-compatible API surface for easy integration with existing clients. The project includes hardware-aware tooling that can benchmark a system and choose sensible quantization and device-mapping strategies, helping users get strong performance without manual tuning. ...

Downloads: 2 This Week

Last Update: 2026-06-25

Paddler

Open-source LLM load balancer and serving platform for hosting LLMs

...A built-in administrative interface allows developers and operations teams to manage models, observe system performance, and test inference endpoints.

Downloads: 1 This Week

Last Update: 2026-06-11

MusicGPT

Generate music based on natural language prompts using LLMs

...The software allows users to run advanced music generation systems directly on their own devices without requiring heavy dependencies such as Python or full machine learning frameworks. Instead, it provides a lightweight environment capable of executing music generation models locally on CPUs or GPUs while maintaining strong performance across operating systems including Windows, macOS, and Linux. Users can describe a musical style, mood, or instrumentation using text prompts, and the system produces original audio samples based on those instructions. The application currently integrates with models such as MusicGen and is designed to support additional models transparently in the future. ...

Downloads: 17 This Week

Last Update: 2026-03-09

webclaw

Fast, local-first web content extraction for LLMs

webclaw is a high-performance web content extraction tool designed specifically for AI agents and large language models, focusing on delivering clean, structured data instead of raw HTML. It is built in Rust and operates without a headless browser, using advanced techniques such as TLS fingerprinting to bypass common scraping barriers and mimic real browser behavior.

Downloads: 0 This Week

Last Update: 2026-06-27

Korvus

Korvus is a search SDK that unifies the entire RAG pipeline

...By leveraging PostgresML and vector extensions such as pgvector, Korvus eliminates the need for external microservices typically used for AI search architectures, reducing both system complexity and latency. The architecture enables machine learning operations to occur directly in the database, minimizing data transfer between services and improving overall performance for large datasets.

Downloads: 0 This Week

Last Update: 2026-03-09

pgvecto.rs

Vector database plugin for Postgres, written in Rust

pgvecto.rs is a Postgres extension that provides vector similarity search functions. It is written in Rust and based on pgrx. It is currently under heavy development, so take care when using it in production. Because pgvecto.rs is a Postgres extension, you can use it directly within your existing database, which makes it easy to integrate into your existing workflows and applications. pgvecto.rs supports filtering: you can set conditions when searching or retrieving points. This...

Downloads: 2 This Week

Last Update: 2024-11-22

Extractous

Fast and efficient unstructured data extraction

...Its purpose is to extract text and metadata efficiently from formats such as PDF, Word, HTML, email archives, images, and more, without depending on external APIs or separate parsing servers. The project emphasizes performance and low memory usage, and its maintainers describe it as a local-first alternative to heavier extraction stacks. For broader format support, the system combines its Rust core with ahead-of-time compiled Apache Tika shared libraries, which allows it to extend parsing coverage while still avoiding traditional server-based overhead. ...

Downloads: 0 This Week

Last Update: 2026-03-06

Floneum

Instant, controllable, local pre-trained AI models in Rust

...Many plugins can be written in different programming languages and compiled to WebAssembly modules, allowing them to run safely within the system. The platform is implemented primarily in Rust and emphasizes performance, modularity, and local execution.

Downloads: 0 This Week

Last Update: 2026-03-06

LangChain Rust

LangChain for Rust, the easiest way to write LLM-based programs

...The library aims to provide Rust developers with a structured framework for orchestrating prompts, chains, agents, and external tools within LLM-driven workflows. By adapting LangChain concepts to the Rust programming language, the project emphasizes performance, safety, and efficient memory management. Developers can use the framework to build chatbots, autonomous agents, and knowledge-augmented AI systems that interact with external data sources. The library provides abstractions for model providers, prompt templates, conversation memory, and vector search integrations. It also enables the construction of multi-step pipelines where LLM outputs feed into subsequent actions or tool calls.

Downloads: 0 This Week

Last Update: 2026-03-09

llm

An ecosystem of Rust libraries for working with large language models

llm is an ecosystem of Rust libraries for working with large language models, built on top of the fast, efficient GGML library for machine learning. The primary entry point for developers is the llm crate, which wraps llm-base and the supported model crates. Documentation for the released version is available on Docs.rs. For end users, there is a CLI application, llm-cli, which provides a convenient interface for interacting with supported models. Text generation can be done as a...

Downloads: 0 This Week

Last Update: 2023-08-21

Search Results for "performance"

Showing 12 open source projects for "performance"
