Search Results for "llama-cpp-python.whl" - Page 2

Showing 309 open source projects for "llama-cpp-python.whl"

  • 1
    Jan.ai

    Open source alternative to ChatGPT that runs 100% offline

    Jan.ai is an open-source, privacy-focused AI assistant that serves as an alternative to ChatGPT, running completely locally on your device. It allows you to download and run LLMs (large language models) offline while also offering optional integration with cloud-based model providers, giving you full control over your data and AI interactions.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 2
    tinygrad

    Deep learning framework

    This may not be the best deep learning framework, but it is a deep learning framework. Due to its extreme simplicity, it aims to be the easiest framework to add new accelerators to, with support for both inference and training. If XLA is CISC, tinygrad is RISC. A minimal usage sketch follows this entry.
    Downloads: 5 This Week
    Last Update:
    See Project
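    As a taste of the simplicity described above, here is a minimal sketch of tensor math with autograd in tinygrad. It assumes a recent release where Tensor is importable from the top-level package; tinygrad's API surface changes frequently, so treat this as illustrative rather than definitive.

      # Minimal tinygrad sketch: forward pass, scalar loss, and backprop.
      # Assumes `pip install tinygrad`; older releases may need
      # `from tinygrad.tensor import Tensor` instead.
      from tinygrad import Tensor

      x = Tensor([[1.0, 2.0], [3.0, 4.0]])                         # input data
      w = Tensor([[0.5, -0.5], [0.25, 0.75]], requires_grad=True)  # trainable weights

      loss = (x @ w).sum()   # matrix multiply, then reduce to a scalar
      loss.backward()        # autograd populates w.grad

      print(loss.numpy(), w.grad.numpy())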
  • 3
    SGLang

    SGLang is a fast serving framework for large language models

    SGLang is a fast serving framework for large language models and vision language models. It makes your interaction with models faster and more controllable by co-designing the backend runtime and the frontend language. A brief frontend-language sketch follows this entry.
    Downloads: 6 This Week
    Last Update:
    See Project
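    To illustrate the frontend language mentioned above, here is a small sketch based on SGLang's decorator-style frontend. The endpoint URL, prompt text, and generation parameters are placeholders, and the frontend API has changed across releases, so adapt it to the version you install.

      # Sketch of SGLang's frontend language; assumes an SGLang server is already running.
      import sglang as sgl

      @sgl.function
      def qa(s, question):
          s += sgl.system("You are a concise assistant.")
          s += sgl.user(question)
          s += sgl.assistant(sgl.gen("answer", max_tokens=64))

      # Placeholder endpoint; launch the server separately and point this at it.
      sgl.set_default_backend(sgl.RuntimeEndpoint("http://localhost:30000"))

      state = qa.run(question="What does a serving framework do?")
      print(state["answer"])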
  • 4
    Eliza

    Autonomous agents for everyone

    Build and deploy autonomous AI agents with consistent personalities across Discord, Twitter, and Telegram. Full support for voice, text, and media interactions. Built-in RAG memory system, document processing, media analysis, and autonomous trading capabilities. Supports multiple AI models including Llama, GPT-4, and Claude. Create custom actions, add new platform integrations, and extend functionality through a modular plugin system. Full TypeScript support.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 5
    bert4torch

    An elegant PyTorch implementation of transformers

    An elegant PyTorch implementation of transformers.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    VoxelCore

    Voxel game engine in C++ with OpenGL

    VoxelEngine-Cpp is a minimal voxel engine written in modern C++ using OpenGL, GLFW, and GLM, inspired by Minecraft-style block worlds. It offers a clean foundation for learning and experimenting with voxel-based rendering and world generation. With features like chunk loading, Perlin noise terrain generation, and basic lighting, the engine is a perfect starting point for developers who want to create sandbox games or explore the technical aspects of 3D voxel environments.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    far2l

    Linux port of FAR v2

    Linux fork of FAR Manager v2. It also works on macOS and BSD (though the latter is not tested on a regular basis). Plug-ins that currently work: NetRocks (SFTP/SCP/FTP/FTPS/SMB/NFS/WebDAV), colorer, multiarc, tmppanel, align, autowrap, drawing, edit case, SimpleIndent, Calculator, Python (optional scripting support).
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    LLM Foundry

    LLM training code for MosaicML foundation models

    Introducing MPT-7B, the first entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available for commercial use, and matches the quality of LLaMA-7B. MPT-7B was trained on the MosaicML platform in 9.5 days with zero human intervention at a cost of ~$200k. Large language models (LLMs) are changing the world, but for those outside well-resourced industry labs, it can be extremely difficult to train and deploy...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    h2oGPT

    Private chat with local GPT with document, images, video, etc.

    h2oGPT is an open-source platform that allows users to interact with local GPT models in a completely private environment. It supports a variety of document types, including PDFs, Word files, images, video frames, and even audio, enabling users to query and analyze their documents or engage in a private chat with AI. The platform is designed to be secure and offline, ensuring that all data remains private and under the user's control. h2oGPT supports several AI models, including oLLaMa and...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 10
    Tribe AI

    Low code tool to rapidly build and coordinate multi-agent teams

    Low code tool to rapidly build and coordinate multi-agent teams. Have you heard the saying, 'Two heads are better than one'? That's true for agents too. Tribe leverages the langgraph framework to let you customize and coordinate teams of agents easily. By splitting up tough tasks among agents that are good at different things, each one can focus on what it does best. This makes solving problems faster and better.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    LLamaSharp

    C#/.NET binding of llama.cpp, including LLaMa/GPT model inference

    The C#/.NET binding of llama.cpp. It provides APIs to run inference with LLaMA models and to deploy them in a local environment. It works on Windows, Linux, and macOS without requiring you to compile llama.cpp yourself. Its performance is close to that of llama.cpp. Furthermore, it provides integrations with other projects such as BotSharp to enable higher-level applications and UIs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Pruna AI

    Pruna is a model optimization framework built for developers

    Pruna is an open-source, self-hostable AI inference engine designed to help teams deploy and manage large language models (LLMs) efficiently across private or hybrid infrastructures. Built with performance and developer ergonomics in mind, Pruna simplifies inference workflows by enabling multi-model orchestration, autoscaling, GPU resource allocation, and compatibility with popular open-source models. It is ideal for companies or teams looking to reduce reliance on external APIs while...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    kotaemon

    An open-source RAG-based tool for chatting with your documents

    An open-source clean & customizable RAG UI for chatting with your documents. Built with both end users and developers in mind. This project serves as a functional RAG UI for both end users who want to do QA on their documents and developers who want to build their own RAG pipeline.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Axolotl

    Go ahead and axolotl questions

    Axolotl is a powerful and flexible framework for fine-tuning large language models on custom datasets. Built for researchers and developers, Axolotl simplifies the process of adapting LLMs for specific tasks, including chat, code generation, and instruction following. It supports a wide variety of model architectures and offers out-of-the-box optimization strategies for efficient training.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    LazyLLM

    Easiest and laziest way for building multi-agent LLMs applications

    LazyLLM is an optimized, lightweight LLM server designed for easy and fast deployment of large language models. It is fully compatible with the OpenAI API specification, enabling developers to integrate their own models into applications that normally rely on OpenAI’s endpoints. LazyLLM emphasizes low resource usage and fast inference while supporting multiple models. A client-side sketch follows this entry.
    Downloads: 2 This Week
    Last Update:
    See Project
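    Because the entry above describes an OpenAI-compatible server, a standard OpenAI client pointed at a local base URL should work. The host, port, and model name below are placeholders for whatever your deployment exposes; this is a sketch of the general pattern rather than LazyLLM-specific code.

      # Query an OpenAI-compatible endpoint with the official openai client.
      from openai import OpenAI

      client = OpenAI(
          base_url="http://localhost:8000/v1",  # placeholder: your server's address
          api_key="not-needed-locally",         # placeholder: local servers often ignore this
      )

      resp = client.chat.completions.create(
          model="my-local-model",  # placeholder: the model id your server registers
          messages=[{"role": "user", "content": "In one sentence, what is an LLM server?"}],
      )
      print(resp.choices[0].message.content)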
  • 16
    Bee Agent Framework

    The framework for building scalable agentic applications

    Open-source framework for building, deploying, and serving powerful agentic workflows at scale. The Bee Agent Framework makes it easy to build scalable agent-based workflows with your model of choice. The framework has been designed to perform robustly with IBM Granite and Llama 3.x models, and we’re actively working on optimizing its performance with other popular LLMs. Our goal is to empower developers to adopt the latest open-source and proprietary models with minimal changes to their current...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Lepton AI

    A Pythonic framework to simplify AI service building

    A Pythonic framework to simplify AI service building. Cutting-edge AI inference and training, unmatched cloud-native experience, and top-tier GPU infrastructure. Ensure 99.9% uptime with comprehensive health checks and automatic repairs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    AWS IoT Device SDK for C++ v2

    Next generation AWS IoT Client SDK for C++ using AWS Common Runtime

    ...-crt-CPP package.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    Unsloth

    Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster

    Unsloth is a framework designed to significantly improve the performance of Llama 3.3, DeepSeek-R1, and other reasoning large language models (LLMs). It optimizes these models to run up to 2x faster while using 70% less memory. Unsloth aims to make finetuning large models more efficient, offering users a simple, resource-efficient solution for customizing LLMs with their datasets (a brief loading sketch follows this entry). It provides a user-friendly experience through free notebooks and the ability to export finetuned models to various...
    Downloads: 1 This Week
    Last Update:
    See Project
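    As a rough illustration of the finetuning workflow described above, the sketch below loads a 4-bit model with Unsloth and attaches LoRA adapters. The checkpoint name, sequence length, and LoRA hyperparameters are illustrative placeholders; consult Unsloth's notebooks for the options your version supports.

      # Sketch: load a quantized model with Unsloth and prepare it for LoRA finetuning.
      from unsloth import FastLanguageModel

      model, tokenizer = FastLanguageModel.from_pretrained(
          model_name="unsloth/llama-3-8b-bnb-4bit",  # placeholder checkpoint
          max_seq_length=2048,
          load_in_4bit=True,
      )

      # Attach LoRA adapters for parameter-efficient finetuning.
      model = FastLanguageModel.get_peft_model(
          model,
          r=16,
          lora_alpha=16,
          target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
      )
      # model and tokenizer can then be handed to a trainer such as TRL's SFTTrainer.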
  • 20
    llama2-webui

    Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere

    Running Llama 2 with gradio web UI on GPU or CPU from anywhere (Linux/Windows/Mac).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    promptfoo

    Evaluate and compare LLM outputs, catch regressions, improve prompts

    Ensure high-quality LLM outputs with automatic evals. Use a representative sample of user inputs to reduce subjectivity when tuning prompts. Use built-in metrics, LLM-graded evals, or define your own custom metrics. Compare prompts and model outputs side-by-side, or integrate the library into your existing test/CI workflow. Use OpenAI, Anthropic, and open-source models like Llama and Vicuna, or integrate custom API providers for any LLM API.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Agents-Flex

    Agents-Flex is an elegant LLM Application Framework like LangChain

    Agents-Flex includes a variety of network protocols for connecting to LLMs, such as HTTP, SSE, and WS. Its simple and flexible design allows developers to easily connect to various LLMs, including OpenAI, Llama, and other AI providers. Agents-Flex provides a rich set of development templates and Prompt Frameworks, including FEW-SHOT, CRISPE, BROKE, and ICIO. Developers can also customize their own unique prompt templates. Agents-Flex has a very flexible Function Calling component. It supports local method...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    QuivrHQ

    Opinionated RAG for integrating GenAI into your apps

    Quivr is an open-source platform that leverages Retrieval-Augmented Generation (RAG) to integrate Generative AI into applications. It serves as a "second brain," enabling users to build powerful AI-driven assistants that can process and retrieve information efficiently. Quivr supports various large language models and vector stores, providing flexibility and customization for developers. A generic RAG sketch follows this entry.
    Downloads: 0 This Week
    Last Update:
    See Project
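    The retrieve-then-generate flow that the entry above refers to can be summarized in a few lines. The snippet below is a generic, dependency-light sketch of the RAG pattern, not Quivr's actual API: the toy embed function stands in for a real embedding model, and the assembled prompt would normally be sent to an LLM.

      # Generic RAG sketch: embed documents, retrieve the closest one, build a prompt.
      import numpy as np

      def embed(text: str) -> np.ndarray:
          # Toy stand-in for a real embedding model: hash characters into a vector.
          vec = np.zeros(64)
          for i, ch in enumerate(text.lower()):
              vec[(i + ord(ch)) % 64] += 1.0
          return vec / (np.linalg.norm(vec) + 1e-9)

      docs = [
          "Vector stores index document chunks by their embeddings.",
          "Retrieval-augmented generation fetches relevant chunks before answering.",
      ]
      doc_vecs = np.stack([embed(d) for d in docs])

      query = "How does retrieval-augmented generation work?"
      scores = doc_vecs @ embed(query)        # similarity scores (vectors are unit-normalized)
      context = docs[int(np.argmax(scores))]  # top-1 retrieval

      prompt = f"Context: {context}\n\nQuestion: {query}"
      print(prompt)  # in a real app, this prompt goes to the LLM of your choice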
  • 24
    Curated Transformers

    PyTorch library of curated Transformer models and their components

    State-of-the-art transformers, brick by brick. Curated Transformers is a transformer library for PyTorch. It provides state-of-the-art models that are composed of a set of reusable components. Supports state-of-the-art transformer models, including LLMs such as Falcon, Llama, and Dolly v2. Implementing a feature or bugfix benefits all models. For example, all models support 4/8-bit inference through the bitsandbytes library and each model can use the PyTorch meta device to avoid unnecessary...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    OpenLLM

    Operating LLMs in production

    An open platform for operating large language models (LLMs) in production. Fine-tune, serve, deploy, and monitor any LLMs with ease. With OpenLLM, you can run inference with any open-source large language models, deploy to the cloud or on-premises, and build powerful AI apps. It has built-in support for a wide range of open-source LLMs and model runtimes, including Llama 2, StableLM, Falcon, Dolly, Flan-T5, ChatGLM, StarCoder, and more. Serve LLMs over a RESTful API or gRPC with one command, query via WebUI...
    Downloads: 0 This Week
    Last Update:
    See Project