Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "llama-cpp-python.whl" - Page 3

x

Sort By:

Relevance

OS

Windows 313
Linux 276
Mac 235
More...
BSD 104
ChromeOS 73
Desktop Operating Systems 21
Mobile Operating Systems 7
Server Operating Systems 3
Embedded Operating Systems 1

Category

Software Development 131
Artificial Intelligence 79
System 22
Games 15
Scientific/Engineering 13
Education 10
Business 9
Internet 9
Multimedia 9
Communications 8
Formats and Protocols 8
Text Editors 6
Database 5
Desktop Environment 5
Security 5
Blockchain 2
Mobile 1
Productivity 1
Terminals 1

License

OSI-Approved Open Source 242
Public Domain 7
Other License 5
Creative Commons Attribution License 4

Translations

Programming Language

C++ 171
Python 55
C 27
Java 18
More...
C# 8
TypeScript 8
BASIC 6
JavaScript 6
Assembly 3
Delphi/Kylix 3
Go 3
Lua 3
Unix Shell 3
Visual Basic .NET 3
PHP 2
Ruby 2
Rust 2
Swift 2
ActionScript 1
Ada 1
Forth 1
Fortran 1
Objective C 1
Perl 1
PL/SQL 1
VBScript 1
Visual Basic 1
XSL (XSLT/XPath/XSL-FO) 1
Yacc 1

Status

Production/Stable 51
Beta 42
Pre-Alpha 28
Planning 27
More...
Alpha 25
Inactive 6
Mature 3

Showing 437 open source projects for "llama-cpp-python.whl"

View related business solutions

Keep company data safe with Chrome Enterprise
Protect your business with AI policies and data loss prevention in the browser

Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.

Download Chrome
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

SillyTavern

LLM Frontend for Power Users

Mobile-friendly, Multi-API (KoboldAI/CPP, Horde, NovelAI, Ooba, OpenAI, OpenRouter, Claude, Scale), VN-like Waifu Mode, Horde SD, System TTS, WorldInfo (lorebooks), customizable UI, auto-translate, and more prompt options than you'd ever want or need. Optional Extras server for more SD/TTS options + ChromaDB/Summarize. SillyTavern is a user interface you can install on your computer (and Android phones) that allows you to interact with text generation AIs and chat/roleplay with characters you or the community create. ...

Downloads: 148 This Week

Last Update: 2025-11-22
See Project
2

OpenLLM

Operating LLMs in production

...With OpenLLM, you can run inference with any open-source large-language models, deploy to the cloud or on-premises, and build powerful AI apps. Built-in supports a wide range of open-source LLMs and model runtime, including Llama 2， StableLM, Falcon, Dolly, Flan-T5, ChatGLM, StarCoder, and more. Serve LLMs over RESTful API or gRPC with one command, query via WebUI, CLI, our Python/Javascript client, or any HTTP client.

Downloads: 3 This Week

Last Update: 2025-04-21
See Project
3

LazyLLM

Easiest and laziest way for building multi-agent LLMs applications

LazyLLM is an optimized, lightweight LLM server designed for easy and fast deployment of large language models. It is fully compatible with the OpenAI API specification, enabling developers to integrate their own models into applications that normally rely on OpenAI’s endpoints. LazyLLM emphasizes low resource usage and fast inference while supporting multiple models.

Downloads: 0 This Week

Last Update: 2025-11-01
See Project
4

BrowserAI

Run local LLMs like llama, deepseek, kokoro etc. inside your browser

BrowserAI is a cutting-edge platform that allows users to run large language models (LLMs) directly in their web browser without the need for a server. It leverages WebGPU for accelerated performance and supports offline functionality, making it a highly efficient and privacy-conscious solution. The platform provides a developer-friendly SDK with pre-configured popular models, and it allows for seamless switching between MLC and Transformer engines. Additionally, it supports features such as...

Downloads: 2 This Week

Last Update: 2025-05-21
See Project
Incredable is the first DLT-secured platform that allows you to save time, eliminate errors, and ensure your organization is compliant all in one place.
For healthcare Providers and Facilities

Incredable streamlines and simplifies the complex process of medical credentialing for hospitals and medical facilities, helping you save valuable time, reduce costs, and minimize risks. With Incredable, you can effortlessly manage all your healthcare providers and their credentials within a single, unified platform. Our state-of-the-art technology ensures top-notch data security, giving you peace of mind.

Learn More
5

VoxelCore

Voxel game engine in C++ with OpenGL

VoxelEngine-Cpp is a minimal voxel engine written in modern C++ using OpenGL, GLFW, and GLM, inspired by Minecraft-style block worlds. It offers a clean foundation for learning and experimenting with voxel-based rendering and world generation. With features like chunk loading, perlin noise terrain generation, and basic lighting, the engine is a perfect starting point for developers who want to create sandbox games or explore the technical aspects of 3D voxel environments.

Downloads: 4 This Week

Last Update: 11 hours ago
See Project
6

far2l

Linux port of FAR v2

Linux fork of FAR Manager v2. Works also on OSX/MacOS and BSD (but the latter is not tested on a regular manner). Plug-ins that are currently working: NetRocks (SFTP/SCP/FTP/FTPS/SMB/NFS/WebDAV), colorer, multiarc, tmppanel, align, autowrap, drawing, edit case, SimpleIndent, Calculator, Python (optional scripting support).

Downloads: 4 This Week

Last Update: 2025-10-26
See Project
7

promptfoo

Evaluate and compare LLM outputs, catch regressions, improve prompts

...Use built-in metrics, LLM-graded evals, or define your own custom metrics. Compare prompts and model outputs side-by-side, or integrate the library into your existing test/CI workflow. Use OpenAI, Anthropic, and open-source models like Llama and Vicuna, or integrate custom API providers for any LLM API.

Downloads: 1 This Week

Last Update: 6 days ago
See Project
8

SGLang

SGLang is a fast serving framework for large language models

SGLang is a fast serving framework for large language models and vision language models. It makes your interaction with models faster and more controllable by co-designing the backend runtime and frontend language.

Downloads: 0 This Week

Last Update: 2025-11-15
See Project
9

h2oGPT

Private chat with local GPT with document, images, video, etc.

h2oGPT is an open-source platform that allows users to interact with local GPT models in a completely private environment. It supports a variety of document types, including PDFs, Word files, images, video frames, and even audio, enabling users to query and analyze their documents or engage in a private chat with AI. The platform is designed to be secure and offline, ensuring that all data remains private and under the user's control. h2oGPT supports several AI models, including oLLaMa and...

Downloads: 3 This Week

Last Update: 2025-02-22
See Project
Smart Business Texting that Generates Pipeline
Create and convert pipeline at scale through industry leading SMS campaigns, automation, and conversation management.

TextUs is the leading text messaging service provider for businesses that want to engage in real-time conversations with customers, leads, employees and candidates. Text messaging is one of the most engaging ways to communicate with customers, candidates, employees and leads. 1:1, two-way messaging encourages response and engagement. Text messages help teams get 10x the response rate over phone and email. Business text messaging has become a more viable form of communication than traditional mediums. The TextUs user experience is intentionally designed to resemble the familiar SMS inbox, allowing users to easily manage contacts, conversations, and campaigns. Work right from your desktop with the TextUs web app or use the Chrome extension alongside your ATS or CRM. Leverage the mobile app for on-the-go sending and responding.

Learn More
10

Dear ImGui

Bloat-free Graphical User interface for C++ with minimal dependencies

Dear ImGui is a bloat-free graphical user interface library for C++. It outputs optimized vertex buffers that you can render anytime in your 3D-pipeline enabled application. It is fast, portable, renderer agnostic and self-contained (no external dependencies). Dear ImGui is designed to enable fast iterations and to empower programmers to create content creation tools and visualization / debug tools (as opposed to UI for the average end-user). It favors simplicity and productivity toward this...

Downloads: 28 This Week

Last Update: 2025-11-20
See Project
11

Pruna AI

Pruna is a model optimization framework built for developers

Pruna is an open-source, self-hostable AI inference engine designed to help teams deploy and manage large language models (LLMs) efficiently across private or hybrid infrastructures. Built with performance and developer ergonomics in mind, Pruna simplifies inference workflows by enabling multi-model orchestration, autoscaling, GPU resource allocation, and compatibility with popular open-source models. It is ideal for companies or teams looking to reduce reliance on external APIs while...

Downloads: 0 This Week

Last Update: 2025-11-10
See Project
12

Curated Transformers

PyTorch library of curated Transformer models and their components

...It provides state-of-the-art models that are composed of a set of reusable components. Supports state-of-the-art transformer models, including LLMs such as Falcon, Llama, and Dolly v2. Implementing a feature or bugfix benefits all models. For example, all models support 4/8-bit inference through the bitsandbytes library and each model can use the PyTorch meta device to avoid unnecessary allocations and initialization.

Downloads: 0 This Week

Last Update: 2024-04-17
See Project
13

Modern C++ Programming

Modern C++ Programming Course

Modern-CPP-Programming is a teaching repository that introduces practical C++11/14/17 features through focused examples, exercises, and notes. It walks through core language topics like RAII, move semantics, templates and metaprogramming, lambdas, and smart pointers with an eye toward real-world patterns. Concurrency and performance enter the picture via threads, atomics, futures, and memory considerations, helping learners reason about correctness and speed.

Downloads: 13 This Week

Last Update: 2025-09-05
See Project
14

Tribe AI

Low code tool to rapidly build and coordinate multi-agent teams

Low code tool to rapidly build and coordinate multi-agent teams. Have you heard the saying, 'Two minds are better than one'? That's true for agents too. Tribe leverages on the langgraph framework to let you customize and coordinate teams of agents easily. By splitting up tough tasks among agents who are good at different things, each one can focus on what it does best. This makes solving problems faster and better.

Downloads: 0 This Week

Last Update: 2024-10-07
See Project
15

Penzai

A JAX research toolkit to build, edit, & visualize neural networks

Penzai, developed by Google DeepMind, is a JAX-based library for representing, visualizing, and manipulating neural network models as functional pytree data structures. It is designed to make machine learning research more interpretable and interactive, particularly for tasks like model surgery, ablation studies, architecture debugging, and interpretability research. Unlike conventional neural network libraries, Penzai exposes the full internal structure of models, enabling fine-grained...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
16

Synthetic Data Kit

Tool for generating high quality Synthetic datasets

Synthetic Data Kit is a CLI-centric toolkit for generating high-quality synthetic datasets to fine-tune Llama models, with an emphasis on producing reasoning traces and QA pairs that line up with modern instruction-tuning formats. It ships an opinionated, modular workflow that covers ingesting heterogeneous sources (documents, transcripts), prompting models to create labeled examples, and exporting to fine-tuning schemas with minimal glue code.

Downloads: 0 This Week

Last Update: 2025-10-25
See Project
17

MCP Hub

An MCP client for Neovim that seamlessly integrates MCP servers

mcphub.nvim is an MCP (Model Context Protocol) client plugin for Neovim that seamlessly integrates MCP servers into your editing workflow with an intuitive interface for managing, testing, and using MCP servers with your favorite chat plugins. Create your first MCP capable agent you need only 6 lines of code. Works with any langchain-supported LLM that supports tool calling (OpenAI, Anthropic, Groq, LLama etc.) Explore MCP capabilities and generate starter code with the interactive code builder. An MCP client for Neovim that seamlessly integrates MCP servers into your editing workflow with an intuitive interface for managing, testing, and using MCP servers with your favorite chat plugins.

Downloads: 0 This Week

Last Update: 2025-08-15
See Project
18

Agents-Flex

Agents-Flex is an elegant LLM Application Framework like LangChain

Agents-Flex includes a variety of network protocols for connecting LLMs, such as HTTP, SSE and WS. Its simple and flexible design allows developers to easily connect to various LLMs, including OpenAI, LLama, and other AI. Agents-Flex provides a rich set of development templates and Prompt Frameworks, including FEW-SHOT, CRISPE, BROKE, and ICIO. Developers can also customize their own unique prompt templates. Agents-Flex has a very flexible Function Calling component. It supports local method definitions, parsing, callbacks through LLMs, and executing local methods to obtain results. ...

Downloads: 0 This Week

Last Update: 2025-11-17
See Project
19

AWS IoT Device SDK for C++ v2

Next generation AWS IoT Client SDK for C++ using AWS Common Runtime

...This SDK is built on the AWS Common Runtime, a collection of libraries (AWS-c-common, aws-c-io, aws-c-mqtt, aws-c-http, aws-c-cal, aws-c-auth, s2n...) written in C to be cross-platform, high-performance, secure, and reliable. The libraries are bound to C++ by the AWS-crt-CPP package.

Downloads: 1 This Week

Last Update: 2025-11-06
See Project
20

Speech-AI-Forge

Speech-AI-Forge is a project developed around TTS generation model

...It is model-agnostic and advertises support for a variety of TTS and speech models such as ChatTTS, CosyVoice, Fish-Speech, FireredTTS and others, as well as Whisper-based ASR, giving you a flexible playground for experimenting with different speech stacks. The project also integrates with general-purpose LLMs (for example GPT- or LLaMA-style models), which can be used to pre-process text, manage conversations.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
21

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

CogVLM2 is the second generation of the CogVLM vision-language model series, developed by ZhipuAI and released in 2024. Built on Meta-Llama-3-8B-Instruct, CogVLM2 significantly improves over its predecessor by providing stronger performance across multimodal benchmarks such as TextVQA, DocVQA, and ChartQA, while introducing extended context length support of up to 8K tokens and high-resolution image input up to 1344×1344. The series includes models for both image understanding and video understanding, with CogVLM2-Video supporting up to 1-minute videos by analyzing keyframes. ...

Downloads: 0 This Week

Last Update: 6 days ago
See Project
22

C++ lectures in russian

Open lectures in C++

Открытые лекции по C++ являются многолетним конспектом соответствующего учебного курса, с существенными расширениями (о которых можно судить по приложенному плану курса). Проект служит основой для преподавания современного C++, а также постоянно обновляется по мере обновления языка. Этот материал в перспективе должен стать лучшим бесплатным русскоязычным учебником по C++

1 Review

Downloads: 394 This Week

Last Update: 1 day ago
See Project
23

palm-cpp-gamekit

PalmOS game kit (coded in C++)

Downloads: 0 This Week

Last Update: 2024-10-14
See Project
24

Vigenère

A powerful tool for cracking the Vigenère cipher

Vigenère is a program for encrypting and decrypting text using the Vigenère cipher. Built-in cryptanalysis tools make it possible to crack the cipher without needing the key — or even knowing its length.

Downloads: 2 This Week

Last Update: 2025-10-31
See Project
25

cpp studies - Rubik Engine

Downloads: 0 This Week

Last Update: 2025-08-04
See Project

Previous
1
2
You're on page 3
4
5
6
7
Next

Related Searches

c++

dev-c++

dev c++ 5.11

dev-c

dev c++

sillytavern

unity 3d minecraft

far2l

video ai

c# source code

Related Categories

Software Development

Artificial Intelligence

System

Games

Scientific/Engineering

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2025 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

×

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: