llama-cpp-python.whl free download

llama.cpp

Port of Facebook's LLaMA model in C/C++

The llama.cpp project enables the inference of Meta's LLaMA model (and other models) in pure C/C++ without requiring a Python runtime. It is designed for efficient and fast model execution, offering easy integration for applications needing LLM-based capabilities. The repository focuses on providing a highly optimized and portable implementation for running large language models directly within C/C++ environments.

1 Review

Downloads: 153 This Week

Last Update: 12 hours ago

See Project

Ollama

Get up and running with Llama 2 and other large language models

Run, create, and share large language models (LLMs). Get up and running with large language models, locally. Run Llama 2 and other models on macOS. Customize and create your own.

Downloads: 220 This Week

Last Update: 2025-11-19

See Project

llama2.c

Inference Llama 2 in one file of pure C

llama2.c is a minimalist implementation of the Llama 2 language model architecture designed to run entirely in pure C. Created by Andrej Karpathy, this project offers an educational and lightweight framework for performing inference on small Llama 2 models without external dependencies. It provides a full training and inference pipeline: models can be trained in PyTorch and later executed using a concise 700-line C program (run.c).

Downloads: 2 This Week

Last Update: 6 days ago

See Project

AI File Sorter

AI File Sorter uses AI to help you organize your files and folders

...When you're ready, AI FileSorter creates the right folder structure and moves everything into place for you. You can use a remote AI model or download a local one (like Mistral 7B or LLaMa 3B) for faster, private file sorting - your choice.

Downloads: 252 This Week

Last Update: 1 day ago

See Project

Alpaca.cpp

Locally run an Instruction-Tuned Chat-Style LLM

Run a fast ChatGPT-like model locally on your device. This combines the LLaMA foundation model with an open reproduction of Stanford Alpaca a fine-tuning of the base model to obey instructions (akin to the RLHF used to train ChatGPT) and a set of modifications to llama.cpp to add a chat interface. Download the zip file corresponding to your operating system from the latest release. The weights are based on the published fine-tunes from alpaca-lora, converted back into a PyTorch checkpoint with a modified script and then quantized with llama.cpp the regular way.

1 Review

Downloads: 5 This Week

Last Update: 2023-03-24

See Project

Search Results for "llama-cpp-python.whl"

Showing 5 open source projects for "llama-cpp-python.whl"

llama.cpp

Ollama

llama2.c

AI File Sorter

Alpaca.cpp

Search Results for "llama-cpp-python.whl"

Showing 5 open source projects for "llama-cpp-python.whl"

llama.cpp

Ollama

llama2.c

AI File Sorter

Alpaca.cpp

Related Searches

Related Categories