The llama.cpp project enables inference of Meta's LLaMA models (and many other model architectures) in plain C/C++, with no Python runtime required. It is designed for fast, efficient model execution and straightforward integration into applications that need LLM capabilities, providing a highly optimized and portable implementation for running large language models directly in C/C++ environments.
Features
- Pure C/C++ implementation for efficient LLM inference.
- Supports LLaMA models and other variants.
- Optimized for performance and portability.
- No dependency on Python, ensuring a lightweight deployment.
- Provides easy integration into C/C++-based applications.
- Scales across hardware, from CPU-only machines to GPU-accelerated servers, for large-model execution.
- Open-source, under the MIT license.
- Lightweight setup with minimal requirements.
- Active development and community contributions.
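The features above can be tried out directly from a source checkout. The following is a minimal sketch of a typical CMake build and a first inference run; the repository URL, the `llama-cli` binary name, and its flags reflect recent releases and may differ in older versions, and the model path is a placeholder for a GGUF file you supply yourself.

```shell
# Clone the repository and build in Release mode for optimized inference.
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
cmake -B build
cmake --build build --config Release

# Run the command-line client against a local GGUF model.
# -m: model path (placeholder), -p: prompt, -n: number of tokens to generate.
./build/bin/llama-cli -m ./models/model.gguf -p "Hello, world" -n 64
```

Because the project has no Python dependency, this build produces self-contained native binaries that can be deployed as-is.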
License
MIT License