llama-cpp-python.whl free download

llama.cpp

Port of Facebook's LLaMA model in C/C++

The llama.cpp project enables the inference of Meta's LLaMA model (and other models) in pure C/C++ without requiring a Python runtime. It is designed for efficient and fast model execution, offering easy integration for applications needing LLM-based capabilities. The repository focuses on providing a highly optimized and portable implementation for running large language models directly within C/C++ environments.

1 Review

Downloads: 152 This Week

Last Update: 7 hours ago

See Project

DeepSeek R1

Open-source, high-performance AI model with advanced reasoning

...This approach has resulted in performance comparable to leading models like OpenAI's o1, while maintaining cost-efficiency. To further support the research community, DeepSeek has released distilled versions of the model based on architectures such as LLaMA and Qwen.

1 Review

Downloads: 61 This Week

Last Update: 2025-07-09

See Project

CSM (Conversational Speech Model)

A Conversational Speech Generation Model

The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.

Downloads: 4 This Week

Last Update: 2025-03-19

See Project

Llama-3.2-1B

Llama 3.2–1B: Multilingual, instruction-tuned model for mobile AI

meta-llama/Llama-3.2-1B is a lightweight, instruction-tuned generative language model developed by Meta, optimized for multilingual dialogue, summarization, and retrieval tasks. With 1.23 billion parameters, it offers strong performance in constrained environments like mobile devices, without sacrificing versatility or multilingual support.

Downloads: 0 This Week

Last Update: 2025-07-02

See Project

Llama-3.2-1B-Instruct

Instruction-tuned 1.2B LLM for multilingual text generation by Meta

Llama-3.2-1B-Instruct is Meta’s multilingual, instruction-tuned large language model with 1.24 billion parameters, optimized for dialogue, summarization, and retrieval tasks. It builds upon the Llama 3.1 architecture and incorporates fine-tuning techniques like SFT, DPO, and quantization-aware training for improved alignment, efficiency, and safety.

Downloads: 0 This Week

Last Update: 2025-07-02

See Project

Hermes 4

Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research

Hermes 4 405B FP8 is a cutting-edge large language model developed by Nous Research, built on Llama-3.1-405B and optimized for frontier reasoning and alignment. It introduces a hybrid reasoning mode with explicit <think> segments, enabling the model to deliberate deeply when needed and switch to faster responses when desired. Post-training improvements include a vastly expanded corpus with ~60B tokens, boosting performance across math, code, STEM, logic, creativity, and structured outputs. ...

Downloads: 0 This Week

Last Update: 2025-09-01

See Project

Mellum-4b-base

JetBrains’ 4B parameter code model for completions

Mellum-4b-base is JetBrains’ first open-source large language model designed and optimized for code-related tasks. Built with 4 billion parameters and a LLaMA-style architecture, it was trained on over 4.2 trillion tokens across multiple programming languages, including datasets such as The Stack, StarCoder, and CommitPack. With a context window of 8,192 tokens, it excels at code completion, fill-in-the-middle tasks, and intelligent code suggestions for professional developer tools and IDEs. ...

Downloads: 0 This Week

Last Update: 2025-09-11

See Project

OpenVLA 7B

Vision-language-action model for robot control via images and text

...It takes camera images and natural language instructions as input and outputs normalized 7-DoF robot actions, enabling control of multiple robot types across various domains. Built on top of LLaMA-2 and DINOv2/SigLIP visual backbones, it allows both zero-shot inference for known robot setups and parameter-efficient fine-tuning for new domains. The model supports real-world robotics tasks, with robust generalization to environments seen in pretraining. Its actions include delta values for position, orientation, and gripper status, and can be un-normalized based on robot-specific statistics. ...

Downloads: 0 This Week

Last Update: 2025-07-23

See Project

Search Results for "llama-cpp-python.whl"

8 projects for "llama-cpp-python.whl" with 2 filters applied:

llama.cpp

DeepSeek R1

CSM (Conversational Speech Model)

Llama-3.2-1B

Llama-3.2-1B-Instruct

Hermes 4

Mellum-4b-base

OpenVLA 7B

Search Results for "llama-cpp-python.whl"

8 projects for "llama-cpp-python.whl" with 2 filters applied:

llama.cpp

DeepSeek R1

CSM (Conversational Speech Model)

Llama-3.2-1B

Llama-3.2-1B-Instruct

Hermes 4

Mellum-4b-base

OpenVLA 7B

Related Searches

Related Categories