Run a fast ChatGPT-like model locally on your device. This combines the LLaMA foundation model with an open reproduction of Stanford Alpaca a fine-tuning of the base model to obey instructions (akin to the RLHF used to train ChatGPT) and a set of modifications to llama.cpp to add a chat interface. Download the zip file corresponding to your operating system from the latest release. The weights are based on the published fine-tunes from alpaca-lora, converted back into a PyTorch checkpoint with a modified script and then quantized with llama.cpp the regular way.
Features
- Run a fast ChatGPT-like model locally on your device
- If you have more than 10GB of RAM, you can use the higher quality 13B model
- Combines the LLaMA foundation model with an open reproduction of Stanford Alpaca
- The weights are based on the published fine-tunes from alpaca-lora,
- You can add other launch options
- You can now type to the AI in the terminal and it will reply
License
MIT LicenseFollow Alpaca.cpp
Other Useful Business Software
Gen AI apps are built with MongoDB Atlas
MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Rate This Project
Login To Rate This Project
User Reviews
-
Really great for running a ChatGPT-style model on my own device.