Port of Facebook's LLaMA model in C/C++
Superduper: Integrate AI models and machine learning workflows
Phi-3.5 for Mac: Locally-run Vision and Language Models
Run Local LLMs on Any Device. Open-source
Official inference library for Mistral models
Implementation of model parallel autoregressive transformers on GPUs