Simplifies the local serving of AI models from any source
Run Local LLMs on Any Device. Open-source
State-of-the-art diffusion models for image and audio generation
Phi-3.5 for Mac: Locally-run Vision and Language Models
AIMET is a library that provides advanced quantization and compression
Official inference library for Mistral models
Sparsity-aware deep learning inference runtime for CPUs
Powering Amazon custom machine learning chips
Operating LLMs in production
20+ high-performance LLMs with recipes to pretrain, finetune at scale
The official Python client for the Huggingface Hub
Replace OpenAI GPT with another LLM in your app
A graphical manager for ollama that can manage your LLMs
Run 100B+ language models at home, BitTorrent-style
Training & Implementation of chatbots leveraging GPT-like architecture