Run Local LLMs on Any Device. Open-source
Open-source tool designed to enhance the efficiency of workloads
Official inference library for Mistral models
Powering Amazon custom machine learning chips
Simplifies the local serving of AI models from any source
AIMET is a library that provides advanced quantization and compression
State-of-the-art diffusion models for image and audio generation
A set of Docker images for training and serving models in TensorFlow
The official Python client for the Hugging Face Hub
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Phi-3.5 for Mac: Locally-run Vision and Language Models
Operating LLMs in production
Sparsity-aware deep learning inference runtime for CPUs
A high-performance ML model serving framework that offers dynamic batching
Replace OpenAI GPT with another LLM in your app
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
A graphical manager for Ollama that can manage your LLMs
Visual Instruction Tuning: Large Language-and-Vision Assistant
Open platform for training, serving, and evaluating language models
OpenMMLab Model Deployment Framework
A computer vision framework to create and deploy apps in minutes
Run any Llama 2 locally with Gradio UI on GPU or CPU from anywhere
Run 100B+ language models at home, BitTorrent-style
Implementation of "Tree of Thoughts"
Sequence-to-sequence framework, focused on Neural Machine Translation