A Pythonic framework to simplify AI service building
Integrate, train, and manage any AI model or API with your database
Port of Facebook's LLaMA model in C/C++
AI interface for tinkerers (Ollama, Haystack RAG, Python)
Superduper: Integrate AI models and machine learning workflows
Open-Source AI Camera. Empower any camera/CCTV
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Phi-3.5 for Mac: Locally-run Vision and Language Models
Simplifies the local serving of AI models from any source
Run Local LLMs on Any Device. Open-source
State-of-the-art diffusion models for image and audio generation
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Operating LLMs in production
Neural Network Compression Framework for enhanced OpenVINO
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Bring the notion of Model-as-a-Service to life
AIMET is a library that provides advanced quantization and compression
Official inference library for Mistral models
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Sparsity-aware deep learning inference runtime for CPUs
Build your chatbot within minutes on your favorite device
Tensor search for humans
Replace OpenAI GPT with another LLM in your app
MII makes low-latency and high-throughput inference possible
LLM training code for MosaicML foundation models