A Pythonic framework to simplify AI service building
AI interface for tinkerers (Ollama, Haystack RAG, Python)
Port of Facebook's LLaMA model in C/C++
Superduper: Integrate AI models and machine learning workflows
Integrate, train and manage any AI models and APIs with your database
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Open-Source AI Camera. Empower any camera/CCTV
Phi-3.5 for Mac: Locally-run Vision and Language Models
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Database system for building simpler and faster AI-powered application
State-of-the-art diffusion models for image and audio generation
Run Local LLMs on Any Device. Open-source
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Sparsity-aware deep learning inference runtime for CPUs
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Neural Network Compression Framework for enhanced OpenVINO
The unofficial python package that returns response of Google Bard
Operating LLMs in production
Official inference library for Mistral models
Tensor search for humans
20+ high-performance LLMs with recipes to pretrain, finetune at scale
AIMET is a library that provides advanced quantization and compression
MII makes low-latency and high-throughput inference possible
Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models