Phi-3.5 for Mac: Locally-run Vision and Language Models
Operating LLMs in production
Trainable models and NN optimization tools
Unified Model Serving Framework
GPU environment management and cluster orchestration
Neural Network Compression Framework for enhanced OpenVINO
State-of-the-art diffusion models for image and audio generation
A lightweight vision library for performing large object detection
The official Python client for the Huggingface Hub
LLM training code for MosaicML foundation models
AIMET is a library that provides advanced quantization and compression
Integrate, train and manage any AI models and APIs with your database
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
PyTorch library of curated Transformer models and their components
MII makes low-latency and high-throughput inference possible
Tensor search for humans
A library for accelerating Transformer models on NVIDIA GPUs
Uncover insights, surface problems, monitor, and fine tune your LLM
AI interface for tinkerers (Ollama, Haystack RAG, Python)
PyTorch extensions for fast R&D prototyping and Kaggle farming
Superduper: Integrate AI models and machine learning workflows
Standardized Serverless ML Inference Platform on Kubernetes
A high-performance ML model serving framework, offers dynamic batching