Training and deploying machine learning models on Amazon SageMaker
Run Local LLMs on Any Device. Open-source
Port of Facebook's LLaMA model in C/C++
Single-cell analysis in Python
Everything you need to build state-of-the-art foundation models
Gaussian processes in TensorFlow
The official Python client for the Huggingface Hub
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
A unified framework for scalable computing
A high-throughput and memory-efficient inference and serving engine
DoWhy is a Python library for causal inference
AI interface for tinkerers (Ollama, Haystack RAG, Python)
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Operating LLMs in production
Optimizing inference proxy for LLMs
Uplift modeling and causal inference with machine learning algorithms
Uncover insights, surface problems, monitor, and fine tune your LLM
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
A Pythonic framework to simplify AI service building
Integrate, train and manage any AI models and APIs with your database
Pytorch domain library for recommendation systems
Bring the notion of Model-as-a-Service to life
Lightweight Python library for adding real-time multi-object tracking
MII makes low-latency and high-throughput inference possible