Trainable models and NN optimization tools
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Library for serving Transformers models on Amazon SageMaker
Single-cell analysis in Python
A unified framework for scalable computing
Large Language Model Text Generation Inference
Data manipulation and transformation for audio signal processing
PyTorch library of curated Transformer models and their components
Uplift modeling and causal inference with machine learning algorithms
Superduper: Integrate AI models and machine learning workflows
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Integrate, train and manage any AI models and APIs with your database
Efficient few-shot learning with Sentence Transformers
A Pythonic framework to simplify AI service building
DoWhy is a Python library for causal inference
A high-performance ML model serving framework, offers dynamic batching
LLM training code for MosaicML foundation models
PyTorch extensions for fast R&D prototyping and Kaggle farming
Deep learning optimization library: makes distributed training easy
Libraries for applying sparsification recipes to neural networks
Gaussian processes in TensorFlow
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
Sparsity-aware deep learning inference runtime for CPUs
Pytorch domain library for recommendation systems