Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Easiest and laziest way for building multi-agent LLMs applications
Efficient few-shot learning with Sentence Transformers
Official inference library for Mistral models
A Pythonic framework to simplify AI service building
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Deep learning optimization library: makes distributed training easy
Uplift modeling and causal inference with machine learning algorithms
DoWhy is a Python library for causal inference
Pytorch domain library for recommendation systems
Simplifies the local serving of AI models from any source
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
A high-performance ML model serving framework, offers dynamic batching
Unified Model Serving Framework
Low-latency REST API for serving text-embeddings
Trainable models and NN optimization tools
Probabilistic reasoning and statistical analysis in TensorFlow
Integrate, train and manage any AI models and APIs with your database
State-of-the-art diffusion models for image and audio generation
An MLOps framework to package, deploy, monitor and manage models
PyTorch extensions for fast R&D prototyping and Kaggle farming