Standardized Serverless ML Inference Platform on Kubernetes
Sparsity-aware deep learning inference runtime for CPUs
Probabilistic reasoning and statistical analysis in TensorFlow
State-of-the-art diffusion models for image and audio generation
Uncover insights, surface problems, monitor, and fine-tune your LLM
OpenAI-style API for open large language models
Deep learning optimization library: makes distributed training easy
OpenMLDB is an open-source machine learning database
A Unified Library for Parameter-Efficient Learning
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
A lightweight vision library for performing large object detection
Open-Source AI Camera. Empower any camera/CCTV
Fast inference engine for Transformer models
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Unified Model Serving Framework
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans
Powering Amazon custom machine learning chips
A GPU-accelerated library containing highly optimized building blocks
GPU environment management and cluster orchestration
Easy-to-use deep learning framework with 3 key features
PyTorch library of curated Transformer models and their components
A graphical manager for Ollama that can manage your LLMs
A library to communicate with ChatGPT, Claude, Copilot, Gemini