Run Local LLMs on Any Device. Open-source
Ready-to-use OCR with 80+ supported languages
A Pythonic framework to simplify AI service building
State-of-the-art diffusion models for image and audio generation
LLM training code for MosaicML foundation models
Bring the notion of Model-as-a-Service to life
Library for OCR-related tasks powered by Deep Learning
A library for accelerating Transformer models on NVIDIA GPUs
Replace OpenAI GPT with another LLM in your app
A unified framework for scalable computing
Powering Amazon custom machine learning chips
Unified Model Serving Framework
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
A high-performance ML model serving framework, offers dynamic batching
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Library for serving Transformers models on Amazon SageMaker
The Triton Inference Server provides an optimized cloud
Lightweight Python library for adding real-time multi-object tracking
GPU environment management and cluster orchestration
A graphical manager for ollama that can manage your LLMs
PyTorch library of curated Transformer models and their components
The unofficial python package that returns response of Google Bard
Open platform for training, serving, and evaluating language models
High quality, fast, modular reference implementation of SSD in PyTorch
Database system for building simpler and faster AI-powered application