Port of OpenAI's Whisper model in C/C++
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
A Pythonic framework to simplify AI service building
Library for serving Transformers models on Amazon SageMaker
Bring the notion of Model-as-a-Service to life
Easiest and laziest way for building multi-agent LLMs applications
Open-Source AI Camera. Empower any camera/CCTV
State-of-the-art diffusion models for image and audio generation
LLM training code for MosaicML foundation models
A general-purpose probabilistic programming system
A high-performance ML model serving framework, offers dynamic batching
Easy-to-use Speech Toolkit including Self-Supervised Learning model
GPU environment management and cluster orchestration
Protect and discover secrets using Gitleaks
Large Language Model Text Generation Inference
A unified framework for scalable computing
Fast inference engine for Transformer models
Openai style api for open large language models
Powering Amazon custom machine learning chips
Open-source tool designed to enhance the efficiency of workloads
A library for accelerating Transformer models on NVIDIA GPUs
Lightweight Python library for adding real-time multi-object tracking
PyTorch library of curated Transformer models and their components
Low-latency REST API for serving text-embeddings