Port of OpenAI's Whisper model in C/C++
Run Local LLMs on Any Device. Open-source
Training and deploying machine learning models on Amazon SageMaker
Port of Facebook's LLaMA model in C/C++
C++ implementation of ChatGLM-6B, ChatGLM2-6B, ChatGLM3, and GLM4(V)
A high-throughput and memory-efficient inference and serving engine
The official Python client for the Hugging Face Hub
Open-Source AI Camera. Empower any camera/CCTV
A set of Docker images for training and serving models in TensorFlow
Lightweight inference library for ONNX files, written in C++
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Bring the notion of Model-as-a-Service to life
Everything you need to build state-of-the-art foundation models
State-of-the-art diffusion models for image and audio generation
Create HTML profiling reports from pandas DataFrame objects
A Pythonic framework to simplify AI service building
Optimizing inference proxy for LLMs
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Large Language Model Text Generation Inference
Data manipulation and transformation for audio signal processing
Easy-to-use deep learning framework with 3 key features
Deep learning optimization library: makes distributed training easy
Fast inference engine for Transformer models
Operating LLMs in production
A unified framework for scalable computing