Training and deploying machine learning models on Amazon SageMaker
Run Local LLMs on Any Device. Open-source
Port of Facebook's LLaMA model in C/C++
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
A high-throughput and memory-efficient inference and serving engine
The official Python client for the Huggingface Hub
Open-Source AI Camera. Empower any camera/CCTV
A set of Docker images for training and serving models in TensorFlow
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Everything you need to build state-of-the-art foundation models
State-of-the-art diffusion models for image and audio generation
Create HTML profiling reports from pandas DataFrame objects
Optimizing inference proxy for LLMs
A Pythonic framework to simplify AI service building
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Large Language Model Text Generation Inference
Data manipulation and transformation for audio signal processing
Easy-to-use deep learning framework with 3 key features
Deep learning optimization library: makes distributed training easy
Fast inference engine for Transformer models
A general-purpose probabilistic programming system
Powering Amazon custom machine learning chips
An easy-to-use LLMs quantization package with user-friendly apis
A unified framework for scalable computing
Operating LLMs in production