LLM training code for MosaicML foundation models
Library for serving Transformers models on Amazon SageMaker
Bring the notion of Model-as-a-Service to life
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Replace OpenAI GPT with another LLM in your app
GPU environment management and cluster orchestration
Low-latency REST API for serving text-embeddings
Integrate, train and manage any AI models and APIs with your database
A graphical manager for ollama that can manage your LLMs
Run 100B+ language models at home, BitTorrent-style
Implementation of "Tree of Thoughts
Toolbox of models, callbacks, and datasets for AI/ML researchers
Lightweight anchor-free object detection model
Toolkit for allowing inference and serving with MXNet in SageMaker
Deploy a ML inference service on a budget in 10 lines of code