Large Language Model Text Generation Inference
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Efficient few-shot learning with Sentence Transformers
State-of-the-art diffusion models for image and audio generation
Phi-3.5 for Mac: Locally-run Vision and Language Models
Framework that is dedicated to making neural data processing
MII makes low-latency and high-throughput inference possible
LLM training code for MosaicML foundation models
Tensor search for humans
Low-latency REST API for serving text-embeddings
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Implementation of "Tree of Thoughts
Training & Implementation of chatbots leveraging GPT-like architecture
CPU/GPU inference server for Hugging Face transformer models