Large Language Model Text Generation Inference
Library for OCR-related tasks powered by Deep Learning
Ready-to-use OCR with 80+ supported languages
Tensor search for humans
Low-latency REST API for serving text-embeddings
Efficient few-shot learning with Sentence Transformers
Phi-3.5 for Mac: Locally-run Vision and Language Models
LLM training code for MosaicML foundation models
Easy-to-use Speech Toolkit including Self-Supervised Learning model
State-of-the-art diffusion models for image and audio generation
MII makes low-latency and high-throughput inference possible
Framework that is dedicated to making neural data processing
A graphical manager for ollama that can manage your LLMs
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Implementation of "Tree of Thoughts"
Training & Implementation of chatbots leveraging GPT-like architecture
CPU/GPU inference server for Hugging Face transformer models