Large Language Model Text Generation Inference
Ready-to-use OCR with 80+ supported languages
Efficient few-shot learning with Sentence Transformers
State-of-the-art diffusion models for image and audio generation
Library for OCR-related tasks powered by Deep Learning
Easy-to-use Speech Toolkit including Self-Supervised Learning model
LLM training code for MosaicML foundation models
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Tensor search for humans
Phi-3.5 for Mac: Locally-run Vision and Language Models
Low-latency REST API for serving text-embeddings
Framework that is dedicated to making neural data processing
MII makes low-latency and high-throughput inference possible
Implementation of "Tree of Thoughts
A graphical manager for ollama that can manage your LLMs
Training & Implementation of chatbots leveraging GPT-like architecture
CPU/GPU inference server for Hugging Face transformer models