Large Language Model Text Generation Inference
The free, Open Source alternative to OpenAI, Claude and others
Unofficial (Golang) Go bindings for the Hugging Face Inference API
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
OpenAI swift async text to image for SwiftUI app using OpenAI
Tensor search for humans
A RWKV management and startup tool, full automation, only 8MB
Low-latency REST API for serving text-embeddings
Phi-3.5 for Mac: Locally-run Vision and Language Models
Efficient few-shot learning with Sentence Transformers
Private Open AI on Kubernetes
State-of-the-art diffusion models for image and audio generation
LLM training code for MosaicML foundation models
MII makes low-latency and high-throughput inference possible
Deep Learning API and Server in C++14 support for Caffe, PyTorch
A graphical manager for ollama that can manage your LLMs
Framework that is dedicated to making neural data processing
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Implementation of "Tree of Thoughts
The deep learning toolkit for speech-to-text
Training & Implementation of chatbots leveraging GPT-like architecture
CPU/GPU inference server for Hugging Face transformer models