Run any Llama 2 model locally with a Gradio UI on GPU or CPU from anywhere
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Run 100B+ language models at home, BitTorrent-style
Implementation of "Tree of Thoughts"
Toolbox of models, callbacks, and datasets for AI/ML researchers
High-level Deep Learning Framework written in Kotlin
llama.go is like llama.cpp in pure Golang
Lightweight anchor-free object detection model
Implementation of model parallel autoregressive transformers on GPUs
LLM Chatbot Assistant for Openfire server
A graphical manager for Ollama that can manage your LLMs
Sequence-to-sequence framework, focused on Neural Machine Translation
A real-time inference engine for temporal logical specifications
Open Source and Lightweight Local LLM Platform
OpenFieldAI is an AI-based Open Field Test Rodent Tracker
A computer vision framework to create and deploy apps in minutes
OpenMMLab Video Perception Toolbox
The deep learning toolkit for speech-to-text
Training & Implementation of chatbots leveraging GPT-like architecture
Guide to deploying deep-learning inference networks
Toolkit for allowing inference and serving with MXNet in SageMaker
CPU/GPU inference server for Hugging Face transformer models
Deep learning inference framework optimized for mobile platforms
Uniform deep learning inference framework for mobile
Deploy an ML inference service on a budget in 10 lines of code