Fast Stable Diffusion on CPU and AI PC
Fast inference engine for Transformer models
A system monitoring tool that exposes system metrics
A TTS that fits in your CPU (and pocket)
State-of-the-art TTS model under 25MB
Fast and accurate AI-powered file content type detection
LLM inference in C/C++
Port of OpenAI's Whisper model in C/C++
A high-quality rapid TTS voice cloning model
Ultra-Efficient AI Assistant in Go
High-speed Large Language Model Serving for Local Deployment
Real-time NVIDIA GPU dashboard
Python-free Rust inference server
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Running large language models on a single GPU
Multilingual Automatic Speech Recognition with word-level timestamps
A lightweight text-to-speech model with zero-shot voice cloning
Generate audiobooks from e-books
Automatic AI-powered timeline of your daily work activity logs
Easy-to-use deep learning framework with 3 key features
Open platform for training, serving, and evaluating language models
Calculate tokens/s and GPU memory requirements for any LLM
UME is an in-app debug kit platform for Flutter
PyTorch tutorials and fun projects including neural talk