Fast Stable Diffusion on CPU and AI PC
Fast inference engine for Transformer models
A system monitoring tool that exposes system metrics
A TTS that fits in your CPU (and pocket)
State-of-the-art TTS model under 25MB
Fast and accurate AI-powered file content type detection
Port of OpenAI's Whisper model in C/C++
A high-quality rapid TTS voice cloning model
Faster Whisper transcription with CTranslate2
High-speed Large Language Model Serving for Local Deployment
Ultra-Efficient AI Assistant in Go
Real-time NVIDIA GPU dashboard
Python-free Rust inference server
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Running large language models on a single GPU
A lightweight text-to-speech model with zero-shot voice cloning
Multilingual Automatic Speech Recognition with word-level timestamps
Generate audiobooks from e-books
Easy-to-use deep learning framework with 3 key features
AI-powered PC monitoring that explains what is happening rather than just showing numbers and spikes
Calculate tokens/s & GPU memory requirements for any LLM
UME is an in-app debug kit platform for Flutter
Fast and user-friendly runtime for transformer inference
PyTorch tutorials and fun projects including neural talk