Here is a selection of technology stacks and tool repositories:
Open Source OCR Engine
Local-first AI Notepad for Private Meetings
Port of OpenAI's Whisper model in C/C++
Build your own AI friend
Speech-to-text, text-to-speech, and speaker recognition
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
Audio Plugin for Audio to MIDI transcription using deep learning
Distribute and run LLMs with a single file
Awesome multilingual OCR toolkits based on PaddlePaddle
Structure-from-Motion and Multi-View Stereo
High-performance neural network inference framework for mobile
Stable Diffusion web UI
LLM inference in C/C++
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
ONNX Runtime: cross-platform, high performance ML inferencing
Machine Learning
Agentic browser; privacy-first alternative to ChatGPT Atlas
The most powerful local music generation model
Open Source Computer Vision Library
Offline OCR command-line program for Windows that recognizes text in images
AlphaFold 3 inference pipeline
Google Testing and Mocking Framework
Speech Note Linux app. Note taking, reading and translating