Automate native Android apps with AI using accessibility APIs
State-of-the-art TTS model under 25MB
Speech recognition module for Python
Reading book source
A GPT-4o-level MLLM for vision, speech, and multimodal live streaming
Build Vision Agents quickly with any model or video provider
An open phone agent model & framework
An open-source, end-to-end VLM-based GUI agent
Run GGUF models easily with a UI or API. One File. Zero Install.
Ainee - AI Notetaking and Learning Companion
OpenSourceTelegramRAT - Remote PC access via Telegram Bot.
Mycroft Core, the Mycroft Artificial Intelligence platform
IPTV/NVR/CCTV/video cloud: https://fastocloud.com