A framework to enable multimodal models to operate a computer
Run your own AI cluster at home with everyday devices
Python Crypto Bot (PyCryptoBot)
State-of-the-art TTS model under 25MB
An open phone agent model & framework
Python client for the Telegram's tdlib
A Telegram bot that integrates with OpenAI's official ChatGPT APIs
Datasets, transforms and models specific to Computer Vision
General proxy performance testing tool based on Clash using Telegram
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Enable AI to control your desktop, mobile and HMI devices
Powerful AI language model (MoE) optimized for efficiency/performance
Speech recognition module for Python
A natural language interface for computers
Open-source, high-performance AI model with advanced reasoning
Automate native Android apps with AI using accessibility APIs
On-device Speech-to-Intent engine powered by deep learning
RL research on Android devices
TensorFlow is an open source library for machine learning
MCP integration platforms for AI agents to use tools at any scale
The Pocket Datalab
Building a Secure and Interoperable Future for AI-Driven Payments
AI memory OS for LLM and Agent systems
Interact with your documents using the power of GPT
UI-TARS-desktop version that can operate on your local personal device