The AI-powered coding wizard
Talk to Your AI Agents from Anywhere
On-device Speech-to-Intent engine powered by deep learning
UI-TARS-desktop version that can operate on your local personal device
A python tool that uses GPT-4, FFmpeg, and OpenCV
Implementation of "MobileCLIP" CVPR 2024
Python Crypto Bot (PyCryptoBot)
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Adds support for Yandex Smart Home (Alice voice assistant)
Real-World Centric Foundation GUI Agents
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
A toolkit to optimize ML models for deployment for Keras & TensorFlow
A neural network that transforms a design mock-up into static websites
InvokeAI is a leading creative engine for Stable Diffusion models
Data science on data without acquiring a copy
Build Vision Agents quickly with any model or video provider
The most powerful Android RPA agent framework
Ultra-Efficient LLMs on End Device
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
Framework for building neural networks
Framework for building AI-powered interactive digital humans and agent
Open Source Computer Vision Library
Python 3 package for easy bypass reCAPTCHA/reCAPTCHA Mobile/hCaptcha
Run GGUF models easily with a UI or API. One File. Zero Install.
minimal suckless android web browser with unlimited power