Showing 37 open source projects for "android ocr apps"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Android Use

    Android Use

    Automate native Android apps with AI using accessibility APIs

    android-action-kernel is an open source Python library designed to let AI agents control and automate native Android applications running on real devices or emulators. It fills a gap in automation tooling by focusing on mobile-first workflows where traditional browser or desktop-based automation doesn’t work; such as logistics, gig work, field operations, and other industries reliant on phones or tablets. The project works by using Android’s accessibility API to extract structured UI state...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    AndroidEnv

    AndroidEnv

    RL research on Android devices

    android_env is a reinforcement learning (RL) environment developed by Google DeepMind that enables agents to interact with Android applications directly as a learning environment. It provides a standardized API for training agents to perform tasks on Android apps, supporting tasks ranging from games to productivity apps, making it suitable for research in real-world RL settings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Fastbot-Android Open Source Handbook

    Fastbot-Android Open Source Handbook

    Testing tool for modeling GUI transitions

    Fastbot_Android (Fastbot 2.0) is a model-based automated testing tool by ByteDance designed to discover stability or usability issues in Android apps by modeling GUI transitions rather than relying purely on random interactions. It blends machine learning and reinforcement-learning approaches to build a transition graph of UI states and use that model to intelligently explore possible user interactions — aiming to replicate more human-like usage patterns and uncover hidden bugs, crashes, or edge cases. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Chatbox

    Chatbox

    The Ultimate AI Copilot on Your Desktop

    Chatbox is a cross-platform desktop AI client designed to give you a fast, polished, and private way to work with modern language models. It runs locally on Windows, macOS, and Linux, keeping your conversations and data stored on your own device. Chatbox acts as a unified interface for popular LLMs like ChatGPT, Claude, Gemini, and local models via Ollama, making it easy to switch providers without changing tools. Built with an ergonomic UI, it’s optimized for long sessions, prompt...
    Downloads: 41 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    shadowhook

    shadowhook

    Android inline hook library which supports thumb, arm32 and arm64

    shadowhook is an open-source native code hooking library for Android — designed to let developers intercept and override native (C/C++) functions inside Android apps at runtime. It supports both ARM32 and ARM64 architectures (and the respective “thumb” instruction set) and works across a wide range of Android OS versions. The library allows you to specify hook targets either by function address or by library name + function name, and it automatically handles newly loaded shared libraries (ELFs), ensuring hooks remain effective even when code is dynamically loaded at runtime. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    Open-AutoGLM

    Open-AutoGLM

    An open phone agent model & framework

    Open-AutoGLM is an open-source framework and model designed to empower autonomous mobile intelligent assistants by enabling AI agents to understand and interact with phone screens in a multimodal manner, blending vision and language capability to control real devices. It aims to create an “AI phone agent” that can perceive on-screen content, reason about user goals, and execute sequences of taps, swipes, and text input via automated device control interfaces like ADB, enabling hands-off...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 7
    OpenBot

    OpenBot

    OpenBot leverages smartphones as brains for low-cost robots

    OpenBot leverages smartphones as brains for low-cost robots. We have designed a small electric vehicle that costs about $50 and serves as a robot body. Our software stack for Android smartphones supports advanced robotics workloads such as person following and real-time autonomous navigation. Current robots are either expensive or make significant compromises on sensory richness, computational power, and communication capabilities. We propose to leverage smartphones to equip robots with...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 8
    Google AI Edge Gallery

    Google AI Edge Gallery

    A gallery that showcases on-device ML/GenAI use cases

    Gallery is a curated collection of on-device machine learning examples, demo apps, and model artifacts designed to help developers experiment with and deploy ML at the edge. The project bundles runnable samples that show how to run TensorFlow Lite/Edge TPU models (and similar lightweight runtimes) on mobile and embedded platforms, demonstrating common tasks like image classification, object detection, audio recognition, and pose estimation. Each sample is intended to be both a learning aid...
    Downloads: 173 This Week
    Last Update:
    See Project
  • 9
    CodeLocator

    CodeLocator

    Android toolset that includes the Android SDK and Android Studio

    CodeLocator is a developer tool designed to help analyze, navigate, and understand large Android codebases — especially useful when working with obfuscated code, many dependencies, or when you don’t have full context (e.g. in large apps or reverse-engineering contexts). It provides capabilities to quickly locate references, method calls, resource usages, and dependency paths across the project, which can dramatically speed up understanding code flow, tracing bugs, or auditing code. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    MiniCPM-o

    MiniCPM-o

    A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming

    MiniCPM-o 2.6 is a cutting-edge multimodal large language model (MLLM) designed for high-performance tasks across vision, speech, and video. Capable of running on end-side devices such as smartphones and tablets, it provides powerful features like real-time speech conversation, video understanding, and multimodal live streaming. With 8 billion parameters, MiniCPM-o 2.6 surpasses its predecessors in versatility and efficiency, making it one of the most robust models available. It supports...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    MNN

    MNN

    MNN is a blazing fast, lightweight deep learning framework

    ...MNN Workbench could be downloaded from MNN's homepage, which provides pretrained models, visualized training tools, and one-click deployment of models to devices. Android platform, core so size is about 400KB, OpenCL so is about 400KB, Vulkan so is about 400KB. Supports hybrid computing on multiple devices. Currently supports CPU and GPU.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 12
    ncnn

    ncnn

    High-performance neural network inference framework for mobile

    ncnn is a high-performance neural network inference computing framework designed specifically for mobile platforms. It brings artificial intelligence right at your fingertips with no third-party dependencies, and speeds faster than all other known open source frameworks for mobile phone cpu. ncnn allows developers to easily deploy deep learning algorithm models to the mobile platform and create intelligent APPs. It is cross-platform and supports most commonly used CNN networks, including...
    Downloads: 57 This Week
    Last Update:
    See Project
  • 13
    AiToEarn

    AiToEarn

    Let's use AI to Earn

    AiToEarn is an open-source, AI-powered platform designed to help creators, brands, and businesses automate the entire content marketing lifecycle, from ideation and production to distribution and monetization. It aims to be a unified solution where users can generate content, tailor it for multiple platforms, and publish it across social networks with minimal manual effort. The project supports matrix publishing to major global platforms like TikTok, YouTube, Instagram, Facebook, Pinterest,...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 14
    Omi

    Omi

    AI that sees your screen and listens to conversations

    The Omi project is an open-source AI wearable ecosystem developed by Based Hardware that combines hardware, software, and cloud infrastructure to create a persistent “second brain” for capturing and processing real-world interactions. It is designed as a system that continuously listens to conversations and monitors screen activity, converting this input into structured data such as transcripts, summaries, and actionable insights in real time. The platform operates across multiple...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    CamScanner

    CamScanner

    Scanner

    OCR Scanner
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Amical

    Amical

    Open Source AI Dictation App

    Amical is an open source, AI-powered desktop dictation and note-taking application that enables users to dictate hands-free, transcribe meetings, and capture notes effortlessly with unmatched speed, accuracy, and privacy. It leverages both local and cloud-based AI models, letting users seamlessly switch between providers for the ideal balance of speed, precision, and control, and understands the context of each app in use to automatically format text in a tone and style appropriate to the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    bitfarm-Archiv Document Management - DMS
    bitfarm-Archiv is a powerful Document Management (DMS), Enterprise Content Management (ECM) and Knowledge Management System (KMS) with Workflow Components. Help us! As we live in the internet age, the best thing, you can help, is to write a short statement about your scenario and your use of the DMS, along with your experiences and put it on your own website or in a blog or forum. It would help us best, if you can also add a hyperlink to our site http://www.bitfarm-archiv.com. By this...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 18

    uweb browser: unlimited power

    minimal suckless android web browser with unlimited power

    - AI bot as search engine; append file content as input for complex query. - Powerful: html5 enhancement; any urls to host a website; javascript and shell scripting for general processing; and more with Termux. - Customizable: user-defined menus, (new) buttons and gestures for user agents, bookmarklets, url services, shell commands, internal functionality links and text processing etc. - Convenient: book/dictionary/txt/command line/app can be search engine. - Tiny: less than 200k -...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 19
    Poker Bot AI

    Poker Bot AI

    Artificial Intelligence Poker Bot for popular apps on Android

    Official site: https://pokerbotai.com/ Poker bot guides: https://pokerbotai.com/knowledge-hub/ Poker Bot AI+ is advanced poker bot software designed for research, strategy training, and decision simulations. The bot continuously analyzes table state in real time, acting as both an AI poker assistant and a full autopilot poker bot depending on your configuration. You can receive live hints from the AI (like RTA‑style advice) or choose automated play under your defined...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    OCR Manga Reader for Android

    OCR Manga Reader for Android

    Android Manga reader with Japanese OCR and dictionary capabilities

    OCR Manga Reader is a free and open source Android app that allows you to quickly OCR and lookup Japanese words in real-time. It does not have ads or telemetry/spyware and does not require an Internet connection. Supports both EDICT and EPWING dictionaries. Requires Android 4.0 (Ice Cream Sandwich) or higher. See http://ocrmangareaderforandroid.sourceforge.net/ for details.
    Leader badge
    Downloads: 29 This Week
    Last Update:
    See Project
  • 21
    pipeless

    pipeless

    A computer vision framework to create and deploy apps in minutes

    Pipeless is an open-source computer vision framework to create and deploy applications without the complexity of building and maintaining multimedia pipelines. It ships everything you need to create and deploy efficient computer vision applications that work in real-time in just minutes. Pipeless is inspired by modern serverless technologies. It provides the development experience of serverless frameworks applied to computer vision. You provide some functions that are executed for new...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    TTS Server

    TTS Server

    Android system TTS application with Microsoft demo interface

    tts-server-android is an Android system TTS application that acts both as a powerful local text-to-speech engine and as a flexible TTS “server” for other apps via HTTP. It includes a built-in Microsoft TTS demo interface and lets users configure custom HTTP requests, making it possible to route TTS through various cloud providers or local servers.
    Downloads: 31 This Week
    Last Update:
    See Project
  • 23
    wechat-chatgpt

    wechat-chatgpt

    Use ChatGPT On Wechat via wechaty

    Use ChatGPT On Wechat via wechaty Interact with WeChat and ChatGPT: Use ChatGPT on WeChat with wechaty and Official API Add conversation support Support command setting Deployment and configuration options: Add Dockerfile, deployable with docker Support deployment using docker compose Support Railway and Fly.io deployment Other features: Support Dall·E Support whisper Support setting prompt Support proxy (in development)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Alpaca.cpp

    Alpaca.cpp

    Locally run an Instruction-Tuned Chat-Style LLM

    Run a fast ChatGPT-like model locally on your device. This combines the LLaMA foundation model with an open reproduction of Stanford Alpaca a fine-tuning of the base model to obey instructions (akin to the RLHF used to train ChatGPT) and a set of modifications to llama.cpp to add a chat interface. Download the zip file corresponding to your operating system from the latest release. The weights are based on the published fine-tunes from alpaca-lora, converted back into a PyTorch checkpoint...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 25
    tgcf

    tgcf

    The ultimate tool to automate custom telegram message forwarding

    The ultimate tool to automate custom telegram message forwarding. Live-syncer, Auto-poster, backup-bot, cloner, chat-forwarder, duplicator, ... Call it whatever you like! tgcf is an advanced telegram chat forwarding automation tool that can fulfill all your custom needs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next