ocr application python free download

AI Runner

Offline inference engine for art, real-time voice conversations

AI Runner is an offline inference engine designed to run a collection of AI workloads on your own machine, including image generation for art, real-time voice conversations, LLM-powered chatbots and automated workflows. It is implemented as a desktop-oriented Python application and emphasizes privacy and self-hosting, allowing users to work with text-to-speech, speech-to-text, text-to-image and multimodal models without sending data to external services. At the core of its LLM stack is a mode-based architecture with specialized “modes” such as Author, Code, Research, QA and General, and a workflow manager that automatically routes user requests to the right agent based on the task. ...

Downloads: 2 This Week

Last Update: 2025-12-11

See Project

FastRTC

The python library for real-time communication

FastRTC is a Python library designed to simplify real-time communication (RTC), especially for audio and video streaming applications. It abstracts away much of the complexity that typically comes with implementing WebRTC by providing a simple interface — e.g. a Stream class — that can be mounted within a web backend (for example a FastAPI application).

Downloads: 0 This Week

Last Update: 2025-11-28

See Project

OuteTTS

Interface for OuteTTS models

OuteTTS is an interface library for running OuteTTS text-to-speech models across a range of backends, making it easier to deploy the same model on different hardware and runtimes. It provides a high-level Interface API that wraps model configuration, speaker handling, and audio generation so you can focus on integrating speech into your application rather than wiring up low-level engines. The project supports multiple backends including llama.cpp (Python bindings and server), Hugging Face Transformers, ExLlamaV2, VLLM and a JavaScript interface via Transformers.js, allowing it to run on CPUs, NVIDIA CUDA GPUs, AMD ROCm, Vulkan-capable GPUs, and Apple Metal. It also includes a notion of speaker profiles: you can create a speaker from a short audio sample, save it as JSON, and reuse it for consistent voice identity across generations and sessions. ...

Downloads: 0 This Week

Last Update: 2025-11-28

See Project

AarTon

AarTon is an automated text-to-speech application. It allows user to enter text in a web-based front-end and render these texts via a multi-channel sound card.

Downloads: 0 This Week

Last Update: 2013-11-14

See Project

voicecommand

Run Bash commands using Google voice recognition

This simple pygtk application uses ffmpeg and arecord; to record sound Google's unofficial text to speech service; to convert sound to text The python subprocess module to run the text as a shell command. The text to speech service used by this application is unofficial, and this program should therefore be considered a complete hack.

Downloads: 0 This Week

Last Update: 2013-05-30

See Project

Search Results for "ocr application python"

Showing 5 open source projects for "ocr application python"

AI Runner

FastRTC

OuteTTS

AarTon

voicecommand

Search Results for "ocr application python"

Showing 5 open source projects for "ocr application python"

AI Runner

FastRTC

OuteTTS

AarTon

voicecommand

Related Searches

Related Categories