speech to text in java free download

Handy STT

A free, open source, and extensible speech-to-text application

Handy is a free, open-source, offline speech-to-text application built for privacy, accessibility, and extensibility. Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active text field. ...

Downloads: 33 This Week

Last Update: 1 day ago

See Project

Polyglot

Cross-platform AI language practice app

Polyglot is a cross platform AI language practice application that runs as a desktop app and also offers a web version. It is built around conversational large language models and Azure based text to speech services, turning them into an interactive environment for speaking practice in multiple languages. Users can define custom AI personas, choose languages, and configure their own OpenAI and Azure keys so they retain control over which backends they use. The app supports speech recognition with quick keyboard shortcuts, allowing learners to hold down a key to speak and release it to submit for recognition and response. ...

Downloads: 3 This Week

Last Update: 2 days ago

See Project

EasyVoice

Open source text-to-speech tool, supports extra-long text

easyVoice is an open-source text-to-speech platform aimed at turning long-form text and novels into high-quality audio, with a strong focus on usability and scalability. It provides a web interface where users can paste or upload large texts and generate speech and subtitles in a single workflow, even for works exceeding 100,000 characters. The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure parameters such as rate, pitch, and volume per role. ...

Downloads: 1 This Week

Last Update: 2 days ago

See Project

BrowserAI

Run local LLMs like llama, deepseek, kokoro etc. inside your browser

...The platform provides a developer-friendly SDK with pre-configured popular models, and it allows for seamless switching between MLC and Transformer engines. Additionally, it supports features such as speech recognition, text-to-speech, structured output generation, and Web Worker support for non-blocking UI performance.

Downloads: 1 This Week

Last Update: 2025-05-21

See Project

TTS WebUI

A single Gradio + React WebUI with extensions for ACE-Step

TTS-WebUI is a unified Gradio + React web interface that brings together a large ecosystem of text-to-speech, voice conversion, and audio generation models under a single UI. It supports a wide range of models such as Bark, MusicGen, Tortoise, RVC, StyleTTS2, ParlerTTS, CosyVoice, XTTSv2, Stable Audio, SeamlessM4T, and many others, exposing them as interchangeable backends for speech and music synthesis. The project provides an installer that sets up Conda, Python environments, and all necessary dependencies, so users can focus on experimenting with voices instead of managing tooling. ...

Downloads: 3 This Week

Last Update: 2 days ago

See Project

Readest

Readest is a modern, feature-rich ebook reader

Readest is a project meant to facilitate reading, studying, or consuming content by integrating reading tools with AI-powered assistance. Although the repository is not as widely documented or popular as some, the idea is that Readest supports features to help with reading comprehension — likely combining OCR / text retrieval, translation, note-taking, or summarization for reading materials (eBooks, articles, PDFs). The goal appears to be to let users feed in arbitrary reading material and...

Downloads: 3 This Week

Last Update: 2 days ago

See Project

Amica

Amica is an open source interface for interactive communication

...Under the hood, Amica leverages modern web and desktop technologies: three.js and three-vrm for 3D rendering, Transformers.js for running models in the browser, Whisper and Silero VAD for speech recognition and voice-activity detection, and a variety of LLM backends such as llama.cpp servers, ChatGPT-compatible APIs, Ollama, KoboldCpp, and others. It also integrates multiple text-to-speech providers, including ElevenLabs, OpenAI, Coqui, RVC, and AllTalkTTS.

Downloads: 0 This Week

Last Update: 15 hours ago

See Project

Ito

Ito, smart dictation in every application

ito is an open‑source JavaScript library for serverless, browser‑to‑browser communication designed for use on devices with or without user input interfaces, such as IoT devices, mobile devices, tablets, and desktops, enabling peer messaging and data sharing via short passcodes and cloud‑backed pairing without an application server.

Downloads: 0 This Week

Last Update: 6 days ago

See Project

Mastra

The TypeScript AI agent framework

Mastra is a TypeScript-first framework for building AI-powered applications and agents, designed to take projects from prototype to production on a modern JavaScript/TypeScript stack. It integrates cleanly with React, Next.js, and Node-based backends, but can also run as a standalone server, giving teams flexibility in how they deploy their AI logic. At its core, Mastra provides abstractions for agents, workflows, tools, memory, retrieval, and model routing, so developers can focus on...

Downloads: 1 This Week

Last Update: 2 days ago

See Project

OpenAI Translator

Browser extension and cross-platform desktop app based on ChatGPT API

...Our tool allows for mutual translation, polishing and summarization across 55 different languages. Streaming mode is supported! It allows users to customize their translation text. One-click copying, Text-to-Speech (TTS). Available on all platforms (Windows, macOS, and Linux) for both browsers and Desktop.

Downloads: 9 This Week

Last Update: 4 days ago

See Project

Brokk

Brokk brings code intelligence to AI

Brokk is a code intelligence assistant framework designed to let large language models (LLMs) understand code semantically (not just as raw text) so that they can work effectively on large codebases that don’t fit wholly in a prompt context. It helps bridge the gap between LLMs and real-world engineering code by offering tooling to index, analyze, query, and augment code context, so that AI can meaningfully reason about existing code, suggest edits, and navigate across projects. Modular...

Downloads: 5 This Week

Last Update: 2025-11-23

See Project

TTS-Vue

Microsoft speech synthesis tool, built with Electron

TTS-Vue is a desktop text-to-speech application built with Electron, Vue, ElementPlus, and Vite, focused on using Microsoft’s official Speech API for high-quality neural synthesis. It wraps the Microsoft TTS WebSocket interface in a clean UI so users can paste or load text, choose voices, tweak parameters, and export audio without touching raw API calls. The app supports SSML (Speech Synthesis Markup Language), letting power users specify fine-grained control over pronunciation, pauses, prosody, and emphasis using XML-like markup. ...

Downloads: 2 This Week

Last Update: 2 days ago

See Project

Search Results for "speech to text in java"

Showing 12 open source projects for "speech to text in java"

Handy STT

Polyglot

EasyVoice

BrowserAI

TTS WebUI

Readest

Amica

Ito

Mastra

OpenAI Translator

Brokk

TTS-Vue

Search Results for "speech to text in java"

Showing 12 open source projects for "speech to text in java"

Handy STT

Polyglot

EasyVoice

BrowserAI

TTS WebUI

Readest

Amica

Ito

Mastra

OpenAI Translator

Brokk

TTS-Vue

Related Searches

Related Categories