voice browser free download

Showing 54 open source projects for "voice browser"

View related business solutions

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
$300 Free Credits to Build on Google Cloud
New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.

Claim $300 Free
1

clone-voice

A sound cloning tool with a web interface, using your voice

Clone-voice is a local voice-cloning tool that lets you synthesize speech in any target voice or convert one recording into another voice using the same timbre. It is built around Coqui’s XTTS-v2 model, so it inherits multilingual support and modern neural TTS quality while wrapping it in a user-friendly desktop workflow. The app is designed to be very easy to use: you download a precompiled package, double-click app.exe, and it launches a browser-based web interface where you control cloning and synthesis. ...

Downloads: 38 This Week

Last Update: 2025-11-28
See Project
2

TTS Voice Wizard

Speech to Text to Speech, sends text as OSC messages

...The app can translate your speech from one language to over 20 other support languages. There are 100+ different voices with various customization options so you can pick a voice that best suits you. Display the current song you are listening to on Spotify or via your browser. Display tracker and controller battery life in conjunction with XSOverlay. Use in conjunction with HRtoVRChat_OSC to enable you to display your heartrate in VRChat's Chatbox.

Downloads: 14 This Week

Last Update: 2026-05-08
See Project
3

Applio

A simple, high-quality voice conversion tool focused on ease of use

Applio is a high-quality voice conversion toolkit designed to make modern RVC/VITS-based voice cloning accessible to non-experts. It focuses strongly on ease of use: installation scripts for Windows, Linux, and macOS set up dependencies and then launch a browser-based Gradio interface. Within that interface, users can train and run voice conversion models for tasks like singing conversion, speech-to-speech transformation, and voice cloning.

Downloads: 83 This Week

Last Update: 2026-06-28
See Project
4

Happy Coder

Mobile and Web client for Codex and Claude Code, with realtime voice

Happy is an open-source, cross-platform mobile and web client designed to bring powerful AI coding agents such as Claude Code and Codex to your fingertips no matter where you are. At its core, Happy wraps existing AI coding tools with a unified interface, providing real-time voice interactions, encrypted communication, and seamless device switching between desktop and mobile. You can start a coding session locally through the Happy CLI or connect from a phone or browser, allowing developers to inspect, interact with, and guide the AI as it generates, tests, or explains code. The project includes components like a dedicated backend server for encrypted sync, a rich front-end experience across web and native apps, and support for push notifications when your coding agent encounters permission requests or errors. ...

Downloads: 17 This Week

Last Update: 2026-06-23
See Project
Host LLMs in Production With On-Demand GPUs
NVIDIA L4 GPUs. 5-second cold starts. Scale to zero when idle.

Deploy your model, get an endpoint, pay only for compute time. No GPU provisioning or infrastructure management required.

Try Free
5

OpenAI.fm

Code for openai.fm, a demo for the OpenAI Speech API

OpenAI.fm is an official interactive demo application built to showcase the OpenAI Speech API and its advanced text-to-speech capabilities, providing developers and creators with a hands-on web interface to convert text into high-quality, customizable audio using state-of-the-art TTS models. Developed using Next.js and the OpenAI Speech API, this demo illustrates how the latest neural voice models can produce natural, expressive speech with adjustable styles and voices, highlighting features like emotional range, tone, and real-time playback. Users can experiment with different input text and voice options directly in their browser, gaining a sense of how high-fidelity AI audio can be integrated into applications ranging from podcasts and narration to accessibility tools and interactive agents. ...

Downloads: 22 This Week

Last Update: 2026-01-28
See Project
6

OpenAI Realtime Agents

This is a simple demonstration of more advanced, agentic patterns

This repository demonstrates how to build low-latency, streaming “voice + chat” agents using OpenAI’s Realtime API combined with the OpenAI Agents SDK. The demo shows patterns for connecting a realtime voice stream (audio in/out) with agents that can use tools, maintain state, and orchestrate multi-agent workflows. The SDK offers abstractions such as agent orchestration, event handling, handoffs, state management, and guardrails, tailored to support realtime, conversational systems. ...

Downloads: 0 This Week

Last Update: 2026-01-07
See Project
7

Manyfold

A self-hosted digital asset manager for 3d print files

Manyfold is an open-source 3D collaboration platform that reimagines how distributed teams and communities can meet, create, and interact in immersive spatial environments through the web. Instead of forcing users to download native apps or create accounts on closed metaverse services, Manyfold runs entirely in the browser, letting people join 3D spaces with simple links and participate in real time using avatars, voice chat, and object interaction. Users can build or import shared 3D worlds, arrange media, embed content, and design interactive layouts that support presentations, workshops, social events, games, and team gatherings without heavy software installations. ...

Downloads: 8 This Week

Last Update: 2026-07-02
See Project
8

Chatterbox TTS Server

Self-host the powerful Chatterbox TTS model

...Its main value is packaging a powerful TTS workflow into a practical service that can be accessed through a browser or integrated into other software.

Downloads: 3 This Week

Last Update: 2026-06-08
See Project
9

Whisper-WebUI

A Web UI for easy subtitle using whisper model

Whisper WebUI is an open-source browser-based interface that simplifies the use of Whisper speech recognition models by providing an intuitive graphical environment for transcription, translation, and subtitle generation. Built with Gradio, it allows users to upload audio or video files, process them locally, and generate accurate text outputs without relying on command-line tools.

Downloads: 12 This Week

Last Update: 2026-03-18
See Project
99.99% Uptime for MySQL and PostgreSQL Databases
Sub-second maintenance. 2x read/write performance. Built-in vector search for AI apps.

Cloud SQL Enterprise Plus delivers near-zero downtime with 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server.

Try Free
10

edge-tts

Use Microsoft Edge's online text-to-speech service from Python

edge-tts is a Python module and command-line tool that gives you direct access to Microsoft Edge’s online text-to-speech service without needing the Edge browser, Windows, or any API key. It wraps the same cloud voices used by Edge, exposing them through a simple CLI (edge-tts, edge-playback) and a Python API, so you can script high-quality speech generation in your own applications. The tool lets you list available voices, specify locale and voice name, and generate audio files in common formats like MP3 or WAV. ...

Downloads: 35 This Week

Last Update: 2026-03-22
See Project
11

OpenClaw

Your own personal AI assistant. Any OS. Any Platform.

OpenClaw (formerly Clawdbot/Moltbot) is an open-source, self-hosted autonomous AI assistant designed to run on user-controlled hardware and bridge conversational natural language with real-world task execution, effectively acting as a proactive digital assistant rather than a reactive chatbot. It lets you send instructions through familiar messaging platforms like WhatsApp, Telegram, Discord, Slack, Signal, iMessage, and more, and then interprets those instructions to carry out actions such...

1 Review

Downloads: 91 This Week

Last Update: 2026-06-30
See Project
12

ChatTTS webUI & API

A simple native web interface that uses ChatTTS to synthesize text

...From version 0.96 onward, ffmpeg installation is required for deployment, and previous CSV/PT voice tables are no longer valid, so users instead work with updated “voice value” parameters. For convenience, there is a prepackaged Windows build: you download a release archive, extract it, and double-click app.exe to start the web UI, which opens on localhost:9966.

Downloads: 11 This Week

Last Update: 2026-06-14
See Project
13

Telegram Web A

Telegram Web A, GPL v3

...The project achieved recognition (winning first prize in the Telegram Lightweight Client Contest) and serves as the code base behind the official web client available at web.telegram.org/a. The architecture takes advantage of advanced browser capabilities: WebSockets for real-time messaging, Web Workers and WebAssembly for performance-critical tasks, multi-level caching and PWA features for offline or near-offline usability, voice recording and media streaming, raw binary data handling and cryptographic operations. It also handles rich UI/UX elements such as CSS/Canvas/SVG animations, reactive data streams, etc.

Downloads: 11 This Week

Last Update: 2026-05-15
See Project
14

Puter

🌐 The Internet Computer! Free, Open-Source, and Self-Hostable

Puter is an open-source, self-hostable internet computer platform that combines cloud storage, applications, development tools, and computing resources into a single web-based environment. Designed as a complete digital workspace, it enables users to access files, apps, and services from anywhere through a browser-based desktop experience. Puter includes built-in productivity applications while also providing developers with cloud infrastructure such as AI services, databases, storage, and...

Downloads: 10 This Week

Last Update: 6 days ago
See Project
15

Everywhere

Context-aware desktop AI assistant that understands screen content

Everywhere is a context-aware desktop AI assistant designed to interact directly with the content displayed on a user’s screen. It distinguishes itself from traditional AI tools by eliminating the need for manual input methods such as copying text or taking screenshots, instead allowing users to invoke assistance instantly through a shortcut. It can analyze on-screen information in real time and provide contextual responses, making it useful for tasks like troubleshooting errors, summarizing...

Downloads: 0 This Week

Last Update: 2026-06-19
See Project
16

WhisperLive

A nearly-live implementation of OpenAI's Whisper

WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently. It can handle microphone input, pre-recorded audio files, and...

Downloads: 45 This Week

Last Update: 2026-06-02
See Project
17

Abaddon

An alternative Discord client with voice support made with C++ and GTK

Alternative Discord client made in C++ with GTK. Abaddon tries its best (though is not perfect) to make Discord think it's a legitimate web client. Some of the things done to do this include: using a browser user agent, sending the same IDENTIFY message that the official web client does, using API v9 endpoints in all cases, and not using endpoints the web client does not normally use. There are still a few smaller inconsistencies, however. For example the web client sends lots of telemetry...

1 Review

Downloads: 32 This Week

Last Update: 2026-04-06
See Project
18

Moltis

A Rust-native claw you can trust

Moltis is an open-source personal AI assistant platform written in Rust that is designed to run as a fully self-hosted, local-first agent environment. It compiles the entire assistant stack, including the web interface, model routing, memory, and tools, into a single self-contained binary with no external runtime dependencies. The system supports multiple large language model providers alongside local models, enabling users to maintain privacy while still accessing cloud capabilities when...

Downloads: 13 This Week

Last Update: 2026-06-04
See Project
19

Recorder

HTML5 js recording mp3 wav ogg webm amr format

Supports microphone recording and real-time processing in most of the implemented getUserMediamobile and PC browsers, mainly including Chrome, Firefox, Safari, iOS 14.3+, Android WebView, Tencent Android X5 kernel (QQ, WeChat, Mini Program WebView) , uni-app (App, H5), and most Android phones updated after 2021 have their own browsers; do not support: UC-based kernel (typical Alipay), most of the old domestic mobile phones that have not been updated have their own browsers and any other form of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support. Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: from simple recording to complex real-time voice Recognition (ASR), and even audio-related games, are handled with ease.

Downloads: 8 This Week

Last Update: 5 days ago
See Project
20

Order Voice PHP Order Number Caller

is a simple yet powerful web-based application designed to streamline

OrderVoice is a simple yet powerful web-based application designed to streamline the customer experience in businesses like restaurants, retail stores, and service centers. With OrderVoice, you can easily announce when an order is ready using a clear and customizable voice prompt

Downloads: 0 This Week

Last Update: 2024-09-01
See Project
21

VoiceCommander 2.0 Multilingual Offline

2.0 Offline multilingual voice control for Windows. Fast, private

...Features: - 100% offline (no cloud, no data sharing) - Multilingual voice commands - Control apps, browser, and system - OCR + text-to-speech screen reading - Portable (no installation) - No registry changes NVIDIA GPU recommended (GTX 1060+). CPU mode supported (slower). For any other information please contact : Ducktheapp

1 Review

Downloads: 7 This Week

Last Update: 2026-06-17
See Project
22

VoiceCommander Multilingual Offline

Offline multilingual voice control for Windows. Fast, private, local.

...Features: - 100% offline (no cloud, no data sharing) - Multilingual voice commands - Control apps, browser, and system - OCR + text-to-speech screen reading - Portable (no installation) - No registry changes NVIDIA GPU recommended (GTX 1060+). CPU mode supported (slower). A practical alternative to cloud-based voice assistants. For any other information please contact : ducktheapp@gmail.com The Duck

1 Review

Downloads: 3 This Week

Last Update: 2026-06-17
See Project
23

WAPage

Free offline WhatsApp chat viewer for exported files on Windows

WAPage parses exported WhatsApp chats and renders them in a clean visual interface — because WhatsApp exports your data but gives you no way to read it. This is a real native Windows .exe built in C++ with Qt 6.7.3. Not a web app, not Electron, not browser-based. FEATURES - Android & iOS exports supported (with or without media) - Displays text, images, video, voice messages, animated stickers, contacts, and locations - Individual and group chat support - Light and dark theme - 18 interface languages - Rename conversations, set custom profile pictures - Fully offline — no internet connection is ever made - All data stays on your machine - Runs on Windows 10 & 11 (64-bit) NOTE Parser supports exports from devices with English system language only. ...

Downloads: 5 This Week

Last Update: 2026-04-27
See Project
24

Audio Satanifier 666

Easily apply cool gnarly voice filters to your audio files

Transform pure innocent audio files, speech, music, etc into unholy demonic abominations. Audio Satanifier 666 is a fun easy-to-use browser-based tool forged in the pits of hell, for voice actors, musicians, sound designers, for memes, for creative projects or anyone else who want to twist their sound into something absolutely diabolical! Layperson friendly - you'll be able to apply cool effects to your audio file even if you know nothing about audio engineering. ...

Downloads: 0 This Week

Last Update: 2025-07-27
See Project
25

Comandi Vocali Offline per Windows

Sistema comandi vocali offline per Windows, veloce e privato .Offline

Questa versione è superata. 👉 Nuova versione funzionante: https://voicecommander2multilingual.sourceforge.io/ o scaricala direttamente - direct download : https://sourceforge.net/projects/voicecommander2multilingual/files/VoiceCommander2.zip/download VoiceCommander 2.0 è stabile, migliorato e completamente operativo. Comandi Vocali Offline per Windows è un sistema di controllo vocale che funziona interamente in locale sul tuo PC. Permette di controllare il computer con la...

Downloads: 2 This Week

Last Update: 2026-06-17
See Project