google voice free download

Showing 65 open source projects for "google voice"

View related business solutions

Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
1

Voice-Pro

Comprehensive Gradio WebUI for audio processing

Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.

1 Review

Downloads: 31 This Week

Last Update: 18 hours ago
See Project
2

Leku

Map location picker component for Android

Map location picker component for Android. Based on Google Maps. An alternative to Google Place Picker. Component library for Android that uses Google Maps and returns a latitude, longitude and an address based on the location picked with the Activity provided. Note that you have the voice_search_extra_language that is used for the language of the voice recognition. Replace it with the allowed voice recognition locale for your language.

Downloads: 0 This Week

Last Update: 2026-01-13
See Project
3

Porcupine

On-device wake word detection powered by deep learning

...Raspberry Pi, NVIDIA Jetson Nano, and BeagleBone. Android and iOS. Chrome, Safari, Firefox, and Edge. Linux (x86_64), macOS (x86_64, arm64), and Windows (x86_64). Scalable. It can detect multiple always-listening voice commands with no added runtime footprint. Self-service. Developers can train custom wake word models using Picovoice Console. Porcupine is the right product if you need to detect one or a few static (always-listening) voice commands. If you want to create voice experiences similar to Alexa or Google, see the Picovoice platform.

Downloads: 6 This Week

Last Update: 2025-12-11
See Project
4

Rasa

Open source machine learning framework to automate text conversations

Rasa is an open source machine learning framework to automate text-and voice-based conversations. With Rasa, you can build contextual assistants on Facebook Messenger, Slack, Google Hangouts, Webex Teams, Microsoft Bot Framework, Rocket.Chat, Mattermost, Telegram, and Twilio or on your own custom conversational channels. Rasa helps you build contextual assistants capable of having layered conversations with lots of back-and-forths.

Downloads: 10 This Week

Last Update: 2025-01-14
See Project
Host LLMs in Production With On-Demand GPUs
NVIDIA L4 GPUs. 5-second cold starts. Scale to zero when idle.

Deploy your model, get an endpoint, pay only for compute time. No GPU provisioning or infrastructure management required.

Try Free
5

Translate-Subtitle-File

Subtitle Creation Assistant

Subtitle group machine translation assistant - [Function 1: Translate subtitle file] .srt .ass .vtt [Function 2: Voice to text] (Drag in video or audio to recognize subtitles) (The latest version v4.1.0 Update time 2021 2 May 23) 12 translation service providers can be configured, such as Google, Baidu, Tencent, Caiyun, IBM, Azure, Amazon, etc. (6 voice service providers can be configured: Alibaba Cloud, Xunfei, Tencent Cloud, IBM, Azure, Amazon ) Advantages: 1. ...

Downloads: 15 This Week

Last Update: 2026-07-06
See Project
6

ChatOllama

ChatOllama is an open-source AI chatbot

ChatOllama is an open-source chatbot platform built with Nuxt 3 and designed to provide a private, extensible interface for working with multiple modern language model providers. It goes beyond a basic chat UI by supporting a broad model ecosystem that includes OpenAI, Azure OpenAI, Anthropic, Google Gemini, Groq, Moonshot, Ollama, and other OpenAI-compatible services. The platform also includes higher-level capabilities such as AI agents, document-backed knowledge bases, real-time voice chat, and Model Context Protocol integration for external tools. Its RAG functionality allows document upload and knowledge-base-driven interaction, while vector database support adds more scalable retrieval options. ...

Downloads: 0 This Week

Last Update: 2026-05-28
See Project
7

SEO Machine

A specialized Claude Code workspace for creating long-form

...The system uses specialized commands and agents to perform tasks such as keyword research, competitor analysis, content drafting, and optimization. It incorporates real data sources like Google Analytics and Search Console to guide decision-making and improve content effectiveness. The architecture emphasizes context-awareness, using brand voice, style guides, and keyword strategies to maintain consistency across outputs. It also includes performance evaluation tools that score content and suggest improvements before publishing.

Downloads: 0 This Week

Last Update: 2026-04-10
See Project
8

Jovo Framework

The React for Voice and Chat, build apps for Alexa, Google Assistant

The multimodal experience platform enables professional teams to build and run apps that work across smart speakers, the web, mobile, and more. Fully customizable and open source. The Jovo product ecosystem allows you to build, test, and run powerful experiences for voice, chat, and web platforms. From local development to production, Jovo allows you to build robust experiences, faster. Build across devices and platforms and use all supported modalities thanks to the Jovo output template...

Downloads: 0 This Week

Last Update: 2024-06-12
See Project
9

Better Chatbot

Just a Better Chatbot. Powered by MCP Client & Workflows

Better‑chatbot is an AI chatbot framework powered by MCP protocols and workflows, allowing developers to deploy and integrate AI-powered chat systems with ease. Integrates all major LLMs: OpenAI, Anthropic, Google, xAI, Ollama, and more. MCP protocol, web search, JS/Python code execution, data visualization. Custom agents, visual workflows, artifact generation. Custom agents, visual workflows, artifact generation. Realtime voice chat with full MCP tool integration.

Downloads: 8 This Week

Last Update: 2025-11-22
See Project
99.99% Uptime for MySQL and PostgreSQL Databases
Sub-second maintenance. 2x read/write performance. Built-in vector search for AI apps.

Cloud SQL Enterprise Plus delivers near-zero downtime with 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server.

Try Free
10

RealtimeTTS

Converts text to speech in realtime

RealtimeTTS is a low-latency text-to-speech library built for real-time applications such as voice chat with LLMs, assistants, and interactive tools. It is designed around a streaming model: you can feed it text incrementally (for example, as an LLM responds) and get audio output almost immediately, which keeps end-to-end latency very low. The library is engine-agnostic and plugs into a wide range of cloud and local TTS systems, including OpenAI, ElevenLabs, Azure, Coqui, Piper, StyleTTS2, Edge TTS, Google TTS, system TTS and others, so you can swap providers without rewriting your pipeline. ...

Downloads: 9 This Week

Last Update: 2026-05-25
See Project
11

PyGPT

Open source personal AI Assistant for Linux, Windows and Mac

PyGPT is a desktop application that allows you to talk to OpenAI's LLM models such as GPT4 and GPT3 using your own computer and OpenAI API. It allows you to talk in chat mode and in completion mode, as well as generate images using DALL-E 2. PyGPT also adds access to the Internet for GPT via Google Custom Search API and Wikipedia API and includes voice synthesis using Microsoft Azure Text-to-Speech API. Moreover, the application has implemented context memory support, context storage, history of contexts, which can be restored at any time and e.g. continue the conversation from point in history, and also has a convenient and intuitive system of presets that allows you to quickly and pleasantly create and manage your prompts. ...

Downloads: 2 This Week

Last Update: 2026-02-06
See Project
12

ChatTTS_colab

One-click deployment (including offline integration package)

...A distinctive feature is the “voice gacha” system, which batch-generates many distinct voice timbres and allows users to save the ones they like into a curated voice library. It has first-class support for long-form audio generation, making it suitable for audiobooks, podcasts, or long narration tasks. The project also implements multi-speaker or role-based reading, letting users assign different voices to different characters in a script and even use a large language model to generate that script in one step.

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
13

Auto Synced & Translated Dubs

Automatically translates the text of a video based on a subtitle file

Auto-Synced-Translated-Dubs is a toolchain that automatically translates and re-dubs videos using AI voices while keeping the new speech aligned to the original timing via subtitle files. It assumes you have a human-made SRT (or similar) subtitle file; the script then uses translation services such as Google Cloud or DeepL to generate translated subtitle tracks in one or more target languages. Using the timestamps of each subtitle line, it computes the required duration of each spoken...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
14

Networked-Aframe

Web framework for building multi-user virtual reality experiences

Multi-user VR on the Web. A framework for writing multi-user VR apps in HTML and JS. Built on top of A-Frame. Follow the NAF Getting Started tutorial to build your own example from scratch, including setting up a local server. Networked-Aframe works by syncing entities and their components to connected users. To connect to a room you need to add the networked-scene component to the a-scene element. For an entity to be synced, add the networked component to it. By default the position and...

Downloads: 7 This Week

Last Update: 2026-03-30
See Project
15

Tock

Tock, the open source conversational AI toolkit

...Try Tock online, deploy in minutes using Docker running configurations, or setup your own platform to the Cloud, On-Premise, even embedded. More and more teams and companies trust in Tock and open conversational platforms. Built-in connectors for numerous text/voice channels, Messenger, WhatsApp, Google Assistant, Alexa, Twitter and more.

Downloads: 0 This Week

Last Update: 2026-07-02
See Project
16

Personal A.I Assistant

An Open Source Personal A.I Assistant Based on the Google Gemini API that is Fully Customizable for Your Needs, Ask Questions, Request Real Time Data and Information, Play Music. Launch Programs and Open Websites on Your PC with Voice Commands. ***Requires Your Google Gemini API Key to Work***

Downloads: 21 This Week

Last Update: 2024-11-02
See Project
17

Open Camera

Camera app for Android.

Open Camera is an Open Source Camera app for Android(TM) phones and tablets. * Get it on Google Play: https://play.google.com/store/apps/details?id=net.sourceforge.opencamera * Get it on F-Droid: https://f-droid.org/en/packages/net.sourceforge.opencamera/ * Or you can download the APK/source directly, see "Files" above. Android, Google Play and the Google Play logo are trademarks of Google LLC.

48 Reviews

Downloads: 1,532 This Week

Last Update: 2026-06-03
See Project
18

Xiaomi.eu Multilang MIUI ROMs

OFFICIAL mirror of MIUI Multilang ROM by https://xiaomi.eu/

Included languages: English (en.miui.com, xiaomi.eu) Chinese (miui.com) Polish (Acid, miuipolska.pl) Slovak (ingbrzy, https://miuios.cz) Czech (VMach, https://miuios.cz) Bulgarian (Global) Hungarian (vagyula, miui.hu) French (Global) Swedish (Global) Russian (Global) Ukrainian (Global) Belorussian (Global) Croatian (Masva, MIUI Adria) Vietnamese (Belmont-Gabriel) Italian (miui.it) Greek (Global) Norwegian (Global) Dutch (cueie, mannyribo, redmaner) Spanish (nosijf982)...

103 Reviews

Downloads: 9,075 This Week

Last Update: 6 hours ago
See Project
19

Voice Accounting For Blind & Mute People

Free & Easy AI Voice Accounting Software For Blind & Speechless People

Just download the above zip file, extract it and then open the index.html file on internet browsers like Firefox ( preferable ) or Google Chrome. Also, please view and download my full collection of softwares for people with disabilities, here : https://sourceforge.net/projects/softwares-for-disabled-people/ This full collection also includes the Voice Accounting Software as well.

Downloads: 0 This Week

Last Update: 2024-04-30
See Project
20

Glint Translator

Glint Translator is a high-performance, privacy-focused Windows application for real-time in-game and voice translation without interrupting gameplay. Powered by leading cloud and offline/local AI models including Google Gemini, OpenAI, xAI Grok, DeepL, Azure, and Ollama (Gemma, Qwen), it seamlessly translates 240+ languages with an intuitive, plug-and-play interface. Example: They speak German → you see Turkish They speak Turkish → you see German 🧠 AI Model Support Google Gemini: 2.5 Flash, 2.5 Pro OpenAI: GPT-4o, GPT-4 Turbo xAI: Grok Local/Offline (via Ollama Engine): Gemma, Qwen

1 Review

Downloads: 30 This Week

Last Update: 2026-06-23
See Project
21

CerberusCMS5

Cerberus Content Management System

Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...

Downloads: 20 This Week

Last Update: 2024-11-26
See Project
22

Cerberus Content Management System 6

Cerberus Content Management System

Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model complete with a Unix-Like Kernel File named: Cerberus. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text,...

Downloads: 33 This Week

Last Update: 2026-06-02
See Project
23

Softwares For Blind, Deaf, Handicap

Easy AI Softwares for Blind, Deaf, Handicapped, Disabled People

Just download the above zip file, extract it first and then open the index.html file on internet browsers like Firefox ( preferable ) or Google Chrome. Also, keep NumLock ON while using the Numeric Keypad of any Keyboard. Can also attach an external USB keyboard, with seperate Numeric Keypad, if required. I have added some general guidelines for students, using these softwares, on the Wiki Page of this website. Please refer them for more instructions.

Downloads: 0 This Week

Last Update: 2026-01-18
See Project
24

cerberuscms2

Cerberus Content Management System

Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...

1 Review

Downloads: 1 This Week

Last Update: 2024-06-10
See Project
25

texttalk

Talk through typing the text

Speak with generated voice of text input. Using Google translate web service, the audio sound of the spoken text can be extracted. Ideal for helping remote support, voice proxy for privacy, etc.

Downloads: 0 This Week

Last Update: 2024-02-05
See Project