Showing 16 open source projects for "google tts nvda"

View related business solutions
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Cookbook (Google Gemini)

    Cookbook (Google Gemini)

    Examples and guides for using the Gemini API

    The Gemini Cookbook is an official repository of examples and guides for using Google’s Gemini API. It provides a structured learning path with quick-start tutorials for beginners and practical examples for advanced users. The repository covers a wide range of Gemini capabilities, including text, images, video, speech, robotics, and multimodal interactions. It highlights newly introduced features such as Gemini 2.5 models (Flash and Pro), Gemini’s native image generation, Veo for video...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    emily-gr

    emily-gr

    Emily combines NVDA and MARY TTS to help people with disabilities.

    Emily is a multilingual text to speech application based on Mary TTS and NVDA. It supports English, French, German, Italian, Turkish and Greek languages with many voices. It can be used as a NVDA addon or as a standalone application by people with reading disabilities.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    RealtimeTTS

    RealtimeTTS

    Converts text to speech in realtime

    ...It is designed around a streaming model: you can feed it text incrementally (for example, as an LLM responds) and get audio output almost immediately, which keeps end-to-end latency very low. The library is engine-agnostic and plugs into a wide range of cloud and local TTS systems, including OpenAI, ElevenLabs, Azure, Coqui, Piper, StyleTTS2, Edge TTS, Google TTS, system TTS and others, so you can swap providers without rewriting your pipeline. It supports both internet-based engines and fully local engines, which lets you choose between privacy, cost, and quality trade-offs. RealtimeTTS also includes robustness features such as automatic fallbacks when a backend fails, so production systems can stay responsive even if one TTS provider is temporarily unavailable.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Voice-Pro

    Voice-Pro

    Comprehensive Gradio WebUI for audio processing

    Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.
    Downloads: 31 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    gTTS

    gTTS

    Python library and CLI tool to interface with Google Translate

    gTTS (Google Text-to-Speech) is a Python library and command-line tool that wraps the speech functionality of Google Translate. It lets you send text to the Google Translate TTS endpoint and receive spoken audio back as MP3 data, either written to a file, a file-like object, or standard output. The library is designed to handle long texts, using a speech-specific sentence tokenizer that keeps intonation and punctuation natural while splitting requests into acceptable chunks. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    Auto Synced & Translated Dubs

    Auto Synced & Translated Dubs

    Automatically translates the text of a video based on a subtitle file

    Auto-Synced-Translated-Dubs is a toolchain that automatically translates and re-dubs videos using AI voices while keeping the new speech aligned to the original timing via subtitle files. It assumes you have a human-made SRT (or similar) subtitle file; the script then uses translation services such as Google Cloud or DeepL to generate translated subtitle tracks in one or more target languages. Using the timestamps of each subtitle line, it computes the required duration of each spoken segment and synthesizes audio via neural TTS services, producing one audio clip per subtitle entry. The tool then time-stretches or compresses each TTS clip to match the original speech duration exactly, which preserves lip-sync and rhythm as closely as possible without manual editing. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    SillyTavern

    SillyTavern

    LLM Frontend for Power Users

    Mobile-friendly, Multi-API (KoboldAI/CPP, Horde, NovelAI, Ooba, OpenAI, OpenRouter, Claude, Scale), VN-like Waifu Mode, Horde SD, System TTS, WorldInfo (lorebooks), customizable UI, auto-translate, and more prompt options than you'd ever want or need. Optional Extras server for more SD/TTS options + ChromaDB/Summarize. SillyTavern is a user interface you can install on your computer (and Android phones) that allows you to interact with text generation AIs and chat/roleplay with characters...
    Downloads: 321 This Week
    Last Update:
    See Project
  • 8
    AudioLM - Pytorch

    AudioLM - Pytorch

    Implementation of AudioLM audio generation model in Pytorch

    Implementation of AudioLM, a Language Modeling Approach to Audio Generation out of Google Research, in Pytorch It also extends the work for conditioning with classifier free guidance with T5. This allows for one to do text-to-audio or TTS, not offered in the paper. Yes, this means VALL-E can be trained from this repository. It is essentially the same. This repository now also contains a MIT licensed version of SoundStream.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    ChatTTS_colab

    ChatTTS_colab

    One-click deployment (including offline integration package)

    ChatTTS_colab is a wrapper project around the ChatTTS model that focuses on “one-click” deployment, especially in Google Colab. It provides an integrated offline bundle and scripts for Windows and macOS so users can run ChatTTS locally without wrestling with complex environment setup. The repository includes Colab notebooks that launch a Gradio-based web UI and expose streaming TTS, making it possible to listen to generated audio as it is produced.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    VALL-E X

    VALL-E X

    Open source implementation of Microsoft's VALL-E X zero-shot TTS model

    VALL-E-X is an open-source implementation of Microsoft’s VALL-E X zero-shot text-to-speech model, focused on multilingual, cross-lingual voice cloning. It is capable of synthesizing speech in English, Chinese, and Japanese from text while mimicking the voice characteristics of a speaker given only a short 3–10 second prompt. The model attempts to match not just timbre, but also tone, pitch, emotion, and prosody of the reference audio, resulting in highly personalized output. VALL-E-X...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    flightcombat_chung
    Flightcombat chung is a 3D openGL space / air / sea / ground flight / car simulator written in compiled freebasic with combat against ships DCA and massive air /air /space dogfight.Give orders to your wingmens ,declare war or attack other planes,or fly in formation , refuel at airports or space stations and explore vastes satellite heightmap countries or planets by plane, spacecraft, foot or car. Can run on a small netbook with windows 7. Zipped and unzip with 7zip. (22/09/2015) added...
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    guglinatts-en

    guglinatts-en

    Guglina TTS, special edition: in English (guglinatts-en)

    Guglina TTS, special edition: in English (guglinatts-en), is a voice synthesizer originally designed for Brazilian Portuguese. Uses the Google Translate text-to-speech API. Read screens for the visually impaired. Transforms text into audio, allowing blind or low-vision people to access content displayed on the screen. Although the main target audience for text-to-speech conversion systems - such as Guglina TTS EN - is people with visual impairment, this type of program can be used by people with dyslexia and other reading disabilities, people with severe as well as by pre-literate children. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    guglinatts-it

    guglinatts-it

    Guglina TTS, edizione speciale: in italiano (guglinatts-it)

    Guglina TTS, edizione speciale: in italiano (guglinatts-it), è un sintetizzatore vocale originariamente progettato per il portoghese brasiliano. Utilizza l'API di sintesi vocale di Google Traduttore. Leggi gli screenshot per gli ipovedenti. Trasforma il testo in audio, consentendo a persone non vedenti o ipovedenti di accedere al contenuto visualizzato sullo schermo.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    guglinatts

    guglinatts

    sintetizador de voz, em português do Brasil, que usa a API do Google

    Guglina TTS é um sintetizador de voz, em português do Brasil, que usa a API de conversão de texto em fala do Google Tradutor. Lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Guglina TTS – seja formado por pessoas com deficiência visual, esse tipo de programa pode ser usado por pessoas com dislexia e outras dificuldades de leitura, pessoas com deficiência severa de fala, bem como por crianças pré-alfabetizadas. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    The News Book

    A News Program Which Extract Contents from Websites And Share Them...

    This Is A News Program Which Extracts Contents From Several News Websites, And Make Them Available For Share On Social Network Sites.Currently Supports Facebook, Working On More Social Networks.It Contains Some Other Features Like You Tube Client And You Tube Video Sharer On Social Networks And Blogs.A Gmail Client.Also Contains The Feature Of Bookmarks For Both You Tube Videos And News Articles.Bookmarks For News Articles Also Works Offline.The Program Maintains The Article Format According To The Websites.Also Supports NSM(News Speech Manager) Which Read The Articles Using The TTS(Text To Speech) Mechanism, Very Useful For Blind Peoples.Very Secure In Gmail Login Process, The Program Remember Password Only When It Is Running,And To Open A Mail You Have To Login First And Once Only, Automatically Sign Out From Google Account On Application Close.Contains A Smooth And Simple User Interface,Working On User Defined GUI. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Provides Text to speech synthesis systems for Indian Languages. It has festival speech synthesiser integrated with screen readers like NVDA and ORCA for windows and linux based systems respectively.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB