gTTS (Google Text-to-Speech) is a Python library and command-line tool that wraps the speech functionality of Google Translate. It lets you send text to the Google Translate TTS endpoint and receive spoken audio back as MP3 data, either written to a file, a file-like object, or standard output. The library is designed to handle long texts, using a speech-specific sentence tokenizer that keeps intonation and punctuation natural while splitting requests into acceptable chunks. It supports customizable text pre-processors, which can correct pronunciations, tweak formatting, or handle domain-specific vocabulary before sending it to the API. gTTS is primarily aimed at developers who want a quick way to add cloud-backed speech to scripts, apps, or pipelines without managing any model weights locally. A small CLI utility, gtts-cli, makes it easy to test or batch-generate MP3 files right from the shell.
Features
- Python library and CLI that interface with Google Translate’s TTS API
- Writes MP3 output to files, file-like objects, or stdout for easy piping
- Sentence tokenizer that supports long texts while keeping natural prosody
- Customizable text pre-processors for pronunciation and formatting tweaks
- Simple one-line CLI usage (gtts-cli) for quick audio generation
- Cross-platform support with docs, tests, and MIT licensing