Showing 30 open source projects for "wav to text"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Voice-Pro

    Voice-Pro

    Comprehensive Gradio WebUI for audio processing

    Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.
    Downloads: 42 This Week
    Last Update:
    See Project
  • 2
    edge-tts

    edge-tts

    Use Microsoft Edge's online text-to-speech service from Python

    edge-tts is a Python module and command-line tool that gives you direct access to Microsoft Edge’s online text-to-speech service without needing the Edge browser, Windows, or any API key. It wraps the same cloud voices used by Edge, exposing them through a simple CLI (edge-tts, edge-playback) and a Python API, so you can script high-quality speech generation in your own applications. The tool lets you list available voices, specify locale and voice name, and generate audio files in common formats like MP3 or WAV.
    Downloads: 28 This Week
    Last Update:
    See Project
  • 3
    Audiblez

    Audiblez

    Generate audiobooks from e-books

    Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained on under 100 hours of audio, and supports multiple languages, including English (US/UK), Spanish, French, Hindi, Italian, Japanese, Brazilian Portuguese, and Mandarin Chinese. ...
    Downloads: 34 This Week
    Last Update:
    See Project
  • 4
    Bootleg Text Slicer

    Bootleg Text Slicer

    Text transcription & slicing tool with visual timeline and WAV output.

    ..." - Adjust timing offsets for the beginning and end of each word either globally or individually. - Play full audio or specific words directly from within the app. - Export words as separate `.wav` audio files. - Record the timeline position, along with the global and per‑word timing offsets for each exported word, into a cutTemplate.txt file so that the individual words can later be played using only the source audio file. GitHub repository: https://github.com/Northstrix/bootleg-text-slicer Successfully tested with English and Italian audio files. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    OpenAI-Compatible Edge-TTS API

    OpenAI-Compatible Edge-TTS API

    Free, high-quality text-to-speech API endpoint to replace OpenAI

    OpenAI-Compatible Edge-TTS API is a local, OpenAI-compatible text-to-speech API that uses edge-tts—Microsoft Edge’s online TTS service—as the backend. The project emulates the /v1/audio/speech endpoint used by OpenAI, so any client that can talk to the OpenAI TTS API can be redirected to this service with minimal changes. It exposes parameters for input text, voice selection, audio format, and playback speed, mirroring the OpenAI interface while mapping popular OpenAI voice names to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3,...
    Leader badge
    Downloads: 64 This Week
    Last Update:
    See Project
  • 7
    Hexen II: Hammer of Thyrion

    Hexen II: Hammer of Thyrion

    A cross-platform port of Hexen II game.

    Hammer of Thyrion (uHexen2) is a cross-platform port of Raven Software's Hexen II source. It is based on an older linux port, Anvil of Thyrion. HoT includes countless bug fixes, improved music, sound and video modes, opengl improvements, support for many operating systems and architectures, and documentation among many others.
    Leader badge
    Downloads: 240 This Week
    Last Update:
    See Project
  • 8
    Kisekae UltraKiss

    Kisekae UltraKiss

    Kisekae UltraKiss is a full featured integrated development environmen

    UltraKiss is a computer program that implements the Kisekae Set system, KiSS, a Japanese graphics system originally developed to facilitate costume changes on virtual dolls. UltraKiss was developed to help artists build their KiSS sets. It is a full featured viewer for all KiSS dolls, games, and visual applications. It is also a complete graphical development environment for creating KiSS applications. It fully implements the FKiSS event driven programming language up to and including...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9

    Virtualdub Batch Video DeShake v26.0204

    Batch to compress [and deshake] all videos [or images] in folder

    Installation: Execute "DeShakInst.BAT" VirtualDub2 44282; AviSynth+ 3.7.5 updated to C:\DVD DESHAK.BAT updated to C:\UT and added to PATH Usage: DESHAK task[s] [parameters] Tasks: tp1: deshake pass1 LOG generation for 2nd pass tp2: deshake pass2 and compress video and audio to MP3 tcomp: compress (no deshake) twav: extract WAV and/or uses external WAV audio Parameters (more in help): vEXT: video extension (ie: vmov), default: vAVI qN: h264...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    MATTA

    MATTA

    Morse Code Utilitiies to convert text messages to & from sound files.

    ...The wav file is expected to be international morse code, preferrably clean and properly spaced. Tonal frequency or wpm-speed does not seem to matter. Now includes an inverse commandline app, txt2wav that creates a morse code WAV file from English text. The proper command to extract the archive and maintain the directory structure is "7z x filename".
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Ruby 2D

    Ruby 2D

    The Ruby 2D gem

    Ruby2D is a simple and elegant 2D graphics library for the Ruby programming language, designed to make it easy to build games, simulations, and interactive applications. Built atop SDL2 and OpenGL, Ruby2D abstracts away the complexity of low-level graphics programming while exposing enough control for performance and flexibility. It supports images, text, sounds, and basic geometric shapes, making it ideal for learning graphics or quickly prototyping ideas with Ruby. The library is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    edge-TTS-record

    edge-TTS-record

    Tool that can record speech synthesis

    edge-TTS-record is a Windows-based tool that records speech synthesized by the Microsoft Edge browser’s online TTS voices and saves the result as .wav audio files. The idea is simple but effective: since Edge’s online TTS voices (such as “Xiaoxiao” or “Yunyang” for Chinese) are often high-quality, this tool provides a way to “capture” them offline for later use. Users can type or paste text, preview the speech, and then trigger the recorder; the system automatically captures the audio output from the browser and writes it to a WAV file. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Heads or tails - Retro Game with sound
    See the game video on youtube: https://youtu.be/mMhL_U42j-o The whole program is written with Java. Everything runs through text input. You will find 2 folders in this package. Once the "Head or Tail Game" folder, where you will find the game file .exe. The folder goodies contains the .java, the .jar and the .wav file of the game music. Have fun with the game
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Render32

    Render32

    Command-line video compositing and audio mixing tools

    Render is a program for creating composite BMP image sequences. These images are composited as specified in a text configuration file. Mixer is a program for mixing film soundtracks. It accepts input files in WAV format and outputs a mixed soundtrack in WAV format. Each input channel can contain one or more audio files that are edited and mixed using a cue sheet. The maximum number of channels is a compile-time parameter.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15

    XiX Music Player

    XiX Music Player is a multi-platform music player

    XiX Player is a free easy to use multi-platform music player that currently runs in Linux, Linux ARM (Raspberry Pi), Windows & MacOS Intel Features: Supports the following file formats: MP3, OGG, M4A (non-DRM), AAC, FLAC, OPUS, APE, DFF, WAV Play & Rip your CD to MP3 or FLAC. CD-Text and CDDB support Rip DVD tracks to MP3 or FLAC. Needs mplayer. See albums the choosen artist is on and vice versa Create and use Playlists Online Radiostations + Presets Record Online Radiostations Schedule Radiostation recordings Listen & Download Podcasts Play License free audio from the Internet Archive Show the lyrics and CD-Covers of the song being played Shuffle and Repeat Reverse Play Crossfading & Trimming Search Rate your songs EQ + FXs (Flanger, Echo & Reverb) Set EQ & TRIM for individual songs Copy, Delete or Rename the file Change ID3 tag (only for MP3/OGG/FLAC/APE) Multi TAGGING/RENAMING Theme support (Basic)
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    GloboNote

    GloboNote

    Create sticky notes, to-do list, journals & reminders all in one app

    GloboNote is a free and easy to use desktop note taking application. It lets you create sticky notes, to-do lists, journals, reminders and other notes in one place. There are no limits to the number of sticky notes you can create. Notes can be organize by groups and search using the search tool. GloboNote can be run in any OS that has Java 8 installed.
    Leader badge
    Downloads: 45 This Week
    Last Update:
    See Project
  • 17
    Silent Mantis

    Silent Mantis

    Event recorder software for animal and human behaviour research.

    Silent Mantis is a freely available, cross-platform event recorder software, which integrates the features of a video player, an image viewer and a spreadsheet editor. It can be used in animal or human behavioral research to manually record any animal or human bevavioral states or events from a video file. The results can be exported into text files for statistical evaluation purposes. SM was developed by Peter Szabo at the Institute of Biology of the University of Veterinary Medicine in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Oasi -  Open Document Speaker

    Oasi - Open Document Speaker

    A simple Text2Audio

    Document Speaker - A simple Editor to give VOICE on Your Documents, save your doc as AudioBook or other format this app recognizes the language of the documents and converts them into audiobooks by recognizing texts in nearly 200 languages ... Open RTF & RTFD (mac format/inode directory) ODT,EPUB (unstable), PDF as plain Text to convert as MP4 or AudioBook. Convert Text to Voice Format: 3gp2 3GPP-2 Audio (.3g2) [Qclp,aac,aace,aacf,aach,aacl,aacp] 3gpp 3GP Audio (.3gp) [Qclp,aac,aace,aacf,aach,aacl,aacp] AIFC AIFC (.aifc,.aiff,.aif) [lpcm,ulaw,alaw,ima4,Qclp] AIFF AIFF (.aiff,.aif) [lpcm] NeXT NeXT/Sun (.snd,.au) [lpcm,ulaw] Sd2f Sound Designer II (.sd2) [lpcm] WAVE WAVE (.wav) [lpcm,ulaw,alaw] adts AAC ADTS (.aac,.adts) caff CAF (.caf) [Qclp,aac,aace,aacf,aach,aacl,aacp,alac,alaw,ilbc,ima4,lpcm,ulaw] m4af Apple MPEG-4 Audio (.m4a,.m4r) [aac,aace,aacf,aach,aacl,aacp,alac] m4bf Apple MPEG-4 AudioBooks (.m4b) [aac,aace,aacf,aach,aacl,aacp] mp4f MPEG-4 Audio (.mp4
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Mp3blaster is an interactive text-based program that plays MP3, Ogg Vorbis, wav, and sid audio files. One of its key features is its nested playlist editor which can group albums or genres.
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    BeepComp

    BeepComp

    Text-based chiptune creator

    Compose chiptunes with text files! And make mp3 and wav files of your songs to share with the world :) The audio synthesizer engine comes with 10 channels (9 music + 1 drum). The retro "beep" sounds reminiscent of old video game consoles and vintage PCs will take you back to the 8-bit era. You can shape your sound with waveforms - square, sawtooth etc. - and add LFO, delay and volume envelopes.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    TSWeChat

    TSWeChat

    A WeChat alternative. Written in Swift 5

    TSWeChat - A WeChat alternative, updated to Swift 5. The cell image in TSChatImageCell is drawn by using a Mask Layer. The chat background can be changed freely so that UI will look perfect. Audio wav files can be automatically converted into amr files which facilitate file transfer to Android devices. Both of the two types of files have been cached.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Text to Speech for Video

    create wav files for video character speech by typing in dialogue

    Choose from the "voices" available, and type in what you want the computer to say. A wave file called sounds.wav is stored to the output sub folder. Output is intended primarily for users who need speech for animated characters in videos.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Nightingale

    Nightingale

    A community supported fork of the Songbird media player and library.

    Nightingale is a community created fork of the Songbird media player. It is developed by a proud community and we are equally proud to bring you the most extensible, feature-rich media experience on Windows, Mac, and Linux. See the official website at http://getnightingale.com for the source, builds, and information. On Sourceforge, we provide our releases, the binary deps for building, as well as builds for testing purposes.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24

    Java-TTS Converter

    Text To Speech converter

    This application can convert the given text into speech.The speech may converted into seperate audio file for future use. we can give .txt,.doc,.docx text file as a input can convert the text in to audible .wav file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Simple Poker Tournament Clock

    Cross-platform Poker Tournament Clock

    A simple poker clock written in "bare" python, which supports different kinds of poker, XML-based tournament structures and display of banners for poker league sponsors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB