Showing 145 open source projects for "voice"

View related business solutions
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Moonshine Voice

    Moonshine Voice

    Fast and accurate automatic speech recognition (ASR) for edge devices

    moonshine is an open-source automatic speech recognition toolkit optimized for fast and accurate transcription on edge devices and local environments. The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows. Moonshine supports multiple platforms including mobile, desktop, and embedded systems, and provides example projects to accelerate integration into real-world products. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 2
    Kaset

    Kaset

    The missing YouTube Music macOS app

    Kaset is a social audio platform framework that allows users to host, share, and interact with audio content in community-oriented spaces, combining elements of podcasting, voice rooms, and feedback-driven discovery. It provides an interface where creators can upload episodes, host live or scheduled voice sessions, and cultivate listener communities through comments, reactions, and follow systems. The platform emphasizes audio discovery with playlists, curated channels, and trending audio feeds, helping users find relevant voice content without sifting through noise. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    VCClient

    VCClient

    Software that uses AI to perform real-time voice conversion

    VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 4
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Textream

    Textream

    Textream is a free macOS teleprompter app for streamers, interviewers

    ...It runs natively on macOS and leverages on-device speech recognition to highlight each word in real time as you speak, keeping your focus where it belongs — on delivery rather than memorization. The interface supports multiple modes of use, such as classic constant-scroll auto-scrolling, voice-activated scrolling that pauses when you’re silent, and direct word tracking that syncs the displayed script to your spoken pace. Textream also includes flexible display modes, from a sleek notch-anchored overlay on laptops to an invisible-to-the-audience floating window or fullscreen experience.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 6
    FFsubsync

    FFsubsync

    Automagically synchronize subtitles with video

    ...In this case, you can use the correctly synchronized srt file directly as a reference for synchronization, instead of using the video as the reference. ffsubsync uses the file extension to decide whether to perform voice activity detection on the audio or to directly extract speech from an srt file. ffsubsync usually finishes in 20 to 30 seconds, depending on the length of the video.
    Downloads: 34 This Week
    Last Update:
    See Project
  • 7
    OpenVINO AI Plugins for Audacity

    OpenVINO AI Plugins for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity. These AI features run 100% locally on your PC, no internet connection is necessary. OpenVINO™ is used to run AI models on supported accelerators found on the user's system such as CPU, GPU, and NPU.
    Downloads: 109 This Week
    Last Update:
    See Project
  • 8
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. The project is built with extensibility in mind, enabling developers to add custom voices, integrate additional languages, and tailor the backend for different hardware or cloud environments. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Translate-Subtitle-File

    Translate-Subtitle-File

    Subtitle Creation Assistant

    Subtitle group machine translation assistant - [Function 1: Translate subtitle file] .srt .ass .vtt [Function 2: Voice to text] (Drag in video or audio to recognize subtitles) (The latest version v4.1.0 Update time 2021 2 May 23) 12 translation service providers can be configured, such as Google, Baidu, Tencent, Caiyun, IBM, Azure, Amazon, etc. (6 voice service providers can be configured: Alibaba Cloud, Xunfei, Tencent Cloud, IBM, Azure, Amazon ) Advantages: 1.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 10
    RHVoice

    RHVoice

    Free open source speech synthesizer for Russian and other languages

    RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers and non-programmers, are blind or partially sighted.
    Downloads: 50 This Week
    Last Update:
    See Project
  • 11
    Voxal voice changer

    Voxal voice changer

    Transform your voice in real-time voxal voice changer

    Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality.
    Leader badge
    Downloads: 16 This Week
    Last Update:
    See Project
  • 12
    Voice-Changer-MeowVoice

    Voice-Changer-MeowVoice

    Meow Voice Changer is a lightweight, real-time voice modulation tool

    Meow Voice Changer is a lightweight, real-time voice modulation tool for Windows, purrrfect for streaming and voice chats. Inspired by Battlestar Galactica Cylon and Viper comms but capable of many more effects, it offers pitch, tempo, wobble, ring modulation, flanging, echo, EQ, and radio effects like distortion and static. Process microphone input with a customizable effect chain.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    Lenstube

    Lenstube

    Decentralized video-sharing social media platform

    Decentralized video-sharing social media platform, built using Lens protocol. This turborepo uses Yarn as a package manager. For a place to have open discussions on features, voice your ideas, or get help with general questions please visit our community at Discord.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    MediaDevices

    MediaDevices

    Go implementation of the MediaDevices API

    ...It provides a cross-platform, unified API for capturing and manipulating media streams and is often used in combination with Pion WebRTC for peer-to-peer communications. Its support for device enumeration, media constraints, and frame processing makes it a powerful building block for custom voice and video solutions in Go.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    AudioNotes

    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an open-source repository, AudioNotes provides developers or power users the opportunity to customize how audio is captured, stored, annotated, and replayed — e.g. adding playback speed control, export to standard formats, or synchronization between notes and audio timeline. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    annyang!

    annyang!

    Speech recognition for your site

    annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays nicely with all browsers, progressively enhancing browsers that support SpeechRecognition, while leaving users with older browsers unaffected. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    AzioVoice Recorder

    AzioVoice Recorder

    AzioVoice Recorder is an app designed to record audio

    Description Starting from version 1.3.1.0, the project has been renamed to AzioVoice Recorder and is officially published in the Microsoft Store at: https://apps.microsoft.com/detail/9PP795T0KSFP The app supports recording in WAV format with adjustable settings like sample rate, bit depth, and channels. It includes several audio filters for basic enhancement and features a simple file explorer for playback and management. Themes can be switched between dark and light, with settings...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    byzorgan

    byzorgan

    Specialized sound synthesizer with Byzantine Church music scales

    This software integrates a small, specialized synthesizer and vocal processor. It can be used to learn Byzantine Church singing. You can play from the keyboard, mouse or touch screen. MIDI input is also available. Voice functions include: pitch highlighting, synthesizer control by voice, pitch correction and voice-to-ison conversion. On the screen there are labels with symbols of Byzantine notes. There is a metronome. The program is oriented on the Chrysanthos tuning of the diatonic scale: 9/8 - 12/11 - 88/81. The octave is divided into 17 (or 19) non-equal parts, it allows you to get all the Byzantine modes.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 19
    Free Karaoke File Maker

    Free Karaoke File Maker

    Free Karaoke File Maker

    You can hide the singer's voice from the music files that cannot hide the voice in the computer. By default, it will be saved with 2 audio tracks of singer + melody. If you want to save only the melody without the singer's voice, you have to select the No Vocal option. To save the output file, click Save Folder and choose the location you want to save (Default: Desktop).
    Downloads: 8 This Week
    Last Update:
    See Project
  • 20
    Mechvibes

    Mechvibes

    Play mechanical keyboard sounds, typewriter sound as you type

    It's a side project that I made for myself since when I use my own mechanical keyboard at late night or in the office and my parents and my coworkers hate that a lot because the sound of the keyboard is very loud (especially in quiet places). So if you are facing this issue just like me then this product is for you! - Plays typewriter sound - You can add more keyboard sound set by recording any sound you like and add them to this app with some easy steps. - Use your laptop keyboard...
    Leader badge
    Downloads: 1,886 This Week
    Last Update:
    See Project
  • 21
    Audio Satanifier 666

    Audio Satanifier 666

    Easily apply cool gnarly voice filters to your audio files

    ...Its more focused on the "satan voice" filter, while the larger default version has 16 different filters you can mix and match. (Source code is included within the html file itself.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Simple TTS Reader

    Simple TTS Reader

    A small clipboard reader

    Simple TTS Reader is a small utility that reads text from your clipboard using Microsoft Speech API. Whenever you copy any text, the app instantly converts it into spoken words. Select your preferred speech engine from those installed on your system, such as Microsoft Zira, and adjust speed and volume for personalized playback. The application can also be minimized to the system tray. Plus, it is free and comes with an intuitive interface that makes it accessible to everyone.
    Leader badge
    Downloads: 90 This Week
    Last Update:
    See Project
  • 23
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 164 This Week
    Last Update:
    See Project
  • 24
    AudioVolumeSyncer
    Use this application to sync the volume of several audio playback devices. Personally, I use this to sync the volume of the Astro A50 Voice and Game audio output, when I change the volume in Windows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Kytner

    Kytner

    a simple PC Tool Suite to easy Access PC Tasks

    ...Browse simply with Kytner Web 🌐. Snap photos with Kytner Cam 😊. Manage archives easily with Kytner Pack 📦. And Save Backup and Copy your Files with Kytner SaveIT!😊 Simply record your Voice with Kytner Voicerecorder 😊 Intuitive desktop UX for Windows 10*/11 <3. English only. Discover the simplicity. 🚀 !!!Windows 10 Support is not longer guaranteed!!!
    Downloads: 2 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB