Showing 21 open source projects for "you"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    ...The easiest way to install this is using pip install SpeechRecognition. The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    annyang!

    annyang!

    Speech recognition for your site

    ...Use optional words or phrases to define a part of the command as optional. annyang plays nicely with all browsers, progressively enhancing browsers that support SpeechRecognition, while leaving users with older browsers unaffected. Grab the latest version of annyang.min.js, drop it in your html, and start adding commands. You can easily add a GUI for the user to interact with Speech Recognition using Speech KITT. Speech KITT is fully customizable and comes with many different themes, and instructions on how to create your own designs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Buzz

    Buzz

    Transcribe and translate audio offline on your personal computer

    ...Get linux versions from: - https://flathub.org/apps/io.github.chidiwilliams.Buzz - https://snapcraft.io/buzz Home page of Buzz https://github.com/chidiwilliams/buzz Note for Windows: App is not signed, you will get a warning when you install it. Select More info -> Run anyway.
    Leader badge
    Downloads: 4,400 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    Voxal voice changer

    Voxal voice changer

    Transform your voice in real-time voxal voice changer

    Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality.
    Leader badge
    Downloads: 19 This Week
    Last Update:
    See Project
  • 6
    Audio Satanifier 666

    Audio Satanifier 666

    Easily apply cool gnarly voice filters to your audio files

    ...Audio Satanifier 666 is a fun easy-to-use browser-based tool forged in the pits of hell, for voice actors, musicians, sound designers, for memes, for creative projects or anyone else who want to twist their sound into something absolutely diabolical! Layperson friendly - you'll be able to apply cool effects to your audio file even if you know nothing about audio engineering. Theres also a Mini version that has a voice deepening that doesn't slow the speed of the audio. Its more focused on the "satan voice" filter, while the larger default version has 16 different filters you can mix and match. (Source code is included within the html file itself.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    ...Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    TTS

    TTS

    Deep learning for text to speech

    ...Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings efficiently. Vocoder models (MelGAN, Multiband-MelGAN, GAN-TTS, ParallelWaveGAN, WaveGrad, WaveRNN). If you are only interested in synthesizing speech with the released TTS models, installing from PyPI is the easiest option.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    ILA - teachable voice assistant

    ILA - teachable voice assistant

    ILA is a fully customizable and teachable voice assistant for Java

    ...The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3 users can also write own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    FM2TXT

    FM2TXT

    RtlSdr listen to radio, recognize audio, and writes text file log

    Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt
    Downloads: 10 This Week
    Last Update:
    See Project
  • 11

    DailyText-Voice

    Read out jw.org daily text on mobile

    The DailyText-Voice android app crawls jw.org website and reads out loud the daily text in the notification bar of your android device.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Text to Speech for Video

    create wav files for video character speech by typing in dialogue

    Choose from the "voices" available, and type in what you want the computer to say. A wave file called sounds.wav is stored to the output sub folder. Output is intended primarily for users who need speech for animated characters in videos.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Steel TTS

    A cross-platform wrapper for common text-to-speech engines in Python

    ...Steel currently supports the following TTS software: - Microsoft Speech API 5 (SAPI5) - eSpeak - NS Speech Synthesis - FreeTTS Documentation: http://sourceforge.net/p/steeltts/wiki/ Bug Tracker: http://sourceforge.net/p/steeltts/tickets/ If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead developer, Jasper Danielson, at jrd4@rice.edu.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Bermuda Text-to-Speech

    This project includes basic NLP and DSP techniques for Text-to-Speech

    ...page=tts This is an entirely written in JAVA project which includes a set of tools and methods designed to enable Multilingual Text-to-Speech (TTS) synthesis. We currently support English and Romanian but we will soon train more models and make them available for download. If you want to read more about our other NLP and TTS tools check out http://nlptools.racai.ro.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    InproTK

    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Zanzibar is a complete, standards based IVR. It includes an MRCPv2 Server with ASR and TTS engines as well as an voiceXML interpreter so that you can deploy and run voiceXML applications. It integrates with VOIP PBX’s (like Asterisk) using SIP and RTP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    This is an application where one can set the task to be reminded of in future and you will be notified at that time by voice. You have the option of choosing male/female voice too. Besides, you can choose the time in seconds, minutes or hours.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Performs actions on detected volume threshold Examples : - Launch music on clap - Launch speech recording when you start speaking - Launch guard webcam when a significant sound is detected - Increase or decrease headphones volume when ambient noise pass
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    A simple software that speaks a text. You can type the text or appoint a file. Fala is just a frontend to festival. It's designed for GNOME, but if you have gtk, pyhton and festival you are able to run it.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    PhoneBlogger allows you to post to a weblog by phone. PhoneBlogger is written in VoiceXML, Python, and JavaScript.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A XFig based rapid prototype yeilding an audio speed alteration tool. This tool lets you arbitrarily alter the speed of audio files. It uses the WSOLA algorithm for audio speed alteration without pitch change.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB