Showing 16 open source projects for "output"

  • 1
    Moshi

    A speech-text foundation model for real-time dialogue

    ...Moshi models two streams of audio: one corresponds to Moshi, and the other to the user. At inference, the user's stream is taken from the audio input, and Moshi's stream is sampled from the model's output. Alongside these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter-codebook dependencies for a given time step, while a large, 7B-parameter Temporal Transformer models the temporal dependencies.
    Downloads: 2 This Week
  • 2
    PersonaPlex

    PersonaPlex code

    PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional...
    Downloads: 0 This Week
  • 3
    annyang!

    Speech recognition for your site

    annyang is a tiny JavaScript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2 KB, and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one-word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays...
    Downloads: 0 This Week
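The three command features in the annyang description (named variables, splats, optional words) can be illustrated with a small pattern matcher. This is a minimal Python sketch of the idea, not annyang's own code: it assumes the `:name`, `*name`, and `(optional)` command syntax, and the helper names `command_to_regex` and `match_command` are hypothetical.

```python
import re
from typing import List, Optional

# Recognises the three special forms inside a command string:
#   (optional words)   :named_variable   *splat
TOKEN = re.compile(r"\s*\((?P<opt>[^)]*)\)\s*|:(?P<named>\w+)|\*(?P<splat>\w+)")


def command_to_regex(command: str) -> "re.Pattern[str]":
    """Translate a command string into a compiled regular expression."""
    parts = []
    pos = 0
    for m in TOKEN.finditer(command):
        # Literal text between special forms is matched verbatim.
        parts.append(re.escape(command[pos:m.start()]))
        if m.group("opt") is not None:
            # Optional phrase: may be present or absent, not captured.
            parts.append(r"\s*(?:" + re.escape(m.group("opt")) + r")?\s*")
        elif m.group("named") is not None:
            # Named variable: captures exactly one word.
            parts.append(r"(\S+)")
        else:
            # Splat: greedily captures the rest of the utterance.
            parts.append(r"(.*)")
        pos = m.end()
    parts.append(re.escape(command[pos:]))
    return re.compile("^" + "".join(parts) + "$", re.IGNORECASE)


def match_command(command: str, utterance: str) -> Optional[List[str]]:
    """Return the captured arguments, or None if the utterance does not match."""
    m = command_to_regex(command).match(utterance.strip())
    return list(m.groups()) if m else None


if __name__ == "__main__":
    print(match_command("show me *tag", "show me cute kittens"))
    print(match_command("calculate :month stats", "calculate October stats"))
    print(match_command("say hello (to my little) friend", "say hello friend"))
```

A named variable fails to match when the user says more than one word in its place, which is why multi-word arguments need a splat.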
  • 4
    Simple TTS Reader

    A small clipboard reader

    Simple TTS Reader is a small utility that reads text from your clipboard using Microsoft Speech API. Whenever you copy any text, the app instantly converts it into spoken words. Select your preferred speech engine from those installed on your system, such as Microsoft Zira, and adjust speed and volume for personalized playback. The application can also be minimized to the system tray. Plus, it is free and comes with an intuitive interface that makes it accessible to everyone.
    Downloads: 71 This Week
  • 5
    VCClient

    Software that uses AI to perform real-time voice conversion

    ...The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a graphical user interface and API access, making it suitable for casual users as well as developers who want to integrate voice transformation into their own applications. The project also supports GPU acceleration, enabling faster inference and smoother real-time performance on compatible hardware. Additionally, it includes tools for training and managing voice models, giving users the ability to create personalized voice profiles.
    Downloads: 21 This Week
  • 6
    Voice-Changer-MeowVoice

    Meow Voice Changer is a lightweight, real-time voice modulation tool

    Meow Voice Changer is a lightweight, real-time voice modulation tool for Windows, purrrfect for streaming and voice chats. Inspired by Battlestar Galactica Cylon and Viper comms but capable of many more effects, it offers pitch, tempo, wobble, ring modulation, flanging, echo, EQ, and radio effects like distortion and static. Process microphone input with a customizable effect chain. It features a clean console UI and low RAM/CPU usage. Free for non-commercial use, it supports virtual audio cables...
    Downloads: 3 This Week
  • 7
    RemoteTTS

    Tool to remotely activate Text-To-Speech (TTS) on a server

    The tool provides a simple TCP/UDP interface that lets a remote machine produce TTS output.
    Downloads: 0 This Week
  • 8
    Text to Speech for Video

    Create WAV files for video character speech by typing in dialogue

    Choose from the "voices" available, and type in what you want the computer to say. A WAV file called sounds.wav is stored in the output subfolder. Output is intended primarily for users who need speech for animated characters in videos.
    Downloads: 0 This Week
  • 9
    eNTranslator

    To aid translation of satsangs of Paramhamsa Nithyananda

    ...The auto-generated translations are then enriched by human editing using an easy graphical user interface. Timestamp information may be synced, and a subtitle file or a simple text output may be generated. Additionally, there are plans to use Google voice tools to add a voice-over from the translated text. Finally, the subtitles and translated audio (if any) would be muxed with the original video and uploaded.
    Downloads: 0 This Week
  • 10
    Accelerated Feature Extraction Tool

    A fast GPU-accelerated feature extraction tool for speech analysis

    ...It uses GPU acceleration if a compatible GPU is available (CUDA as well as OpenCL; NVIDIA, AMD, and Intel GPUs are supported). The CPU SSE intrinsic instruction set is used when no compatible GPU is present. The output files are stored in HTK format. The software is developed at the Department of Cybernetics at the University of West Bohemia in Pilsen.
    Downloads: 0 This Week
  • 11
    NoiseGator (Noise Gate)

    A simple noise gate app intended for use with VoIP apps like Skype.

    Ever wanted to cut out background noise when talking with others on Skype? Now it's possible! NoiseGator is a lightweight noise gate application that routes audio from an audio input to an audio output. The audio level is analysed in real time: if the average level is higher than the threshold, the audio passes through as normal. However, if the average level goes below the threshold, the gate closes and the audio is cut. When used with a virtual audio cable, it can act as a noise gate for either a sound input (microphone) or a sound output (speakers). ...
    Downloads: 329 This Week
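The gating rule in the NoiseGator description (pass audio while the average level is above a threshold, cut it otherwise) can be sketched in a few lines. This is only an illustration of the general noise-gate technique, not NoiseGator's implementation: the use of RMS as the "average level", the block size, the threshold value, and the function names are all assumptions.

```python
import math
from typing import List, Sequence


def rms_level(block: Sequence[float]) -> float:
    """Average (RMS) level of a block of samples in the range [-1.0, 1.0]."""
    if not block:
        return 0.0
    return math.sqrt(sum(s * s for s in block) / len(block))


def noise_gate(samples: Sequence[float],
               threshold: float = 0.05,
               block_size: int = 4) -> List[float]:
    """Pass blocks whose level exceeds the threshold; silence the rest.

    Assumption: a block exactly at the threshold is treated as closed.
    """
    out: List[float] = []
    for start in range(0, len(samples), block_size):
        block = samples[start:start + block_size]
        if rms_level(block) > threshold:
            out.extend(block)                # gate open: audio passes unchanged
        else:
            out.extend([0.0] * len(block))   # gate closed: audio is cut
    return out


if __name__ == "__main__":
    loud = [0.5, -0.5, 0.5, -0.5]       # speech-like block
    quiet = [0.01, -0.01, 0.01, -0.01]  # background-noise block
    print(noise_gate(loud + quiet))
```

Real gates usually add attack and release times so the gate does not chop the starts and ends of words; that is omitted here to keep the threshold logic visible.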
  • 12
    Osmosis TTS

    Text to Speech application with searching capabilities.

    Osmosis TTS is a text-to-speech application with a built-in browser and the ability to easily search for terms in the text using configurable search providers, including search engines and dictionaries. It is particularly useful for language learning, as one can easily look up foreign words using external dictionary websites. Text is spoken by copying it to the clipboard, and text can be queued up by continuing to copy new text to the clipboard while Osmosis TTS is speaking....
    Downloads: 1 This Week
  • 13
    Virtual Speech Mechanism System

    Virtual Speech Mechanism System converts text to voice.

    Virtual Speech Mechanism System is a .NET-based application written in C#. It can convert text to speech either in interactive mode or by taking input from a text file. Its output can either be directed to the speakers or saved as a WAV file that can be played with any audio player. The output WAV can have 1 or 2 channels; the default is 2. The speech rate can be set from -10 to 10, and the volume from 0 to 100%. VSMS uses a binary library called NIDHI, named after the famous Indi-Pop singer "Sunidhi Chauhan". ...
    Downloads: 1 This Week
  • 14
    Automatically translate English/French/German text to German/French/English text and output speech in the appropriate language. All automagically, with the power of the inter-webs.
    Downloads: 0 This Week
  • 15
    EQ2Speak is source code that parses a log file and turns it into speech. The application was developed to let EQ2 game players listen to "TELLS" from the game, spoken through their output devices.
    Downloads: 0 This Week
  • 16
    A machine translation program designed to accept verbal or text input and provide a text or speech-synthesized translation as output. It makes use of 3 current open-source projects. The source is currently C/C++ and embedded Perl.
    Downloads: 0 This Week