Showing 19 open source projects for "virtual audio"

  • 1
    Fun Audio Chat

    Large Audio Language Model built for natural interactions

    Fun Audio Chat is an interactive voice-first conversational AI platform designed to let users engage in natural spoken dialogue with large language models in real time, turning speech into context-aware responses while maintaining a smooth back-and-forth experience. It combines speech recognition, audio processing, and AI generation so users can speak simply and receive spoken replies, enabling applications such as virtual assistants, voice bots, and hands-free chat interfaces. ...
    Downloads: 0 This Week
  • 2
    Voice-Pro

    Comprehensive Gradio WebUI for audio processing

    Voice-Pro is a Gradio WebUI for transcription, translation, and text-to-speech. It installs with one click and creates a virtual environment using Miniconda that runs completely separate from the Windows system (fully portable). It supports real-time transcription and translation as well as batch mode.
    Downloads: 32 This Week
  • 3
    Miso TTS

    Miso TTS is an 8-billion-parameter, highly emotive text-to-speech model

    Miso TTS is an advanced 8-billion-parameter text-to-speech model developed by Miso Labs for generating highly expressive and natural-sounding conversational speech. Built on an RVQ Transformer architecture inspired by Sesame CSM, it combines a powerful Llama-based backbone with an autoregressive audio decoder to produce high-quality audio from text. The model supports both standard speech synthesis and voice-conditioned generation using optional audio prompts for voice cloning. Miso TTS generates Mimi audio codes and can leverage conversation history to create more contextually aware and realistic dialogue. Designed for local deployment, it offers watermarking by default to help promote responsible use of generated audio. ...
    Downloads: 3 This Week
  • 4
    LiveAvatar

    Streaming Real-time Audio-Driven Avatar Generation

    ...LiveAvatar focuses on delivering not just high-quality visuals but also the responsiveness necessary for immersive conversational experiences, making it suitable for advanced AI agents, virtual assistants, and interactive streaming contexts.
    Downloads: 0 This Week
  • 5
    HY-World 1.5

    A Systematic Framework for Interactive World Modeling

    HY-WorldPlay is a Hunyuan AI project focusing on immersive multimodal content generation and interaction within virtual worlds or simulated environments. It aims to empower AI agents with the capability to both understand and generate multimedia content — including text, audio, image, and potentially 3D or game-world elements — enabling lifelike dialogue, environmental interpretations, and responsive world behavior. The platform targets use cases in digital entertainment, game worlds, training simulators, and interactive storytelling, where AI agents need to adapt to real-time user inputs and changes in environment state. ...
    Downloads: 1 This Week
  • 6
    Diffusers

    State-of-the-art diffusion models for image and audio generation

    ...Pretrained models that can be used as building blocks, and combined with schedulers, for creating your own end-to-end diffusion systems. We recommend installing Diffusers in a virtual environment from PyPI or Conda. For more details about installing PyTorch and Flax, please refer to their official documentation.
    Downloads: 6 This Week
  • 7
    OpenPiano — Virtual Piano for Windows

    Desktop piano playable with a PC keyboard, mouse, or MIDI device.

    OpenPiano is a Windows desktop piano application that allows you to play, practice, and record music using your PC keyboard, mouse, or a MIDI device. It supports real-time playback using SoundFonts and provides on-screen piano layouts for visual feedback while playing. OpenPiano is designed to run entirely locally. It does not require accounts, cloud services, or an internet connection for core functionality. Project links: Website:...
    Downloads: 204 This Week
  • 8
    Streamer-Sales

    Large language model for live-stream sales hosts

    Streamer-Sales is an open-source large language model system designed specifically for e-commerce live streaming and automated product promotion. The project focuses on generating persuasive product descriptions and live presentation scripts that mimic the style of professional online sales hosts. By analyzing product characteristics and marketing information, the model can produce engaging explanations that emphasize benefits, features, and emotional appeal to encourage viewers to make...
    Downloads: 0 This Week
  • 9
    GLM-4-Voice

    GLM-4-Voice | End-to-End Chinese-English Conversational Model

    GLM-4-Voice is an open-source speech-enabled model from ZhipuAI, extending the GLM-4 family into the audio domain. It integrates advanced voice recognition and generation with the multimodal reasoning capabilities of GLM-4, enabling smooth natural interaction via spoken input and output. The model supports real-time speech-to-text transcription, spoken dialogue understanding, and text-to-speech synthesis, making it suitable for conversational AI, virtual assistants, and accessibility applications. ...
    Downloads: 2 This Week
  • 10
    MetaVoice-1B

    Foundational model for human-like, expressive TTS

    ...With that scale and dataset volume, MetaVoice aims to push the boundary of what open-source TTS models can achieve: high fidelity, natural prosody, and robustness even for edge cases. As a foundational model, it can serve as the backbone for downstream tasks — such as voice generation, voice cloning, speech generation for virtual agents, or even audio production pipelines.
    Downloads: 0 This Week
  • 11
    TITTSE

    Two Integrated Text To Speech Engines uses MMS & Silero

    ...All you need is a text file ending in the tittse extension with 4 header lines including the TITTSE language code (see documentation for your language), the 'base' file name for the audio files TITTSE creates, voice gender (girl or boy), offset (file numbers added to base file name start at this number). After those first four lines, every paragraph is created as a single audio file. Install_TITTSE.sh is a Bash script that installs Python 3.1 and all needed dependencies in a virtual environment on your Linux system. ...
    Downloads: 0 This Week
  • 12
    VCClient

    Software that uses AI to perform real-time voice conversion

    VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...
    Downloads: 44 This Week
  • 13
    Audio Webui

    A web UI for different audio-related neural networks

    Audio Webui is a Gradio-based web user interface that unifies a wide range of audio-related neural networks under a single, accessible front end. It is designed as an “all-in-one” environment where users can experiment with text-to-speech, voice cloning, generative music, and other neural audio models without writing boilerplate code.
    Downloads: 0 This Week
  • 14
    PMS for REGZA

    A DLNA-compliant UPnP Media Server

    PMS for REGZA is a DLNA-compliant media server. As a fork of the well-known "PS3 Media Server", it aims especially to improve functionality on TOSHIBA REGZA TVs while preserving compatibility with other renderers. Details: Home Page: http://www32.atwiki.jp/pms_regza
    Downloads: 9 This Week
  • 15
    CorEngine
    CorEngine is a work-in-progress, OpenGL-powered 3D game engine designed to help independent game developers with quick prototyping and game/virtual environment creation. The engine supports a standard set of features, such as skeletal animation, post-processing, Lua/C programming, physics powered by Bullet Physics, GUI, and 2D/3D audio.
    Downloads: 1 This Week
  • 16
    A musician's live-performance VST host. This software is intended to help integrate VST technology into your live performances on stage, whether that means using virtual instruments or effects, or automating audio routing.
    Downloads: 0 This Week
  • 17
    Cyn.in - Open Source Group Collaboration
    Cyn.in helps teams build collaborative knowledge by sharing and discussing digital content within a secure, unified application. It combines the capabilities of wikis, social networks, blogs, files, microblogs, and discussions into a secure enterprise platform.
    Downloads: 1 This Week
  • 18
    The Virtual Drum Machine
    Downloads: 0 This Week
  • 19
    Sonar 2D is a live signal-processing application in which an audio signal is used to navigate between 'zones' in a virtual 2D space. Each zone represents a given processing module, which, when activated, is applied to the sound.
    Downloads: 0 This Week