audio server free download

16 projects for "audio server" with 2 filters applied:

Artificial Intelligence BSD Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Atera - an All-in-one platform for IT management
Ideal for IT departments and MSPs (managed service providers)

Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!

Try Atera now
1

Markdownify MCP Server

Convert files and web content into clean, usable Markdown easily

Markdownify MCP is a Model Context Protocol server that converts many types of files and web content into clean Markdown. It supports formats such as PDFs, images, audio with transcription, DOCX, XLSX, and PPTX, along with web sources like YouTube transcripts, Bing results, and general webpages. Markdownify MCP is designed to simplify content extraction and make data easier to read, share, and reuse in structured workflows.

Downloads: 0 This Week

Last Update: 2026-05-02
See Project
2

WhatsApp MCP Server

WhatsApp MCP server enabling AI access to chats and messaging

whatsapp-mcp is an open source Model Context Protocol (MCP) server that enables AI agents to interact directly with a user’s WhatsApp account through a structured interface. It acts as a bridge between WhatsApp and large language models, allowing controlled access to messages, chats, and contacts. whatsapp-mcp is composed of two main components: a Go-based bridge that connects to the WhatsApp Web API and stores data locally, and a Python-based MCP server that exposes tools for AI interaction. ...

Downloads: 0 This Week

Last Update: 2026-03-17
See Project
3

WhisperLive

A nearly-live implementation of OpenAI's Whisper

WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently. It can handle microphone input, pre-recorded audio files, and network streams such as RTSP and HLS, making it flexible for live events, monitoring, or accessibility workflows. ...

Downloads: 6 This Week

Last Update: 2026-06-02
See Project
4

DramaBox

super expressive prompting model based on ltx2.3

...The project includes a warm inference server, a CLI workflow, and a Gradio app for interactive generation. It also supports additional LoRA training on top of DramaBox, making it possible to adapt the model for a specific speaker, language flavor, or performance style. DramaBox is aimed at developers, researchers, and audio creators who need highly expressive English TTS for character dialogue, narrative audio, prototyping, or voice experimentation.

Downloads: 1 This Week

Last Update: 2026-05-23
See Project
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
5

Riffusion App

Stable diffusion for real-time music generation (web app)

...It relies on a separate inference server to perform model computations, enabling flexible deployment depending on hardware capabilities. Users can input prompts or modify parameters to influence the style, tempo, and characteristics of generated audio, making it useful for creative exploration and prototyping.

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
6

OuteTTS

Interface for OuteTTS models

OuteTTS is an interface library for running OuteTTS text-to-speech models across a range of backends, making it easier to deploy the same model on different hardware and runtimes. It provides a high-level Interface API that wraps model configuration, speaker handling, and audio generation so you can focus on integrating speech into your application rather than wiring up low-level engines. The project supports multiple backends including llama.cpp (Python bindings and server), Hugging Face Transformers, ExLlamaV2, VLLM and a JavaScript interface via Transformers.js, allowing it to run on CPUs, NVIDIA CUDA GPUs, AMD ROCm, Vulkan-capable GPUs, and Apple Metal. ...

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
7

Sopro TTS

A lightweight text-to-speech model with zero-shot voice cloning

...Users can install it with standard Python tools, run a demo server locally, and experiment with CLI or Python API usage for producing synthetic speech.

Downloads: 0 This Week

Last Update: 2026-02-06
See Project
8

LiveKit Agents

Framework for building realtime multimodal voice AI agents apps

...It is designed to run server-side and can integrate with various AI model providers and realtime APIs to support different application requirements. LiveKit Agents also includes tools for scheduling and managing agent tasks, making it easier to connect users to automated assistants in live communication scenarios.

Downloads: 3 This Week

Last Update: 2 days ago
See Project
9

Riffusion

Real-time music generation using stable diffusion techniques AI

...Riffusion (hobby) serves as the core implementation for audio and image processing, providing essential building blocks for generating music from text prompts. It includes both developer-oriented tools and user-facing components such as a command-line interface and an interactive Streamlit application for experimentation. Additionally, it can run as a Flask server to expose model inference through an API, enabling integration with other applications or services.

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
Error to trace to log to deploy. One click. No SSH.
Catch the cause before the pager goes off.

AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.

Free 30 days.
10

RAVL, Recognition And Vision Library.

General C++ Library, with modules for Computer Vision, Pattern Recognition and much more.

Downloads: 1 This Week

Last Update: 2020-04-22
See Project
11

millosh's workshop

A collection of software made by Milos Rancic.

Downloads: 0 This Week

Last Update: 2016-09-23
See Project
12

Bermuda Text-to-Speech

This project includes basic NLP and DSP techniques for Text-to-Speech

See TTS demo at: http://rslp.racai.ro/index.php?page=tts This is an entirely written in JAVA project which includes a set of tools and methods designed to enable Multilingual Text-to-Speech (TTS) synthesis. We currently support English and Romanian but we will soon train more models and make them available for download. If you want to read more about our other NLP and TTS tools check out http://nlptools.racai.ro.

Downloads: 0 This Week

Last Update: 2014-03-24
See Project
13

Epos TTS System

Epos is a language independent rule-driven Text-to-Speech (TTS) system

Epos is a language independent rule-driven Text-to-Speech (TTS) system primarily designed to serve as a research tool. Epos is (or tries to be) independent of the language processed, linguistic description method, and computing environment.

1 Review

Downloads: 3 This Week

Last Update: 2015-03-31
See Project
14

Webvoice

Webvoice is a text to speech cgi program. You can embed a link in a html page to send things you want to say, via sound. No software is required on the client side. Festival and sox are needed on the server. Webvoice has its own interface (if needed).

Downloads: 0 This Week

Last Update: 2014-04-24
See Project
15

FreeTTS-Android Interfacing

An Android application for interfacing with a FreeTTS server, in order to generate Text to Speech application inside Android emulator.

Downloads: 0 This Week

Last Update: 2015-08-05
See Project
16

Internet Text Radio

Internet Text Radio,designed around freeTTS,connects to a text server and tunes to a channel.The server starts pumping text data for that channel to the client, which converts text to speech, playing back the text as audio,like an internet radio station.

Downloads: 0 This Week

Last Update: 2014-03-17
See Project