Search Results for "audio speaker software" - Page 3

Showing 199 open source projects for "audio speaker software"

1

PyExe - YT DL Mk42 (b) [I.S.A]

PyExe - YouTube Downloader Mark 42 type-B [I.S.A]

'PyExe - YT DL Mk42 (b)' is a desktop application developed using Python 3.6.8 and other add-on libraries. It can download YouTube video and audio. 'PyExe - YT DL Mk42 (b)' has two parts: 1) Download Video - downloads a YouTube video (.mp4) 2) Download Audio - downloads the audio of a YouTube video (.mp3)

Downloads: 0 This Week

Last Update: 2024-06-29
See Project
2

Coqui TTS

A deep learning toolkit for Text-to-Speech, battle-tested in research

TTS is a library for advanced Text-to-Speech generation. Built on the latest research, it was designed to achieve the best trade-off among ease of training, speed, and quality. TTS comes with pre-trained models and tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. High-performance Deep Learning models for Text2Speech tasks. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings...

Downloads: 18 This Week

Last Update: 2023-12-12
See Project
3

Intention Repeater MAX

Repeating your Intentions to aid in manifestation

Please see the README.txt. The ServitorConnect 4443 and Python Daemon and Intention Repeater Android are better because repeating once-per-hour is better than millions of times per second (or even 3Hz). The archive bundle includes binaries and source code for: MAX and Simple Intention Repeaters CUDA version for Windows/Linux Memory Frequency Generator Multi-Format to WAV Repeater Android app Sourcecode File/Image Writers Nesting Files Creator ...

1 Review

Downloads: 3 This Week

Last Update: 2025-04-17
See Project
4

YehDown

Yeahdown: Easy-to-use video downloader for Windows

Yeahdown is a straightforward, user-friendly Windows application designed to simplify downloading videos and audio from popular websites like YouTube and Vimeo. Perfect for non-technical users, it offers an intuitive interface and fast, reliable downloads. Key features include improved download speeds, support for multiple major video platforms, and real-time updates for new features. Tested on Windows 11.

Downloads: 7 This Week

Last Update: 2025-07-20
See Project
5

vocal-separate

An extremely simple tool for separating vocals and background music

...Users can drag and drop an audio or video file onto the interface to begin separation, choosing between two, four, or five stems, which allows isolating specific components like vocals, bass, drums, or piano depending on the chosen model. After processing, the tool outputs separate WAV files for each extracted stem, making it easy to export and use in audio editing or remix software.

Downloads: 4 This Week

Last Update: 2026-02-17
See Project
6

Dynamite Download Manager

PyIDM remake for downloading stuff

Dynamite Download Manager is a powerful download manager equipped with multi-connections and a high-speed engine, designed to enhance your downloading experience. By utilizing multiple connections, DDM splits files into smaller segments and downloads them simultaneously, significantly increasing download speeds. Its advanced high-speed engine ensures faster and more efficient downloading, even for large files. DDM supports a wide variety of file formats, enabling you to download general...

Downloads: 0 This Week

Last Update: 2025-09-08
See Project
7

vits_chinese

Best practice TTS based on BERT and VITS

vits_chinese is an implementation of the VITS end-to-end text-to-speech (TTS) architecture tailored for Chinese (and possibly multilingual) speech synthesis. VITS is a model combining variational autoencoders (VAEs), normalizing flows, adversarial learning, and a stochastic duration predictor — a design that enables generation of natural, expressive speech, capturing variations in rhythm and prosody. By customizing or porting VITS for Chinese, this project aims to produce high-quality TTS...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
8

VALL-E X

Open source implementation of Microsoft's VALL-E X zero-shot TTS model

VALL-E-X is an open-source implementation of Microsoft’s VALL-E X zero-shot text-to-speech model, focused on multilingual, cross-lingual voice cloning. It is capable of synthesizing speech in English, Chinese, and Japanese from text while mimicking the voice characteristics of a speaker given only a short 3–10 second prompt. The model attempts to match not just timbre, but also tone, pitch, emotion, and prosody of the reference audio, resulting in highly personalized output. VALL-E-X supports zero-shot cross-lingual synthesis, meaning a monolingual speaker’s voice can be used to speak other languages without additional training. ...

Downloads: 2 This Week

Last Update: 2025-11-28
See Project
9

MahaKurawa.My.ID MP4 VA Extract

MahaKurawa.My.ID MP4 VA Extract is a tool to extract mp4 file content

MahaKurawa.My.ID MP4 VA Extract is a tool to extract the video and audio content of MP4 files. It can also extract MKV files and a single SSA subtitle file. This software does not convert the video or audio from an MP4 file; it extracts them as they are. The tool is made for that specific purpose. "MahaKurawa.My.ID MP4 VA Extract v.1.0.3.1" can be obtained for free at https://www.mahakurawa.my.id.

Downloads: 2 This Week

Last Update: 2023-12-14
See Project
10

DarkAudacity

A customized version of Audacity

...Audacity and DarkAudacity come from a community effort. Many people have contributed to the audio code. Because they are Open Source, anyone is allowed to read and modify the source code. DarkAudacity is a variation on the Audacity software, made possible because Audacity is Open Source.

2 Reviews

Downloads: 43 This Week

Last Update: 2023-10-06
See Project
11

MahaKurawa MP4V-A Extractor

This software is a tool to extract video and audio file that contained

This software is a tool to extract the video and audio files contained in an .MP4 file. It does not convert the video or audio from your .mp4 file; it extracts them as they are. The tool is made for that specific purpose. "MahaKurawa MP4 V-A Extractor V.10" can be obtained for free at https://www.mahakurawa.my.id.

Downloads: 0 This Week

Last Update: 2023-08-31
See Project
12

VALL-E

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)

We introduce a language modeling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech which is hundreds of times larger than existing systems. VALL-E emerges in-context learning capabilities and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as an acoustic prompt. ...

Downloads: 4 This Week

Last Update: 2023-04-14
See Project
13

Video 2 Audio The Converter [I.S.A]

Video 2 Audio The Converter [Improved.Simplified.Alternative]

'Video 2 Audio : The converter' is a desktop application developed using Python 3.6.8 and other add-on libraries. It converts video files into audio files. Video 2 Audio : The converter has two modes: 1) Single file - converts one video file into an audio file. 2) Multiple files - converts all the video files in a folder (directory) into audio files. Compatible only with Windows.

1 Review

Downloads: 0 This Week

Last Update: 2023-06-07
See Project
14

PyExe - YT DL I.S.A

PyExe - YT DL [Improved.Simplified.Alternative]

'PyExe - YT DL' is a desktop application developed using Python 3.6.8 and other add-on libraries. It can download YouTube video and audio. 'PyExe - YT DL' has two parts: 1) Download Video - downloads a YouTube video (.mp4) 2) Download Audio - downloads the audio of a YouTube video (.mp3) Compatible only with Windows.

Downloads: 0 This Week

Last Update: 2023-06-30
See Project
15

Txt-2-Mp3 6.3 Mark 2 [I.S.A]

Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]

'Txt2Mp3' is a desktop application developed using Python 3.6.8 and other add-on libraries. It converts text into audio (.mp3) files using the gTTS (Google Text-to-Speech) library. Compatible only with Windows.

Downloads: 0 This Week

Last Update: 2023-06-07
See Project
16

Debreate - Debian Package Builder

A utility for creating Debian packages (.deb)

Debreate is a utility to aid in creating Debian (.deb) packages. Currently it only supports binary packaging (note that the term "binary package" is used loosely, as such packages can contain scripts & non-code items such as media images, audio, & more) for personal distribution. Plans for using backends such as dh_make & debuild for creating source packages are in the works. But source packaging can be quite different & is a must if you want to get your packages into a distribution's...

15 Reviews

Downloads: 6 This Week

Last Update: 2023-05-12
See Project
17

footswitch3

Audio transcription software for Linux (GStreamer) with a foot pedal

Footswitch 3 is a media player for transcribers on Linux. Written in Python using the Python bindings for GStreamer, it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within LibreOffice as well, making it useful for those who do not yet own a foot pedal/foot switch. Control of the media player from LibreOffice can be via hotkeys or an integrated...

1 Review

Downloads: 1 This Week

Last Update: 2023-04-02
See Project
18

footswitch2basic

Audio transcription software for Linux (VLC) with a foot pedal

Footswitch 2 (Basic) is a media player for transcribers on Linux. It is a stripped-down version of Footswitch2, containing only the absolute essentials for transcription. Written in Python using the Python bindings for VLC, it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within LibreOffice as well, making it useful for those who do...

Downloads: 0 This Week

Last Update: 2023-04-07
See Project
19

footswitch3basic

Audio transcription software for Linux (GStreamer) with a foot pedal

Footswitch3basic is a media player for transcribers on Linux. Written in Python using the bindings for GStreamer, it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within LibreOffice as well, making it useful for those who do not yet own a foot pedal/foot switch. Control of the media player from LibreOffice can be via hotkeys or an integrated...

Downloads: 0 This Week

Last Update: 2023-04-02
See Project
20

lv2gen

Generate boilerplate for LV2 plugins

A Python package for describing audio and synth plugins and generating boilerplate code from that description, including LV2’s manifest.ttl and a simplified GUI for the Mod Dwarf.

Downloads: 0 This Week

Last Update: 2023-02-08
See Project
21

YTD-Downloader

YTD-Downloader is a GUI-based Desktop Application.

YTD-Downloader is a GUI-based Desktop Application. Users can download audio and video from YouTube using this software.

Downloads: 11 This Week

Last Update: 2022-11-28
See Project
22

Footswitch2 Equaliser

15-band PulseAudio equaliser

...PulseAudio must be up and running for this software to work.

1 Review

Downloads: 0 This Week

Last Update: 2022-11-16
See Project
23

AI Atelier

Based on the Disco Diffusion, version of the AI art creation software

Based on Disco Diffusion, we have developed a Chinese & English version of the AI art creation software "AI Atelier". We offer both Text-To-Image models (Disco Diffusion and VQGAN+CLIP) and Text-To-Text models (GPT-J-6B and GPT-NEOX-20B) as options. Making available complete source code of licensed works and modifications, which include larger works using a licensed work, under the same license. Copyright and license notices must be preserved.

Downloads: 1 This Week

Last Update: 2023-03-23
See Project
24

SVoice (Speech Voice Separation)

We provide a PyTorch implementation of the paper Voice Separation

...Separate models are trained for different speaker counts, and the largest-capacity model dynamically determines the actual number of speakers in a mixture. The repository includes all necessary scripts for training, dataset preparation, distributed training, evaluation, and audio separation.

Downloads: 4 This Week

Last Update: 2 days ago
See Project
25

AugLy

A data augmentations library for audio, image, text, and video

AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...

Downloads: 0 This Week

Last Update: 2022-03-29
See Project