speaker free download

Showing 7 open source projects for "speaker"

View related business solutions

Multimedia Python Clear Filters & Widen Search

$300 Free Credits to Build on Google Cloud
New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.

Claim $300 Free
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

Music Assistant

Music Assistant is a free, opensource Media library manager

Music Assistant Server is the core backend for Music Assistant, a free and open-source music library manager for local and online music sources. It connects streaming services, local files, metadata providers, and many speaker ecosystems into one centralized music system. The server is designed to run on an always-on device such as a Raspberry Pi, NAS, Intel NUC, or similar home server. It can work as a standalone product, but it is especially tailored for Home Assistant users who want automation, voice control, and smart-home playback workflows. Music Assistant supports features such as library matching, metadata enrichment, gapless playback, crossfade, volume normalization, synchronized playback, announcements, and queue transfers. ...

Downloads: 8 This Week

Last Update: 4 days ago
See Project
2

footswitch2

Audio Transcription software for Linux (Vlc) with a foot pedal

Footswitch 2 is a media player for transcribers on Linux. Written in python and using the python bindings for VLC it allows a transcriber to control the audio or video with a USB footpedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within Libreoffice as well, making it useful for those who do not yet own a footpedal/footswitch. Control of the media player from LibreOffice can be via Hotkeys or an integrated...

Downloads: 4 This Week

Last Update: 2026-04-09
See Project
3

footswitch3

Audio Transcription software for Linux (Gstreamer) with a foot pedal

Footswitch 3 is a media player for transcribers on Linux. Written in python using the python bindings for Gstreamer it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within Libreoffice as well, making it useful for those who do not yet own a foot pedal/foot switch. Control of the media player from LibreOffice can be via Hotkeys or an integrated...

1 Review

Downloads: 6 This Week

Last Update: 2023-04-02
See Project
4

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and development of automatic distant speech recognition.

Downloads: 1 This Week

Last Update: 2019-08-21
See Project
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
5

avimmir

(audio, video, image) Multimedia Multimodal Information Retrieval

audio classification; speaker segmentation; speaker clustering; speaker recognition; spoken document retrieval; image retrieval; video retrieval; etc.

Downloads: 0 This Week

Last Update: 2013-11-23
See Project
6

Proxies Audio

An ALSA plugin to implement a digital crossover/equalizer for the Proxies speaker project.

Downloads: 0 This Week

Last Update: 2014-03-08
See Project
7

Stortinget når det passer

Stortinget on-Demand (Stortinget når det passer) is a system to record, store and split meeting-recordings according to their minutes so that an end-user may view the speach of a specific speaker.

Downloads: 0 This Week

Last Update: 2016-08-30
See Project