Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "audio speaker software" - Page 2

x

Sort By:

Relevance

Clear All Filters

OS

Linux 163
Windows 145
Mac 112
More...
BSD 84
ChromeOS 56
Desktop Operating Systems 7
Mobile Operating Systems 6
Server Operating Systems 2
Embedded Operating Systems 1

Category

Multimedia 141
Software Development 62
Artificial Intelligence 54
Scientific/Engineering 13
System 13
Communications 9
Internet 7
Education 5
Business 4
Desktop Environment 4
Games 4
Database 3
Formats and Protocols 3
Productivity 2
Religion and Philosophy 1
Terminals 1
Text Editors 1

License

OSI-Approved Open Source 178
Creative Commons Attribution License 7
Public Domain 5
Other License 2

Translations

Programming Language

Python 199
C++ 32
C 24
Java 10
JavaScript 7
More...
Perl 5
C# 4
Objective C 4
PHP 4
MATLAB 3
Ruby 3
Unix Shell 3
ActionScript 2
TypeScript 2
Kotlin 1
Lisp 1
Oz 1
PL/SQL 1
Tcl 1
VHDL/Verilog 1
Visual Basic .NET 1
XBasic 1
Zope 1

Status

Production/Stable 57
Beta 44
Pre-Alpha 21
Alpha 14
More...
Planning 12
Mature 5
Inactive 2

Showing 199 open source projects for "audio speaker software"

View related business solutions

Python Clear Filters & Widen Search

Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
Build Securely on Azure with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
1

pyglet

pyglet is a cross-platform windowing and multimedia library for Python

Pyglet is a cross-platform windowing and multimedia library for Python, intended for developing games and other visually rich applications. It supports windowing, input event handling, OpenGL graphics, loading images and videos, and playing sounds and music.

Downloads: 4 This Week

Last Update: 2026-04-05
See Project
2

StyleTTS 2

Towards Human-Level Text-to-Speech through Style Diffusion

...The architecture uses a two-stage training process and leverages an auxiliary speech language model to guide generation toward more natural and coherent utterances. StyleTTS2 supports both single-speaker and multi-speaker configurations, with the ability to sample or transfer styles from reference audio, making it powerful for expressive TTS and character voices. The repository includes training scripts, configuration files, and pre-trained auxiliary modules such as a text aligner, pitch extractor, and PL-BERT-based linguistic encoder.

Downloads: 5 This Week

Last Update: 2025-11-28
See Project
3

Groq Python

The official Python Library for the Groq API

Groq Python is the official Python SDK for the Groq REST API, giving Python developers straightforward access to Groq’s LLM, chat, audio, and other AI services. Through this library, you can call Groq’s models from Python code — for example to request chat completions, code generation, transcription, or any supported endpoint — using idiomatic Python syntax. The SDK handles authentication (via environment variable or parameter), defines proper type-safe request/response data types, and...

Downloads: 4 This Week

Last Update: 17 hours ago
See Project
4

txtai

Build AI-powered semantic search applications

txtai executes machine-learning workflows to transform data and build AI-powered semantic search applications. Traditional search systems use keywords to find data. Semantic search applications have an understanding of natural language and identify results that have the same meaning, not necessarily the same keywords. Backed by state-of-the-art machine learning models, data is transformed into vector representations for search (also known as embeddings). Innovation is happening at a rapid...

Downloads: 7 This Week

Last Update: 2026-04-29
See Project
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
5

MoviePy

Video editing with Python

MoviePy is a Python module for video editing, which can be used for basic operations (like cuts, concatenations, title insertions), video compositing (a.k.a. non-linear editing), video processing, or to create advanced effects. It can read and write the most common video formats, including GIF. MoviePy is an open source software originally written by Zulko and released under the MIT licence. It works on Windows, Mac, and Linux, with Python 2 or Python 3. The code is hosted on Github, where...

Downloads: 22 This Week

Last Update: 2025-05-21
See Project
6

video-use

Edit videos with Claude Code

...Designed to work with Claude Code, it automates the entire editing process—from cutting clips to rendering the final output—without requiring manual timelines or complex software interfaces. The system intelligently analyzes audio transcripts and visual cues to make precise, context-aware editing decisions. It supports a wide range of content types, including interviews, tutorials, montages, and talking-head videos. By combining structured text representations with on-demand visual previews, it minimizes processing overhead while maintaining high-quality results. ...

Downloads: 9 This Week

Last Update: 2026-05-03
See Project
7

DocArray

The data structure for multimodal data

DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to multimodal world: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. Data...

Downloads: 0 This Week

Last Update: 2025-03-21
See Project
8

Videomass

Videomass is a free, open source and cross-platform GUI for FFmpeg

Videomass is a free, open-source graphical interface for FFmpeg designed to make advanced video and audio processing accessible to both beginners and experienced users. Built in Python using wxPython, it provides a cross-platform environment for managing encoding, conversion, and editing tasks through a visual interface. The software supports multitasking operations, allowing users to process multiple media files simultaneously. It offers extensive configuration options while also providing presets to simplify common workflows. ...

Downloads: 6 This Week

Last Update: 2026-04-24
See Project
9

Triton Inference Server

The Triton Inference Server provides an optimized cloud

Triton Inference Server is an open-source inference serving software that streamlines AI inferencing. Triton enables teams to deploy any AI model from multiple deep learning and machine learning frameworks, including TensorRT, TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS FIL, and more. Triton supports inference across cloud, data center, edge, and embedded devices on NVIDIA GPUs, x86 and ARM CPU, or AWS Inferentia.

Downloads: 10 This Week

Last Update: 2026-04-28
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.

Start Free
10

DocsGPT

Private AI platform for agents, enterprise search and RAG pipelines

DocsGPT is an open-source AI platform for deploying private RAG pipelines, AI agents, and enterprise search on your own infrastructure. Connect any data source (PDFs, DOCX, CSV, Excel, HTML, audio, GitHub, databases, URLs) and get accurate, hallucination-free answers with source citations. Choose your LLM: OpenAI, Anthropic, Google Gemini, or local models. Works with Qdrant, MongoDB, and Elasticsearch and more. Deploy via Docker or Kubernetes with full data sovereignty. Build...

Downloads: 0 This Week

Last Update: 2026-04-28
See Project
11

Android Emulator Container Scripts

Minimal scripts to run the emulator in a container for various systems

android-emulator-container-scripts turns the Android Emulator into a cloud-native service you can run in Docker and Kubernetes, so teams can provision ephemeral Android devices on demand. It includes scripts and container images that configure the emulator for headless operation, wire up networking, and expose endpoints for ADB and web access. A built-in WebRTC bridge lets you stream the emulator screen to a browser with interactive input, which is ideal for CI dashboards, remote debugging,...

Downloads: 4 This Week

Last Update: 2025-10-10
See Project
12

JamTools

JamTools is a cross-platform gadget set software

JamTools is a multifunctional desktop utility suite designed to provide a collection of tools for productivity, media processing, and system enhancements within a single application. It integrates various features such as file management, multimedia handling, and system utilities into a unified interface. The project emphasizes ease of use while offering advanced functionality for handling common tasks efficiently. It includes support for media-related operations, often leveraging FFmpeg for...

Downloads: 1 This Week

Last Update: 2026-04-28
See Project
13

Multimodal

TorchMultimodal is a PyTorch library

This project, also known as TorchMultimodal, is a PyTorch library for building, training, and experimenting with multimodal, multi-task models at scale. The library provides modular building blocks such as encoders, fusion modules, loss functions, and transformations that support combining modalities (vision, text, audio, etc.) in unified architectures. It includes a collection of ready model classes—like ALBEF, CLIP, BLIP-2, COCA, FLAVA, MDETR, and Omnivore—that serve as reference...

Downloads: 0 This Week

Last Update: 2026-01-12
See Project
14

Jina

Build cross-modal and multimodal applications on the cloud

Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...

Downloads: 0 This Week

Last Update: 2024-11-12
See Project
15

Media Downloader

A simple tool to download video and audio form internet sources

This software enables users to download video and audio content from a variety of supported websites for personal and educational use. It is free and open source, promoting transparency, flexibility, and community-driven development.

Downloads: 9 This Week

Last Update: 2025-05-02
See Project
16

VCClient

Software that uses AI to perform real-time voice conversion

VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...

Downloads: 22 This Week

Last Update: 2026-03-23
See Project
17

footswitch2

Audio Transcription software for Linux (Vlc) with a foot pedal

Footswitch 2 is a media player for transcribers on Linux. Written in python and using the python bindings for VLC it allows a transcriber to control the audio or video with a USB footpedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within Libreoffice as well, making it useful for those who do not yet own a footpedal/footswitch. Control of the media player from LibreOffice can be via Hotkeys or an integrated...

Downloads: 9 This Week

Last Update: 2026-04-09
See Project
18

MiniMonitor

Lightweight capture card & mic monitoring with minimal resources

MiniMonitor is a lightweight Windows application designed for monitoring capture cards and microphones with minimal system impact. Ideal for Elgato and other capture devices, it detects connected video and audio inputs, tests microphones, and provides real-time video and audio playback. Users can select devices through a simple GUI, toggle Fullscreen display, and quickly check functionality without heavy software overhead. Built with Python, OpenCV, PyAudio, and Tkinter, MiniMonitor is perfect for streamers, content creators, and anyone needing low-latency monitoring of capture cards and audio sources.

Downloads: 14 This Week

Last Update: 2025-11-12
See Project
19

LinAMP Music Player

A lightweight and fast music player to play your music libraries

A lightweight and fast music player to play your music libraries. https://github.com/shampuan/LinAMP

3 Reviews

Downloads: 3 This Week

Last Update: 2025-12-04
See Project
20

SPPAS

SPPAS - the automatic annotation and analyses of speech

SPPAS is a scientific computer software package written and maintained by Brigitte Bigi of the Laboratoire Parole et Langage, in Aix-en-Provence, France. Available for free, with open source code, there is simply no other package for linguists to simple use in the automatic annotations of speech, the analyses of any kind of annotated data and the conversion of annotated files. SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic...

Downloads: 25 This Week

Last Update: 2026-04-06
See Project
21

FluidPatcher

A performance-oriented patch interface for FluidSynth

FluidPatcher is a performance-oriented interface for FluidSynth built using wxpython to create a simple GUI that allows live editing, selecting, and playing of patches. A patch is a collection of settings such as soundfont presets for each MIDI channel, control-change/sysex messages to send when the patch is selected, and midi router or effects settings. Groups of patches are stored in banks, which are saved as human-readable and -editable YAML files. This allows a musician to easily create...

Downloads: 8 This Week

Last Update: 5 days ago
See Project
22

Tartube

Download videos/channels/playlists from YouTube and many other sites

Tartube is a GUI front-end for youtube-dl, yt-dlp and other compatible video downloaders. It is written in Python 3 / Gtk 3 and runs on MS Windows, Linux, BSD and MacOS.

Downloads: 932 This Week

Last Update: 2026-01-20
See Project
23

GeigerLog

Python program for Geiger counters and Environmental Sensors

GeigerLog is a combination of data logger, presenter, and analyzer, and is now released in version 2.1! Based on Python (Version 3), it runs on Linux, Windows, Macs, Raspberry Pi, and other. GeigerLog is WiFi-enabled for servers and clients; it can be monitored by SmartPhone. Initially developed for Geiger counters, it is now a more universal tool, which equally well handles data like temperature, air-pressure, humidity, CO2. Continues to support Geiger counters like GQ Electronic's...

7 Reviews

Downloads: 80 This Week

Last Update: 2025-11-23
See Project
24

Pybris

B language compiler written in Python targeting RISVM

Pybris is a compiler written in Python using Pyparsing for the B Programming Language. The compiler emits a variant of Bitmario RISVM assembly. The practical goal of the project is to provide a way to develop digital signal processing (DSP) effects for the Competent Audio library that is a friendlier alternative to writing RISVM assembly by hand. Pybris is written for Python 2.7, but has also been tested to run with Python 3.8.10.

Downloads: 0 This Week

Last Update: 2024-07-08
See Project
25

ES-DOS A

ES-DOS is a application for windows that looks like MS-DOS but is not a OS

Downloads: 0 This Week

Last Update: 2026-01-24
See Project

Previous
1
You're on page 2
3
4
5
6
Next

Related Searches

tartube

android emulator

youtube

emulator

pyaudio-0.2.14-cp314-cp314-win_amd64.whl

semantic search

moviepy

unstructured data

cloud server

chatgpt

Related Categories

Multimedia

Software Development

Artificial Intelligence

Scientific/Engineering

System

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise