Search Results for "audio speaker software" - Page 2

Showing 199 open source projects for "audio speaker software"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    pyglet

    pyglet

    pyglet is a cross-platform windowing and multimedia library for Python

    Pyglet is a cross-platform windowing and multimedia library for Python, intended for developing games and other visually rich applications. It supports windowing, input event handling, OpenGL graphics, loading images and videos, and playing sounds and music.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    StyleTTS 2

    StyleTTS 2

    Towards Human-Level Text-to-Speech through Style Diffusion

    ...The architecture uses a two-stage training process and leverages an auxiliary speech language model to guide generation toward more natural and coherent utterances. StyleTTS2 supports both single-speaker and multi-speaker configurations, with the ability to sample or transfer styles from reference audio, making it powerful for expressive TTS and character voices. The repository includes training scripts, configuration files, and pre-trained auxiliary modules such as a text aligner, pitch extractor, and PL-BERT-based linguistic encoder.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    Groq Python

    Groq Python

    The official Python Library for the Groq API

    Groq Python is the official Python SDK for the Groq REST API, giving Python developers straightforward access to Groq’s LLM, chat, audio, and other AI services. Through this library, you can call Groq’s models from Python code — for example to request chat completions, code generation, transcription, or any supported endpoint — using idiomatic Python syntax. The SDK handles authentication (via environment variable or parameter), defines proper type-safe request/response data types, and...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    txtai

    txtai

    Build AI-powered semantic search applications

    txtai executes machine-learning workflows to transform data and build AI-powered semantic search applications. Traditional search systems use keywords to find data. Semantic search applications have an understanding of natural language and identify results that have the same meaning, not necessarily the same keywords. Backed by state-of-the-art machine learning models, data is transformed into vector representations for search (also known as embeddings). Innovation is happening at a rapid...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    MoviePy

    MoviePy

    Video editing with Python

    MoviePy is a Python module for video editing, which can be used for basic operations (like cuts, concatenations, title insertions), video compositing (a.k.a. non-linear editing), video processing, or to create advanced effects. It can read and write the most common video formats, including GIF. MoviePy is an open source software originally written by Zulko and released under the MIT licence. It works on Windows, Mac, and Linux, with Python 2 or Python 3. The code is hosted on Github, where...
    Downloads: 22 This Week
    Last Update:
    See Project
  • 6
    video-use

    video-use

    Edit videos with Claude Code

    ...Designed to work with Claude Code, it automates the entire editing process—from cutting clips to rendering the final output—without requiring manual timelines or complex software interfaces. The system intelligently analyzes audio transcripts and visual cues to make precise, context-aware editing decisions. It supports a wide range of content types, including interviews, tutorials, montages, and talking-head videos. By combining structured text representations with on-demand visual previews, it minimizes processing overhead while maintaining high-quality results. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 7
    DocArray

    DocArray

    The data structure for multimodal data

    DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to multimodal world: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. Data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Videomass

    Videomass

    Videomass is a free, open source and cross-platform GUI for FFmpeg

    Videomass is a free, open-source graphical interface for FFmpeg designed to make advanced video and audio processing accessible to both beginners and experienced users. Built in Python using wxPython, it provides a cross-platform environment for managing encoding, conversion, and editing tasks through a visual interface. The software supports multitasking operations, allowing users to process multiple media files simultaneously. It offers extensive configuration options while also providing presets to simplify common workflows. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Triton Inference Server

    Triton Inference Server

    The Triton Inference Server provides an optimized cloud

    Triton Inference Server is an open-source inference serving software that streamlines AI inferencing. Triton enables teams to deploy any AI model from multiple deep learning and machine learning frameworks, including TensorRT, TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS FIL, and more. Triton supports inference across cloud, data center, edge, and embedded devices on NVIDIA GPUs, x86 and ARM CPU, or AWS Inferentia.
    Downloads: 10 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 10
    DocsGPT

    DocsGPT

    Private AI platform for agents, enterprise search and RAG pipelines

    DocsGPT is an open-source AI platform for deploying private RAG pipelines, AI agents, and enterprise search on your own infrastructure. Connect any data source (PDFs, DOCX, CSV, Excel, HTML, audio, GitHub, databases, URLs) and get accurate, hallucination-free answers with source citations. Choose your LLM: OpenAI, Anthropic, Google Gemini, or local models. Works with Qdrant, MongoDB, and Elasticsearch and more. Deploy via Docker or Kubernetes with full data sovereignty. Build...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Android Emulator Container Scripts

    Android Emulator Container Scripts

    Minimal scripts to run the emulator in a container for various systems

    android-emulator-container-scripts turns the Android Emulator into a cloud-native service you can run in Docker and Kubernetes, so teams can provision ephemeral Android devices on demand. It includes scripts and container images that configure the emulator for headless operation, wire up networking, and expose endpoints for ADB and web access. A built-in WebRTC bridge lets you stream the emulator screen to a browser with interactive input, which is ideal for CI dashboards, remote debugging,...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    JamTools

    JamTools

    JamTools is a cross-platform gadget set software

    JamTools is a multifunctional desktop utility suite designed to provide a collection of tools for productivity, media processing, and system enhancements within a single application. It integrates various features such as file management, multimedia handling, and system utilities into a unified interface. The project emphasizes ease of use while offering advanced functionality for handling common tasks efficiently. It includes support for media-related operations, often leveraging FFmpeg for...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Multimodal

    Multimodal

    TorchMultimodal is a PyTorch library

    This project, also known as TorchMultimodal, is a PyTorch library for building, training, and experimenting with multimodal, multi-task models at scale. The library provides modular building blocks such as encoders, fusion modules, loss functions, and transformations that support combining modalities (vision, text, audio, etc.) in unified architectures. It includes a collection of ready model classes—like ALBEF, CLIP, BLIP-2, COCA, FLAVA, MDETR, and Omnivore—that serve as reference...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Media Downloader

    Media Downloader

    A simple tool to download video and audio form internet sources

    This software enables users to download video and audio content from a variety of supported websites for personal and educational use. It is free and open source, promoting transparency, flexibility, and community-driven development.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 16
    VCClient

    VCClient

    Software that uses AI to perform real-time voice conversion

    VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...
    Downloads: 22 This Week
    Last Update:
    See Project
  • 17
    footswitch2

    footswitch2

    Audio Transcription software for Linux (Vlc) with a foot pedal

    Footswitch 2 is a media player for transcribers on Linux. Written in python and using the python bindings for VLC it allows a transcriber to control the audio or video with a USB footpedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within Libreoffice as well, making it useful for those who do not yet own a footpedal/footswitch. Control of the media player from LibreOffice can be via Hotkeys or an integrated...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 18
    MiniMonitor

    MiniMonitor

    Lightweight capture card & mic monitoring with minimal resources

    MiniMonitor is a lightweight Windows application designed for monitoring capture cards and microphones with minimal system impact. Ideal for Elgato and other capture devices, it detects connected video and audio inputs, tests microphones, and provides real-time video and audio playback. Users can select devices through a simple GUI, toggle Fullscreen display, and quickly check functionality without heavy software overhead. Built with Python, OpenCV, PyAudio, and Tkinter, MiniMonitor is perfect for streamers, content creators, and anyone needing low-latency monitoring of capture cards and audio sources.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 19
    LinAMP Music Player

    LinAMP Music Player

    A lightweight and fast music player to play your music libraries

    A lightweight and fast music player to play your music libraries. https://github.com/shampuan/LinAMP
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    SPPAS

    SPPAS

    SPPAS - the automatic annotation and analyses of speech

    SPPAS is a scientific computer software package written and maintained by Brigitte Bigi of the Laboratoire Parole et Langage, in Aix-en-Provence, France. Available for free, with open source code, there is simply no other package for linguists to simple use in the automatic annotations of speech, the analyses of any kind of annotated data and the conversion of annotated files. SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 21
    FluidPatcher

    FluidPatcher

    A performance-oriented patch interface for FluidSynth

    FluidPatcher is a performance-oriented interface for FluidSynth built using wxpython to create a simple GUI that allows live editing, selecting, and playing of patches. A patch is a collection of settings such as soundfont presets for each MIDI channel, control-change/sysex messages to send when the patch is selected, and midi router or effects settings. Groups of patches are stored in banks, which are saved as human-readable and -editable YAML files. This allows a musician to easily create...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 22
    Tartube

    Tartube

    Download videos/channels/playlists from YouTube and many other sites

    Tartube is a GUI front-end for youtube-dl, yt-dlp and other compatible video downloaders. It is written in Python 3 / Gtk 3 and runs on MS Windows, Linux, BSD and MacOS.
    Leader badge
    Downloads: 932 This Week
    Last Update:
    See Project
  • 23
    GeigerLog

    GeigerLog

    Python program for Geiger counters and Environmental Sensors

    GeigerLog is a combination of data logger, presenter, and analyzer, and is now released in version 2.1! Based on Python (Version 3), it runs on Linux, Windows, Macs, Raspberry Pi, and other. GeigerLog is WiFi-enabled for servers and clients; it can be monitored by SmartPhone. Initially developed for Geiger counters, it is now a more universal tool, which equally well handles data like temperature, air-pressure, humidity, CO2. Continues to support Geiger counters like GQ Electronic's...
    Leader badge
    Downloads: 80 This Week
    Last Update:
    See Project
  • 24
    Pybris

    Pybris

    B language compiler written in Python targeting RISVM

    Pybris is a compiler written in Python using Pyparsing for the B Programming Language. The compiler emits a variant of Bitmario RISVM assembly. The practical goal of the project is to provide a way to develop digital signal processing (DSP) effects for the Competent Audio library that is a friendlier alternative to writing RISVM assembly by hand. Pybris is written for Python 2.7, but has also been tested to run with Python 3.8.10.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ES-DOS A
    ES-DOS is a application for windows that looks like MS-DOS but is not a OS
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB