Search Results for "audio speaker software" - Page 4


Showing 199 open source projects for "audio speaker software"


Filter: Python

1

Spleeter

Deezer source separation library including pretrained models

Spleeter is the Deezer source separation library with pretrained models, written in Python and using TensorFlow. It makes it easy to train music source separation models (assuming you have a dataset of isolated sources), and provides already trained state-of-the-art models for performing various flavours of separation. The 2-stem and 4-stem models have state-of-the-art performance on the musdb dataset. Spleeter is also very fast as it can perform separation of audio files to 4 stems 100x...

1 Review

Downloads: 41 This Week

Last Update: 2021-09-03
2

pydatascope

Software oscilloscope using Python and tkinter

Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually.

1 Review

Downloads: 0 This Week

Last Update: 2021-09-25
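
The pydatascope entry above mentions displaying data by samples, time or frequency. As a rough illustration of what a frequency view involves, here is a minimal sketch of a naive discrete Fourier transform in plain Python. It is not taken from pydatascope; the function names `dft_magnitudes` and `dominant_frequency` are hypothetical, and a real oscilloscope would more likely use an FFT library.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Return the normalized magnitude of each DFT bin from 0 up to the Nyquist bin."""
    n = len(samples)
    magnitudes = []
    for k in range(n // 2 + 1):
        # Correlate the signal with a complex sinusoid at bin k.
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(samples))
        magnitudes.append(abs(acc) / n)
    return magnitudes


def dominant_frequency(samples, sample_rate):
    """Return the frequency in Hz of the strongest non-DC bin (hypothetical helper)."""
    magnitudes = dft_magnitudes(samples)
    peak_bin = max(range(1, len(magnitudes)), key=lambda k: magnitudes[k])
    return peak_bin * sample_rate / len(samples)


# Illustrative input: one second of an 8 Hz sine sampled at 64 Hz.
sample_rate = 64
tone = [math.sin(2 * math.pi * 8 * i / sample_rate) for i in range(sample_rate)]
peak_hz = dominant_frequency(tone, sample_rate)
```

The same list of samples can be plotted against its index (sample view) or against index divided by `sample_rate` (time view); only the frequency view needs the transform.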
3

OpenDAFF

Directional Audio File Format

OpenDAFF is a free, open-source software package for directional audio data, such as the directivity of microphones and speakers, as well as head-related transfer functions (HRTFs).

Downloads: 5 This Week

Last Update: 2021-01-08
4

LiVES

LiVES is a Video Editing System. It is designed to be simple to use, y

LiVES mixes realtime video performance and non-linear editing in one professional quality application. It is designed to be simple to use, yet powerful. It is small in size, yet it has many advanced features. Using LiVES, you can start editing and making video right away, without having to worry about formats, frame sizes, or framerates. It is a very flexible tool which is used by both professional VJs and video editors - mix and switch clips from the keyboard, use dozens of realtime...

15 Reviews

Downloads: 5 This Week

Last Update: 2020-11-08
5

Linux-Intelligent-Ocr-Solution

Easy-OCR solution and Tesseract trainer for GNU/Linux

Linux-Intelligent-Ocr-Solution (Lios) is free and open source software for converting print into text using either a scanner or a camera. It can also produce text from scanned images from other sources, such as a PDF, an image, a folder containing images, or a screenshot. The program is fully accessible to visually impaired users. A Tesseract Trainer GUI is also shipped with this package. Forum: https://groups.google.com/forum/#!forum/lios Video Tutorial:...

5 Reviews

Downloads: 5 This Week

Last Update: 2020-10-19
6

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...

Downloads: 1 This Week

Last Update: 2021-05-24
7

GROWbox Supervisor System (GROWSS)

Automated Plant Environment Growing System using Raspberry Pi

GROWbox Supervisor System (GROWSS) is a grow box or grow cabinet environmental controller. GROWSS uses a Raspberry Pi (RPI) and Grove sensors to monitor and control the environment in your grow box or cabinet. Environmental factors such as temperature, humidity, soil moisture, and smoke (from fire) are monitored and displayed on a terminal connected to the RPI and on a mobile app. GROWSS also controls the exhaust fan and grow lights based on the time interval (i.e., 12 hours on/12 hours off) &...

Downloads: 0 This Week

Last Update: 2020-07-19
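
The GROWSS entry above describes switching grow lights on a time interval such as 12 hours on/12 hours off. A minimal sketch of that kind of interval check follows. It is an illustration only, not GROWSS code: the function name `lights_should_be_on` and its parameters are hypothetical, and driving an actual relay from a Raspberry Pi is left out.

```python
def lights_should_be_on(hours_since_cycle_start, on_hours=12.0, off_hours=12.0):
    """Return True while the repeating on/off cycle is in its 'on' phase.

    hours_since_cycle_start: hours elapsed since the first 'on' phase began.
    on_hours / off_hours: phase lengths (12/12 assumed, per the description).
    """
    cycle_length = on_hours + off_hours
    # Position inside the current cycle, in the range [0, cycle_length).
    position = hours_since_cycle_start % cycle_length
    return position < on_hours


# Illustrative schedule: sample the light state every 6 hours over two days.
schedule = [(h, lights_should_be_on(h)) for h in range(0, 48, 6)]
```

Using the elapsed time modulo the cycle length keeps the check stateless, so the controller can be restarted at any point without losing its place in the cycle.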
8

GoodByeCatpcha

Solver ReCaptcha v2 Free

An async Python library that automates solving ReCAPTCHA v2 image and audio challenges using the Mozilla DeepSpeech, PocketSphinx, Microsoft Azure, Google Speech, and Amazon Transcribe speech-to-text engines. It also uses image recognition to detect the object suggested in the captcha. Built with Pyppeteer (a Chrome automation framework similar to Puppeteer), PyDub for easily converting MP3 files into WAV, aiohttp for a minimalistic async web server, and Python’s built-in AsyncIO for convenience.

Downloads: 2 This Week

Last Update: 2020-06-24
9

GUIDOLib

The GUIDOLib provides a powerful engine for the graphic rendering of music scores, based on the Guido Music Notation format. It supports Linux, Mac OS X, Windows, Android and iOS operating systems. A Java JNI interface is available, as well as a JavaScript version of the library. A Web API has also been designed, allowing the engine to be deployed as a Web service.

4 Reviews

Downloads: 0 This Week

Last Update: 2023-04-11
10

Defox text to speech and downloader

Read written or imported text aloud offline, or download the speech online.

This software is designed to convert text to speech and download the converted speech. Description: • Installation setup in two languages (English, French) • Two areas, called text reading and speech downloading • Many languages supported in the download center. Note 1: I'm still a student and I'm not in the software design industry, so I may lack software-making skills. I'm worried about that. Note 2: When you double-click on the software, it may take a few seconds...

1 Review

Downloads: 0 This Week

Last Update: 2019-09-27
11

NeneBot v1.0b

Portuguese Bot Development - Intuitive Software

This process is taking a bit longer, due to a recent house move. The program is now based on audio replies. Work is still in progress.

Downloads: 0 This Week

Last Update: 2019-05-11
12

Deepvoice3_pytorch

PyTorch implementation of convolutional neural networks

An open source implementation of Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning.

Downloads: 1 This Week

Last Update: 2024-08-13
13

Swami Project

A SoundFont editor and other software for editing, managing and sharing sample based MIDI instrument files for computer music composition. Support for other formats is planned.

3 Reviews

Downloads: 2 This Week

Last Update: 2019-03-09
14

Mirrorcast

Open Source Alternative to Chromecast, Mirror Desktop and Play media r

...Mirrorcast uses about the same amount of system resources as Google Chrome's cast feature. The delay is less than 1 second on most networks. To achieve this, we will use existing FOSS software such as ffmpeg, mpv, and omxplayer.

Downloads: 3 This Week

Last Update: 2023-08-04
15

TunesViewer

Easy podcast access in Linux, Android

A small, easy-to-use program to access iTunes U media and podcasts in Linux and Android.

16 Reviews

Downloads: 3 This Week

Last Update: 2020-10-28
16

AngelReader

An E-book, Audio-book, & Library Loader in One Application

AngelReader: A minimalist but powerful GUI application that has the capacity to load [1] E-books in plain text format with the least use of both software and hardware resources. It can also load [2] Audio-books with the basic functions of play, stop, pause, and resume with the same minimalist economy that doesn't hog computer resources. When used in integration with the AngelReader Library Selector, it can function as a mini library management system for books in electronic formats. ...

Downloads: 0 This Week

Last Update: 2018-10-16
17

DC-TTS

TensorFlow Implementation of DC-TTS: yet another text-to-speech model

DC-TTS is a TensorFlow implementation of the DC-TTS architecture, a fully convolutional text-to-speech system designed to be efficiently trainable while producing natural speech. It follows the “Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention” paper, but the author adapts and extends the design to make it practical for real experiments. The model is split into two networks: Text2Mel, which maps text to mel-spectrograms, and SSRN...

Downloads: 0 This Week

Last Update: 2025-11-28
18

OpenBroadcaster Server

Broadcast Automation Software LPFM IPTV

OpenBroadcaster has a simple and effective web-based automation system with a scheduler to conveniently manage your station from just about anywhere, through the creation of content lists that can play out over both terrestrial and internet radio and TV, using a mobile-friendly 'drag-and-drop' interface. Setting up repeating content, such as station identification, public service announcements or advertisements, is "point and click" easy.

Downloads: 0 This Week

Last Update: 2022-09-14
19

Lip Reading

Cross Audio-Visual Recognition using 3D Architectures

The input pipeline must be prepared by the users. This code aims to provide an implementation of Coupled 3D Convolutional Neural Networks for audio-visual matching. Lip-reading can be a specific application of this work. Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenarios. The approach of AVR systems is to leverage the extracted information from one modality to improve the recognition ability of the other modality by complementing the missing information. ...

Downloads: 2 This Week

Last Update: 2022-08-11
20

DreamStudio by Volocian Studios

Free yourself

Volocian believes first and foremost that financial status shouldn’t be a barrier to entry in multimedia content creation. Whether you want to design a product or website, record a band, remix a song, or produce a feature length movie with professional post production graphics and effects, Volocian™ wants to help, even if you’re using borrowed hardware with no previous experience. We provide affordable products and services for any budget, including cost-free solutions for education and demo...

8 Reviews

Downloads: 7 This Week

Last Update: 2017-10-04
21

JAVT - Just Another Voice Transformer

Just Another Speech Recognition and Text to Speech software.

JAVT, or Just Another Voice Transformer (formerly called Just Another Video Transcriber), is speech recognition software that also supports text-to-speech and simple media conversion. JAVT allows you to convert video files to WAV audio files using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and have JAVT read it out for you through text-to-speech conversion.

Downloads: 3 This Week

Last Update: 2020-08-19
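
The JAVT entry above describes converting a video file to a WAV file with ffmpeg before transcription. A minimal sketch of how such an ffmpeg invocation could be assembled from Python is shown here. It is not JAVT's own code: the function name `build_wav_command`, the file names, and the 16 kHz mono defaults are assumptions chosen for illustration, and the command is only built, not executed.

```python
def build_wav_command(video_path, wav_path, sample_rate=16000, channels=1):
    """Build an ffmpeg argument list that extracts audio as 16-bit PCM WAV.

    The sample rate and channel count are assumed defaults, not values
    documented by JAVT.
    """
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-i", video_path,        # input video
        "-vn",                   # drop the video stream
        "-acodec", "pcm_s16le",  # 16-bit little-endian PCM audio
        "-ar", str(sample_rate), # output sample rate in Hz
        "-ac", str(channels),    # number of audio channels
        wav_path,
    ]


# Hypothetical file names, for illustration only.
command = build_wav_command("talk.mp4", "talk.wav")
```

With ffmpeg installed, the list could be passed to `subprocess.run(command, check=True)`; passing a list rather than a shell string avoids quoting problems with file names that contain spaces.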
22

aeneas

Automagically synchronize audio and text (aka forced alignment)

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.

Downloads: 4 This Week

Last Update: 2024-06-21
23

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and Python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and...

Downloads: 0 This Week

Last Update: 2019-08-21
24

randomcd

burns a random selection of audio files from your collection onto a CD

randomcd is a python script that allows you to copy random playlists based on your collection of audio files onto a CD or a USB drive. You can either specify a directory to explore, or let the program use your player's database (only clementine, banshee and rhythmbox are currently supported). If appropriate software is installed, files can be normalised before they're copied. Specific genres can also be excluded from the compilation.

Downloads: 0 This Week

Last Update: 2016-09-23
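
The randomcd entry above describes picking a random playlist from a collection, optionally excluding genres, so that it fits on a CD or USB drive. A minimal sketch of that selection step might look like the following. It is not randomcd's implementation: the function name `pick_random_tracks`, the track dictionary layout, and the 80-minute capacity are assumptions made for illustration.

```python
import random


def pick_random_tracks(tracks, capacity_seconds, excluded_genres=(), rng=None):
    """Pick tracks in random order until no further track fits the capacity.

    tracks: list of dicts with 'title', 'genre' and 'seconds' keys (assumed layout).
    excluded_genres: genre names to leave out, compared case-insensitively.
    rng: optional random.Random instance, useful for reproducible runs.
    """
    rng = rng or random.Random()
    excluded = {genre.lower() for genre in excluded_genres}
    candidates = [t for t in tracks if t["genre"].lower() not in excluded]
    rng.shuffle(candidates)

    chosen, used = [], 0
    for track in candidates:
        # Skip tracks that would overflow the disc; shorter ones may still fit.
        if used + track["seconds"] <= capacity_seconds:
            chosen.append(track)
            used += track["seconds"]
    return chosen


# Illustrative collection; titles and durations are made up.
collection = [
    {"title": "A", "genre": "Rock", "seconds": 2400},
    {"title": "B", "genre": "Jazz", "seconds": 1800},
    {"title": "C", "genre": "Podcast", "seconds": 3000},
    {"title": "D", "genre": "Rock", "seconds": 1500},
]
# 80 minutes (4800 seconds) is assumed as the audio CD capacity.
playlist = pick_random_tracks(collection, 4800, excluded_genres=["podcast"],
                              rng=random.Random(1))
```

Passing a seeded `random.Random` makes the selection repeatable for testing, while omitting it gives a different compilation on each run.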
25

Fast Forward

Free video editor to convert, cut, trim, stream select and encode

Fast Forward is free video editing software that allows you to convert, cut, trim, remove streams, encode and customise a variety of parameters such as frame rate, bitrate, frame size and output file size. Fast Forward can encode H264, MPEG2 or Xvid video, as well as Dolby Digital AC3, Dolby Digital Plus eAC3+, AAC and Vorbis audio. It is very useful for removing ads from recorded TV programs, or combining the .VOB files from a DVD file system.

Downloads: 2 This Week

Last Update: 2019-06-23