Search Results for "audio linux" - Page 10

Sort By:

Showing 849 open source projects for "audio linux"

View related business solutions

Python Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

Demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Demucs (Deep Extractor for Music Sources) is a deep-learning framework for music source separation—extracting individual instrument or vocal tracks from a mixed audio file. The system is based on a U-Net-like convolutional architecture combined with recurrent and transformer elements to capture both short-term and long-term temporal structure. It processes raw waveforms directly rather than spectrograms, allowing for higher-quality reconstruction and fewer artifacts in separated tracks. The...

Downloads: 109 This Week

Last Update: 2025-10-12
See Project
2

Audio Webui

A webui for different audio related Neural Networks

Audio Webui is a Gradio-based web user interface that unifies a wide range of audio-related neural networks under a single, accessible front end. It is designed as an “all-in-one” environment where users can experiment with text-to-speech, voice cloning, generative music, and other neural audio models without writing boilerplate code. The project supports multiple back-end models and toolchains (such as Bark, RVC, AudioLDM, Audiocraft, and other text-to-audio or voice-cloning tools),...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
3

Fansly Downloader

Easy to use fansly.com content downloading tool

Fansly Downloader is the go-to app for all your bulk media downloading needs. Download photos, videos, audio, or any other media from Fansly, this powerful tool has got you covered! Say goodbye to the hassle of individually downloading each piece of media, now you can download them all or just some, with just a few clicks.

Downloads: 29 This Week

Last Update: 2024-08-29
See Project
4

MusicLM - Pytorch

Implementation of MusicLM music generation model in Pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch. They are basically using text-conditioned AudioLM, but surprisingly with the embeddings from a text-audio contrastive learned model named MuLan. MuLan is what will be built out in this repository, with AudioLM modified from the other repository to support the music generation needs here.

Downloads: 0 This Week

Last Update: 2023-09-06
See Project
Earn up to 16% annual interest with Nexo.
Access competitive interest rates on your digital assets.

Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.

Get started with Nexo.
5

StoryTeller

Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.

A multimodal AI story teller, built with Stable Diffusion, GPT, and neural text-to-speech (TTS). Given a prompt as an opening line of a story, GPT writes the rest of the plot; Stable Diffusion draws an image for each sentence; a TTS model narrates each line, resulting in a fully animated video of a short story, replete with audio and visuals. To develop locally, install dev dependencies and install pre-commit hooks. This will automatically trigger linting and code quality checks before each...

Downloads: 0 This Week

Last Update: 2023-08-22
See Project
6

find-similar

User-friendly library to find similar objects

The mission of the FindSimilar project is to provide a powerful and versatile open source library that empowers developers to efficiently find similar objects and perform comparisons across a variety of data types. Whether dealing with texts, images, audio, or more, our project aims to simplify the process of identifying similarities and enhancing decision-making. https://github.com/findsimilar/find-similar - GitHub repo http://demo.findsimilar.org/ - Demo project and...

1 Review

Downloads: 0 This Week

Last Update: 2023-11-12
See Project
7

MahaKurawa MP4V-A Extractor

This software is a tool to extract video and audio file that contained

This software is a tool to extract video and audio file that contained by a .MP4 format. This software will not convert any video and audio file from yout .mp4 file. This software just extract them as it is. This tool is made for that specific purpose. This tool "MahaKurawa MP4 V-A Extractor V.10" can be obtained for free on https://www.mahakurawa.my.id.

Downloads: 1 This Week

Last Update: 2023-08-31
See Project
8

pyst: Python for Asterisk

Pyst consists of a set of interfaces and libraries to allow programming of Asterisk from python. The library currently supports AGI, AMI, and the parsing of Asterisk configuration files. The library also includes debugging facilities for AGI. 2014-04-17: Moved the version control to GIT. To check out see the tab "Code". Note that the whole history including ancient CVS, then some time in monotone, then subversion was united into one GIT repository thanks to ESR's...

5 Reviews

Downloads: 0 This Week

Last Update: 2023-08-07
See Project
9

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch

A fully featured audio diffusion library, for PyTorch. Includes models for unconditional audio generation, text-conditional audio generation, diffusion autoencoding, upsampling, and vocoding. The provided models are waveform-based, however, the U-Net (built using a-unet), DiffusionModel, diffusion method, and diffusion samplers are both generic to any dimension and highly customizable to work on other formats. Note: no pre-trained models are provided here, this library is meant for research...

Downloads: 0 This Week

Last Update: 2023-03-29
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

VALL-E

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)

We introduce a language modeling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech which is hundreds of times larger than existing systems....

Downloads: 0 This Week

Last Update: 2023-04-14
See Project
11

NOW

No-code tool for creating a neural search solution in minutes

One line to host them all. Bootstrap your multimodal search case in minutes. NOW gives the world access to multimodal neural search with just one command. NOW supports various formats for uploading your dataset to your search application. You may either choose a demo dataset hosted by NOW, or use your own custom dataset, to build an application. NOW can support your custom data in the form of a DocumentArray, as a path to a local folder, or S3 bucket. You can choose a demo dataset to get...

Downloads: 0 This Week

Last Update: 2023-04-10
See Project
12

Debreate - Debian Package Builder

A utility for creating Debian packages (.deb)

Debreate is a utility to aid in creating Debian (.deb) packages. Currently it only supports binary packaging (note that the term "binary package" is used loosely, as such packages can contain scripts & non-code items such as media images, audio, & more) for personal distribution. Plans for using backends such as dh_make & debuild for creating source packages are in the works. But source packaging can be quite different & is a must if you want to get your packages into a distribution's...

15 Reviews

Downloads: 3 This Week

Last Update: 2023-05-12
See Project
13

Amiga Memories

A walk along memory lane

Amiga Memories is a project (started & released in 2013) that aims to make video programmes that can be published on the internet. The images and sound produced by Amiga Memories are 100% automatically generated. The generator itself is implemented in Squirrel, the 3D rendering is done on GameStart 3D. An Amiga Memories video is mostly based on a narrative. The purpose of the script is to define the spoken and written content. The spoken text will be read by a voice synthesizer (Text To...

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
14

NÜWA - Pytorch

Implementation of NÜWA, attention network for text to video synthesis

Implementation of NÜWA, state of the art attention network for text-to-video synthesis, in Pytorch. It also contains an extension into video and audio generation, using a dual decoder approach. It seems as though a diffusion-based method has taken the new throne for SOTA. However, I will continue on with NUWA, extending it to use multi-headed codes + hierarchical causal transformer. I think that direction is untapped for improving on this line of work. In the paper, they also present a way...

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
15

footswitch2basic

Audio Transcription software for Linux (Vlc) with a foot pedal

Footswitch 2 (Basic) is a media player for transcribers on Linux. This version is a stripped down version of Footswitch2, containing only the absolute essentials for transcription. Written in python and using the python bindings for VLC it allows a transcriber to control the audio or video with a footpedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within Libreoffice as well, making it useful for those who do not yet own a foot pedal. ...

Downloads: 2 This Week

Last Update: 2023-04-07
See Project
16

DeepMozart

Audio generation using diffusion models

Audio generation using diffusion models in PyTorch. The code is based on the audio-diffusion-pytorch repository.

Downloads: 0 This Week

Last Update: 2023-03-29
See Project
17

footswitch3

Audio Transcription software for Linux (Gstreamer) with a foot pedal

Footswitch 3 is a media player for transcribers on Linux. Written in python using the python bindings for Gstreamer it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within Libreoffice as well, making it useful for those who do not yet own a foot pedal/foot switch.

1 Review

Downloads: 3 This Week

Last Update: 2023-04-02
See Project
18

footswitch3basic

Audio Transcription software for Linux (Gstreamer) with a foot pedal

Footswitch3basic is a media player for transcribers on Linux. Written in python using the bindings for Gstreamer it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within Libreoffice as well, making it useful for those who do not yet own a foot pedal/foot switch.

Downloads: 0 This Week

Last Update: 2023-04-02
See Project
19

mrViewer

Flipbook, Image Viewer and Audio-Video Player

This project is no longer active. It has been replaced by mrv2 at: www.sourceforge.net/p/mrv2 A video player, interactive image viewer, and flipbook for use in VFX, 3D computer graphics and professional illustration.

11 Reviews

Downloads: 84 This Week

Last Update: 2023-04-10
See Project
20

Riffusion

Real-time music generation using stable diffusion techniques AI

Riffusion (hobby) is a Python-based open source library designed for real-time music and audio generation using stable diffusion techniques. Riffusion (hobby) works by generating and manipulating spectrogram images, which are then converted into playable audio clips, effectively bridging image-based diffusion models with sound synthesis. It implements a diffusion pipeline that supports prompt interpolation, allowing smooth transitions between different musical styles or prompts over time....

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
21

EnCodec

State-of-the-art deep learning based audio codec

Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. The model can...

Downloads: 1 This Week

Last Update: 2025-10-12
See Project
22

Tidal-Media-Downloader

Download 'TIDAL' Music On Windows/Linux/MacOs (PYTHON/C#)

Tidal-Media-Downloader is an application that lets you download videos and tracks from Tidal. It supports two versions, tidal-dl and tidal-gui. (This repository only contains tidal-dl, and the release isn't the newest gui version.)

Downloads: 68 This Week

Last Update: 2024-11-13
See Project
23

Footswitch2 Equaliser

15 band pulseaudio equaliser

15 band audio equaliser originally intended for use with Footswitch2 transcription tools but will happily run independently. This Linux python utility provides a GUI front end to modify the sound of audio using pulseaudio's ladspa module with Steve Hariss' mbeq_1197 and Frank Neumann's split_1406 plugins. Multiband equaliser and Mono to Stereo splitter respectively.

1 Review

Downloads: 0 This Week

Last Update: 2022-11-16
See Project
24

psgdump

Dump psg/ym chip tune files to txt and midi format

PSGDump tool is parser and converter for chip tune files. It supports PSG and YM input file formats, focusing on AY/YM chip tunes from ZX Spectrum and Atari ST. The tool produces text output of notes played and creates multi-track MIDI file.

Downloads: 0 This Week

Last Update: 2022-09-19
See Project
25

cdcover

cdcover allows the creation of inlay-sheets for jewel cd-cases. It is written in Python and uses Python-TK to provide an easy to use GUI. cdcover can access a CDDB-Server to get title and track-Info for audio CDs.

Downloads: 5 This Week

Last Update: 2022-10-05
See Project