Showing 490 open source projects for "audio for mac"

  • 1
    Piano transcription

    Task of transcribing piano recordings into MIDI files

    Piano transcription is an open-source high-resolution piano transcription system by ByteDance that converts raw audio recordings of piano performance into symbolic MIDI files — detecting note onsets, offsets, pitch, velocity, and even pedal usage. The system is implemented in Python (PyTorch) and is capable of accurate transcription of polyphonic piano recordings, even with complex passages and pedal techniques, making it suitable for classical piano music. By using this transcription tool,...
    Downloads: 1 This Week
  • 2
    Music Source Separation

    Separate audio recordings into individual sources

    Music Source Separation is a PyTorch-based open-source implementation for the task of separating a music (or audio) recording into its constituent sources — for example isolating vocals, instruments, bass, accompaniment, or background from a mixed track. It aims to give users the ability to take any existing song and decompose it into separate stems (vocals, accompaniment, etc.), or to train custom separation models on their own datasets (e.g. for speech enhancement, instrument isolation, or...
    Downloads: 5 This Week
  • 3
    youtube-dl

    Download videos from YouTube (and more sites)

    ...It requires the Python interpreter (2.6, 2.7, or 3.2+), and it is not platform specific. We also provide a Windows executable that includes Python. youtube-dl should work on your Unix box, on Windows or on Mac OS X. It is released to the public domain, which means you can modify it, redistribute it or use it however you like. youtube-dl is a powerful, open-source command-line program designed to facilitate the downloading of videos and audio from popular video streaming websites. Widely recognized for its versatility, youtube-dl supports a vast array of platforms beyond YouTube, including Vimeo, Dailymotion, and many others. ...
    Downloads: 199 This Week
  • 4
    SVoice (Speech Voice Separation)

    We provide a PyTorch implementation of the paper Voice Separation

    SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple...
    Downloads: 6 This Week
  • 5
    VoiceFixer

    General Speech Restoration

    VoiceFixer is a machine-learning framework for “speech restoration”: given a degraded or distorted audio recording — with noise, clipping, low sampling rate, reverberation, or other artifacts — it attempts to recover high-fidelity, clean speech. The architecture works in two stages: first an analysis stage that tries to extract “clean” intermediate features from the noisy audio (e.g. removing noise, denoising, dereverberation, upsampling), and then a neural vocoder-based synthesis stage that...
    Downloads: 2 This Week
  • 6
    Mocking Bird

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    MockingBird is an open-source voice cloning and real-time speech generation toolkit that lets you clone a speaker’s voice from a short audio sample (reportedly as little as 5 seconds) and then synthesize arbitrary speech in that voice. It builds on deep-learning based TTS / voice-cloning technology (in the lineage of projects such as Real-Time-Voice-Cloning), but extends it with support for Mandarin Chinese and multiple Chinese speech datasets — broadening its applicability beyond English....
    Downloads: 0 This Week
  • 7
    pytube

    A lightweight, dependency-free Python library

    Pytube is a lightweight, dependency-free Python library that enables downloading YouTube videos and audio streams with minimal setup. It supports video resolution selection, progressive or adaptive streams, and caption downloads. Pytube is ideal for automation scripts, archiving tools, and media applications that need to interface with YouTube content programmatically.
    Downloads: 7 This Week
  • 8

    pydatascope

    Software oscilloscope using Python and tkinter

    Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually.
    Downloads: 1 This Week
  • 9
    VidCutter

    A modern yet simple multi-platform video cutter and joiner

    A modern, simple to use, constantly evolving and hella fast MEDIA CUTTER + JOINER w/ frame-accurate SmartCut technology, chapter support, media stream selection for audio + subtitle channels and blackdetect video filter support to automatically detect scene changes or skip commercials in digital TV recordings. Chapter support allows scene chapter names to be included in final media metadata. NOTE: results will only work in media players that support chapters. Flatpak release includes the...
    Downloads: 9 This Week
  • 10
    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the...
    Downloads: 13 This Week
  • 11
    Denoiser

    Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)

    Denoiser is a real-time speech enhancement model operating directly on raw waveforms, designed to clean noisy audio while running efficiently on CPU. It uses a causal encoder-decoder architecture with skip connections, optimized with losses defined both in the time domain and frequency domain to better suppress noise while preserving speech. Unlike models that operate on spectrograms alone, this design enables lower latency and coherent waveform output. The implementation includes data...
    Downloads: 0 This Week
  • 12
    Pydub

    Manipulate audio with a simple and easy high level interface

    Manipulate audio with a simple and easy high-level interface. You can pass an optional bitrate argument to export using any syntax ffmpeg supports. Any further arguments supported by ffmpeg can be passed as a list in a 'parameters' argument, with switch first, argument second. Note that no validation takes place on these parameters, and you may be limited by what your particular build of ffmpeg/avlib supports. You can open and save WAV files with pure Python. For opening and saving non-wav...
    Downloads: 0 This Week
  • 13
    HiFi-GAN

    Generative Adversarial Networks for Efficient and High Fidelity Speech

    HiFi-GAN is a GAN-based neural vocoder designed to generate high-fidelity speech waveforms from mel spectrograms with exceptional efficiency. It introduces a generator architecture tailored to model the periodic structure of speech and a set of discriminators that focus on different scales and periods of the waveform to better capture naturalness. The model targets a sweet spot between sample quality and generation speed, outperforming many previous GAN vocoders while being far faster than...
    Downloads: 0 This Week
  • 14
    OpenDAFF

    Directional Audio File Format

    OpenDAFF is a free, open-source software package for directional audio data, such as the directivity of microphones and speakers, as well as head-related transfer functions (HRTFs).
    Downloads: 0 This Week
  • 15
    Youtube Video Downloader

    Youtube Video Downloader is Open Source GUI tool

    Youtube Video Downloader is an open-source GUI tool for downloading YouTube videos. It is developed with Python, Qt, and the Pytube library, and is a multi-threaded application. The "Best Available" option downloads a video in the highest available quality. Videos can also be downloaded in 720p, 480p, 360p, etc.
    Downloads: 8 This Week
  • 16
    Tensor2Tensor

    Library of deep learning models and datasets

    Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...
    Downloads: 0 This Week
  • 17

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud. Features: cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian); GPU/CPU encode/decode/post-processing; stream statistics; CCTV; adaptive HLS streams; load balancing; temporary URLs; HLS push; EPG scanning; subtitles-to-text conversions; ad insertion; logo overlay; video effects; relays; timeshifts; catchups; playlists; restream/transcode from online streaming services like Youtube, Twitch ...
    Downloads: 0 This Week
  • 18

    SmartBody

    Character animation system for games and simulations.

    SmartBody is available for download for Windows, Linux and OSX users. SmartBody can also be used on Android and iOS platforms. SmartBody is a character animation platform that provides the following capabilities in real time:
      * Locomotion (walk, jog, run, turn, strafe, jump, etc.)
      * Steering - avoiding obstacles and moving objects
      * Object manipulation - reach, grasp, touch, pick up objects
      * Lip Syncing - characters can speak with simultaneous lip-sync using text-to-speech or...
    Downloads: 1 This Week
  • 19
    termtosvg

    Record terminal sessions as SVG animations

    termtosvg is a Unix terminal recorder written in Python that renders your command line sessions as standalone SVG animations. Produce lightweight and clean-looking animations embeddable on a project page. Custom color themes, terminal UI, and animation controls via SVG templates. Compatible with asciinema recording format. termtosvg is compatible with Linux, macOS and BSD OSes, requires Python >= 3.5 and can be installed using pip.
    Downloads: 0 This Week
  • 20
    GUIDOLib
    GUIDOLib provides a powerful engine for the graphic rendering of music scores, based on the Guido Music Notation format. It supports the Linux, Mac OS X, Windows, Android and iOS operating systems. A Java JNI interface is available, as well as a JavaScript version of the library. A Web API has also been designed, allowing the engine to be deployed as a Web service.
    Downloads: 0 This Week
  • 21
    Spotify Music Downloader

    Download music from Spotify and other music sources.
    Downloads: 4 This Week
  • 22
    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staves, polyphony, MIDI playback of notes, chord markings, lyrics, and import/export filters for formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond.
    Downloads: 29 This Week
  • 23
    Tacotron-2

    DeepMind's Tacotron-2 Tensorflow implementation

    Tacotron-2 is a TensorFlow implementation of DeepMind’s Tacotron-2 end-to-end text-to-speech architecture, which predicts mel spectrograms from raw text and then feeds them to a neural vocoder such as WaveNet. It reproduces the original paper’s hyperparameters exactly via paper_hparams.py, while also offering a tuned hparams.py with extra improvements that often yield better audio quality in practice. The repository is structured as a full training pipeline: dataset preparation,...
    Downloads: 0 This Week
  • 24
    NeneBot v1.0b

    Portuguese Bot Development - Software Intuitivo

    This process is taking a bit longer because I recently had to move home. The program is now based on audio replies. Work is still in progress.
    Downloads: 0 This Week
  • 25
    iTunesExport

    iTunesExport is written as a helper tool to export iTunes Playlists to

    The application exports the songs in the iTunes playlists, then encodes and processes them in a very flexible, customizable way. To use it, select the desired playlists and then a target folder, which can be any USB Mass Storage device or a directory on your hard disk. Clicking the "Settings" button opens a panel where you can customize the handling of the files. Clicking the process button starts the export. You can also export the files into a folder and burn the...
    Downloads: 0 This Week