Showing 15 open source projects for "audio codecs"

  • 1
    Audiogen Codec

    48 kHz stereo neural audio codec for general audio

    AGC (Audiogen Codec) is a convolutional autoencoder based on the DAC architecture, which is considered state of the art (SOTA). We found that training with EMA and adding a perceptual loss term based on CLAP features improved performance. Because they use low compression, these codecs outperform Meta's EnCodec and DAC on general audio, as validated by internal blind Elo games. We trained (relatively) very low compression codecs to address a core issue in general music and audio generation: low acoustic quality and audible artifacts, which hinder industry use of these models. ...
    Downloads: 1 This Week
  • 2
    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi compresses 24 kHz audio down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80 ms, the frame size), yet performs better than existing non-streaming codecs such as SpeechTokenizer (50 Hz, 4 kbps) or SemantiCodec (50 Hz, 1.3 kbps). Moshi models two streams of audio: one corresponds to Moshi, and the other to the user. ...
    Downloads: 3 This Week
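The figures quoted for Mimi in the Moshi entry are consistent with each other, and the arithmetic can be checked with a short sketch. The Python below is illustrative only: the function names are made up for this example and are not part of Moshi or Mimi, and the only inputs are the numbers stated in the listing (24 kHz, 12.5 Hz, 1.1 kbps, plus the rates given for SpeechTokenizer and SemantiCodec).

```python
# Illustrative arithmetic only; these helpers are hypothetical and not part of Moshi/Mimi.

def frame_duration_ms(frame_rate_hz: float) -> float:
    """Duration of one codec frame in milliseconds."""
    return 1000.0 / frame_rate_hz


def samples_per_frame(sample_rate_hz: float, frame_rate_hz: float) -> float:
    """Number of input audio samples covered by one codec frame."""
    return sample_rate_hz / frame_rate_hz


def bits_per_frame(bitrate_bps: float, frame_rate_hz: float) -> float:
    """Bit budget available to each codec frame at a given bitrate."""
    return bitrate_bps / frame_rate_hz


# (frame rate in Hz, bitrate in bits per second), as quoted in the listing.
CODECS = {
    "Mimi": (12.5, 1100),
    "SpeechTokenizer": (50.0, 4000),
    "SemantiCodec": (50.0, 1300),
}

if __name__ == "__main__":
    # 24 kHz input at 12.5 frames per second: 80 ms and 1920 samples per frame.
    print(frame_duration_ms(12.5), samples_per_frame(24_000, 12.5))
    for name, (rate_hz, bps) in CODECS.items():
        print(name, frame_duration_ms(rate_hz), "ms/frame,",
              bits_per_frame(bps, rate_hz), "bits/frame")
```

The 80 ms frame duration matches the latency stated in the description. How the per-frame bit budget is divided among codebooks is not given in the listing, so the sketch does not model it.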
  • 3
    OpenShot Video Editor

    Award-Winning Open Source Video Editing Software

    OpenShot Video Editor is a powerful yet very simple and easy-to-use video editor that delivers high quality video editing and animation solutions. OpenShot offers a myriad of features and capabilities, including powerful curve-based Key frame animations, 3D animated titles and effects, slow motion and time effects, audio mixing and editing, and so much more. It’s available for Linux, Mac and Windows, with a very simple and friendly interface. Start creating stunning videos quickly and easily...
    Downloads: 114 This Week
  • 4
    WavTokenizer

    SOTA discrete acoustic codec models with 40/75 tokens per second

    ...Its architecture incorporates a broader vector-quantization space, extended contextual windows, and improved attention networks, combined with multi-scale discriminators and inverse Fourier transform blocks to enhance waveform reconstruction. Extensive experiments show that WavTokenizer matches or surpasses previous neural codecs across speech, music, and general audio on both objective metrics and subjective listening tests.
    Downloads: 0 This Week
  • 5
    Tauon

    The music player of today

    Tauon is a modern, streamlined music player app that's packed with features! An emphasis on playlists and drag-and-drop importing puts you in control of your music library. Faded volume control, 24-bit FLAC support, and gapless playback provide the ultimate listening experience. Excellent CUE sheet support, an original smart playlist system, and network playback from koel or Airsonic servers. Last.fm, ListenBrainz, and Maloja scrobbling. MPRIS2 support for desktop integration. Tauon is a...
    Downloads: 4 This Week
  • 6
    Qwen3-Omni

    Qwen3-Omni is a natively end-to-end, omni-modal LLM

    ...It achieves state-of-the-art results: across 36 audio and audio-visual benchmarks, it hits open-source SOTA on 32 and overall SOTA on 22, outperforming or matching strong closed-source models such as Gemini-2.5 Pro and GPT-4o. To reduce latency, especially in audio/video streaming, Talker predicts discrete speech codecs via a multi-codebook scheme and replaces heavier diffusion approaches.
    Downloads: 0 This Week
  • 7
    PyAV

    Pythonic bindings for FFmpeg's libraries

    ...While powerful, it requires a solid understanding of FFmpeg concepts, as it prioritizes flexibility and control over abstraction. Overall, PyAV is a robust tool for developers building advanced video and audio processing systems in Python.
    Downloads: 33 This Week
  • 8
    PyLivestream

    Pure Python FFmpeg-based live video / audio streaming to YouTube

    PyLivestream is a Python-based tool that enables real-time video streaming from various input sources to platforms such as YouTube and Twitch. It acts as a wrapper around FFmpeg, allowing users to stream video from cameras, files, or screen capture devices with minimal configuration. The tool supports cross-platform operation and integrates easily into Python workflows, making it suitable for automation and scripting. It provides options for controlling streaming parameters such as bitrate,...
    Downloads: 0 This Week
  • 9
    yt-dlp

    A youtube-dl fork with additional features and fixes

    yt-dlp is a youtube-dl fork based on the now inactive youtube-dlc. The main focus of this project is adding new features and patches while also keeping up to date with the original project.
    Downloads: 570 This Week
  • 10
    EnCodec

    State-of-the-art deep learning based audio codec

    EnCodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), EnCodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. ...
    Downloads: 1 This Week
  • 11
    youtube-dl

    Download videos from YouTube (and more sites)

    youtube-dl is a command-line program to download videos from YouTube and a few more sites. It requires the Python interpreter (2.6, 2.7, or 3.2+), and it is not platform specific. We also provide a Windows executable that includes Python. youtube-dl should work in your Unix box, in Windows or in Mac OS X. It is released to the public domain, which means you can modify it, redistribute it or use it however you like. youtube-dl is a powerful, open-source command-line program designed to...
    Downloads: 65 This Week
  • 12
    MystiQ

    Qt5/C++ FFmpeg Media Converter

    MystiQ is a cross-platform multimedia converter built with Qt and FFmpeg, designed to provide a modern graphical interface for video and audio processing tasks. It allows users to perform operations such as transcoding, trimming, and format conversion without needing to use command-line tools. The application supports a wide range of codecs and formats, enabling compatibility across devices and platforms. It includes batch processing capabilities, allowing multiple files to be converted simultaneously. ...
    Downloads: 12 This Week
  • 13
    Antorca
    A performance and usability focused Linux distribution based on 64-bit Debian testing. It is the successor to illume OS. To use the live ISO image, login to "root" with the password "antorca".
    Downloads: 3 This Week
  • 14
    Fast Forward

    Free video editor to convert, cut, trim, stream select and encode

    Fast Forward is free video editing software that allows you to convert, cut, trim, remove streams, encode and customise a variety of parameters such as frame rate, bitrate, frame size and output file size. Fast Forward can encode H264, MPEG2 or Xvid video, as well as Dolby Digital AC3, Dolby Digital Plus eAC3+, AAC and Vorbis audio. It is very useful for removing ads from recorded TV programs, or combining the .VOB files from a DVD file system. Thanks to FFmpeg, these processes are extremely...
    Downloads: 5 This Week
  • 15
    A fully automated video conversion script, using XML profiles for configuration. Supports 1- or 2-pass encoding, auto crop detection, auto bitrate based on target file size, audio/video preprocessing/filtering, as well as all major codecs and container formats.
    Downloads: 0 This Week
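Choosing a bitrate from a target file size, as the last entry mentions, usually comes down to simple arithmetic: divide the size budget by the duration and subtract what the audio track will use. The sketch below shows one common way to do this in Python. It is not the script's actual code, which the listing does not show; the function name and parameters are hypothetical, and container overhead is ignored.

```python
# Hypothetical helper: one common way to derive a video bitrate from a target file size.
# Not taken from the listed script; container/muxing overhead is ignored.

def video_bitrate_bps(target_size_bytes: int,
                      duration_s: float,
                      audio_bitrate_bps: int) -> float:
    """Return the video bitrate (bits per second) that fits the target size."""
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    # Total bit budget per second of output.
    total_bps = target_size_bytes * 8 / duration_s
    # Whatever the audio track does not use is left for video.
    video_bps = total_bps - audio_bitrate_bps
    if video_bps <= 0:
        raise ValueError("target size is too small for the chosen audio bitrate")
    return video_bps


if __name__ == "__main__":
    # A 1,000,000-byte target over 10 s with 100 kbps audio leaves 700 kbps for video.
    print(video_bitrate_bps(1_000_000, 10.0, 100_000))
```

A bitrate computed this way is typically paired with 2-pass encoding, since the second pass lets the encoder hit an average bitrate more accurately.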