Showing 18 open source projects for "mpeg audio codec"

  • 1
    Audiogen Codec

    48 kHz stereo neural audio codec for general audio

    AGC (Audiogen Codec) is a convolutional autoencoder based on the DAC architecture, which holds the state of the art (SOTA). We found that training with EMA and adding a perceptual loss term based on CLAP features improved performance. Because they use low compression, these codecs outperform Meta's EnCodec and DAC on general audio, as validated in internal blind Elo games. We trained (relatively) very low compression codecs in pursuit of solving a core issue in general music and audio generation: low acoustic quality and audible artifacts, which hinder industry use of these models. ...
    Downloads: 2 This Week
  • 2
    HeartMuLa

    A Family of Open Sourced Music Foundation Models

    ...At the center is HeartMuLa, a music language model that generates music conditioned on inputs like lyrics and tags, with multilingual support that broadens the range of lyric-driven use cases. The project also includes HeartCodec, a music codec optimized for high reconstruction fidelity, enabling efficient tokenization and reconstruction workflows that are critical for training and generation pipelines. For text extraction from audio, it provides HeartTranscriptor, a Whisper-based model tuned specifically for lyrics transcription, which helps bridge generated or recorded audio back into structured text. ...
    Downloads: 14 This Week
  • 3
    AudioCraft

    Audiocraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. ...
    Downloads: 6 This Week
  • 4
    WavTokenizer

    SOTA discrete acoustic codec models with 40/75 tokens per second

    WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures.
    Downloads: 0 This Week
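To put the token rates quoted for WavTokenizer in perspective, the arithmetic below works out how many raw samples each token stands for. This is only a back-of-the-envelope sketch: the helper name `samples_per_token` is hypothetical and not part of the WavTokenizer code, and only the 24 kHz sample rate and the 40/75 tokens-per-second figures come from the description.

```python
def samples_per_token(sample_rate_hz: int, tokens_per_second: int) -> float:
    """Return how many audio samples one token covers (hypothetical helper)."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return sample_rate_hz / tokens_per_second


if __name__ == "__main__":
    # Figures taken from the listing: 24 kHz audio, 40 or 75 tokens per second.
    for rate in (40, 75):
        count = samples_per_token(24_000, rate)
        print(f"{rate} tokens/s -> {count:.0f} samples per token")
```

At 40 tokens per second each token covers 600 samples of 24 kHz audio; at 75 tokens per second it covers 320.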
  • 5
    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80 ms, the frame size), yet performs better than existing non-streaming codecs like SpeechTokenizer (50 Hz, 4 kbps) or SemantiCodec (50 Hz, 1.3 kbps). Moshi models two streams of audio: one corresponds to Moshi, and the other one to the user. ...
    Downloads: 0 This Week
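The frame rates and bitrates quoted for Mimi, SpeechTokenizer, and SemantiCodec can be compared on a per-frame basis with a little arithmetic. The sketch below assumes 1 kbps = 1000 bits per second; the `CodecSpec` class and its field names are hypothetical and exist only for this illustration, and none of this is code from the Moshi project.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CodecSpec:
    """Frame rate and bitrate of a codec, as quoted in the listing."""
    name: str
    frame_rate_hz: float
    bitrate_bps: float  # assumes 1 kbps = 1000 bits per second

    @property
    def frame_ms(self) -> float:
        # Duration of one frame in milliseconds.
        return 1000.0 / self.frame_rate_hz

    @property
    def bits_per_frame(self) -> float:
        # Bit budget available for each frame.
        return self.bitrate_bps / self.frame_rate_hz


CODECS = [
    CodecSpec("Mimi", 12.5, 1100.0),
    CodecSpec("SpeechTokenizer", 50.0, 4000.0),
    CodecSpec("SemantiCodec", 50.0, 1300.0),
]

if __name__ == "__main__":
    for codec in CODECS:
        print(f"{codec.name}: {codec.frame_ms:.0f} ms per frame, "
              f"{codec.bits_per_frame:.0f} bits per frame")
```

Under these assumptions a 12.5 Hz frame rate gives 80 ms frames, which matches the latency figure quoted for Mimi, and 1.1 kbps works out to 88 bits per frame.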
  • 6
    voxshare_gui

    VoxShare is a simple Python-based push-to-talk multicast voice chat

    VoxShare is a simple Python-based push-to-talk multicast voice chat application with a sleek modern GUI built using CustomTkinter. It is provided as Python source code or as a compiled standalone Windows application (no installation needed).
    Downloads: 20 This Week
  • 7
    VALL-E

    PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)

    We introduce a language modeling approach for text-to-speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech, which is hundreds of times larger than existing systems. ...
    Downloads: 3 This Week
  • 8
    EnCodec

    State-of-the-art deep learning based audio codec

    EnCodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), EnCodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. ...
    Downloads: 0 This Week
  • 9

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud. Features: cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian); GPU/CPU encode/decode/post processing; stream statistics; CCTV; adaptive HLS streams; load balancing; temporary URLs; HLS push; EPG scanning; subtitles to text conversions; ad insertion; logo overlay; video effects; relays; timeshifts; catchups; playlists; restream/transcode from online streaming services like YouTube, Twitch ...
    Downloads: 2 This Week
  • 10
    Fast Forward

    Free video editor to convert, cut, trim, stream select and encode

    Fast Forward is free video editing software that allows you to convert, cut, trim, remove streams, encode and customise a variety of parameters such as frame rate, bitrate, frame size and output file size. Fast Forward can encode H264, MPEG2 or Xvid video, as well as Dolby Digital AC3, Dolby Digital Plus eAC3+, AAC and Vorbis audio. It is very useful for removing ads from recorded TV programs, or combining the .VOB files from a DVD file system. Thanks to FFmpeg, these processes are extremely streamlined and fast. To speed up your conversions, use the "Straight Copy" codec options (only useable under specific circumstances). Accepted formats include: *.3g2 *.3gp *.asf *.avi *.drc *.flv *.gif *.gifv *.m2ts *.m2v *.m4p *.m4v *.mkv *.mng *.mov *.mp4 *.mpeg *.mpg *.mxf *.nsv *.ogg *.ogv *.rm *.rmvb *.roq *.svi *.vob *.webm *.wmv *.yuv
    Downloads: 5 This Week
  • 11

    convertToMP4

    Easily convert to MP4 with H264 and without all the codec hassle

    Convert existing media data (movies or images) to an MP4 movie with the high-performance H264 codec. Getting a video to play the way you want usually takes a lot of effort. MEncoder is used as the conversion engine. This script is intended to ease the creation of videos. It is suitable for beginners and for advanced users who want to save effort. It will not suit professionals who want full control over all codec parameters. Consider donating to this project:...
    Downloads: 2 This Week
  • 12
    LibreEngineering
    LibreEngineering is a suite of instrumentation, electrical, mechanical, and process engineering calculation and design programs and other tools. Licensed under GPL3. Written in Python with the Qt toolkit.
    Downloads: 2 This Week
  • 13
    EnKoDeur-Mixeur
    EnKoDeur-Mixeur (EKD) is open source software for video, picture, and audio post-production. It can also be used to convert videos to many formats. It is written in Python and uses the PyQt4 bindings.
    Downloads: 0 This Week
  • 14
    PyKaraoke is a cross-platform karaoke player. It currently supports CDG (MP3+G, OGG+G, WAV+G), MIDI (.KAR, .MID) and MPEG formats.
    Downloads: 24 This Week
  • 15
    The Qt Audio Processor is the ultimate audio file processing software, covering ripping, converting, tagging, and burning to, from, and between every available audio codec.
    Downloads: 0 This Week
  • 16
    Rippy is a script designed to make ripping DVDs easier. It uses mplayer and mencoder to transcode a video to another format. Features: automatic bitrate calculation based on desired target size; automatic crop detection; mp3 audio with resampling;
    Downloads: 0 This Week
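Rippy's description mentions automatic bitrate calculation from a desired target size. The listing does not show how Rippy does this, so the sketch below is only the standard arithmetic behind that kind of feature, not Rippy's actual code. The function name and parameters are hypothetical, 1 MB is assumed to be 10^6 bytes, the audio track is assumed to have a constant bitrate, and container overhead is ignored.

```python
def video_bitrate_kbps(target_size_mb: float, duration_s: float,
                       audio_kbps: float = 128.0) -> float:
    """Estimate the video bitrate that fits a target file size.

    Hypothetical helper: assumes 1 MB = 10**6 bytes, a constant-bitrate
    audio track, and no container overhead.
    """
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    # Total budget in kilobits: megabytes -> bytes -> bits -> kilobits.
    total_kbits = target_size_mb * 1_000_000 * 8 / 1000
    # Average bitrate available to audio and video together.
    total_kbps = total_kbits / duration_s
    video_kbps = total_kbps - audio_kbps
    if video_kbps <= 0:
        raise ValueError("target size is too small for the audio track alone")
    return video_kbps


if __name__ == "__main__":
    # Example: fit a 5600-second video with 128 kbps audio into 700 MB.
    print(f"{video_bitrate_kbps(700, 5600, 128):.0f} kbps for video")
```

A real tool would typically subtract a small margin for container overhead before passing the result to the encoder.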
  • 17
    This is a program for in-car use. It's designed in Python using pygame. The main goal is an easy-to-navigate program that also looks good. Planned features include support for MP3, DivX, MPEG, DVD, and other media files.
    Downloads: 0 This Week
  • 18
    playgc

    a simple video to audio converter

    Downloads: 0 This Week