Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. The model can operate in real time and supports variable bandwidths, bitrates, and multi-band audio. Encodec has applications in speech and music compression, generative modeling, and efficient data transmission for communication systems. The repository includes pretrained checkpoints, PyTorch inference code, and examples for integrating Encodec as a module in downstream generative or streaming systems.
Features
- End-to-end learned neural audio compression at ultra-low bitrates
- Real-time encoding and decoding with GPU acceleration
- Configurable bitrates, bandwidths, and model sizes
- High perceptual quality maintained via multi-scale loss optimization
- Pretrained checkpoints for speech and music domains
- Modular PyTorch implementation for integration into larger pipelines