open codec free download

Showing 26 open source projects for "open codec"

View related business solutions

Python Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

Audiogen Codec

48khz stereo neural audio codec for general audio

AGC (Audiogen Codec) is a convolutional autoencoder based on the DAC architecture, which holds SOTA. We found that training with EMA and adding a perceptual loss term with CLAP features improved performance. These codecs, being low compression, outperform Meta's EnCodec and DAC on general audio as validated from internal blind ELO games. We trained (relatively) very low compression codecs in the pursuit of solving a core issue regarding general music and audio generation, low acoustic...

Downloads: 0 This Week

Last Update: 2024-10-02
See Project
2

HeartMuLa

A Family of Open Sourced Music Foundation Models

HeartMuLa is the open-source library and reference implementation for the HeartMuLa family of music foundation models, designed to support both music generation and music-related understanding tasks in a cohesive stack. At the center is HeartMuLa, a music language model that generates music conditioned on inputs like lyrics and tags, with multilingual support that broadens the range of lyric-driven use cases.

Downloads: 16 This Week

Last Update: 2026-03-05
See Project
3

AudioCraft

Audiocraft is a library for audio processing and generation

AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides...

Downloads: 6 This Week

Last Update: 2025-10-13
See Project
4

WavTokenizer

SOTA discrete acoustic codec models with 40/75 tokens per second

WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures. The model uses a single-quantizer design together with temporal compression to...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
8 Monitoring Tools in One APM. Install in 5 Minutes.
Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.

Start Free
5

AV1 AVIF

AV1 Image File Format Specification - ISO-BMFF/HEIF derivative

AV1 AVIF is the official specification and reference design for the AV1 Image File Format (AVIF), defining how AV1-encoded bitstreams are packaged into the HEIF container format (based on ISOBMFF) to produce AVIF files. The project outlines the syntax and semantics required for AVIF compliance, including support for multiple image profiles, color depths, chroma subsampling modes, HDR/WCG, alpha channels, animation/image sequences, and various color-space/bit-depth combinations — making AVIF...

Downloads: 0 This Week

Last Update: 2025-12-08
See Project
6

Moshi

A speech-text foundation model for real time dialogue

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and...

Downloads: 0 This Week

Last Update: 2024-11-05
See Project
7

voxshare_gui

*VoxShare* is a simple Python-based push-to-talk multicast voice chat

VoxShare is a simple Python-based push-to-talk multicast voice chat application with a sleek modern GUI built using CustomTkinter. Provided as python source code or compiled standalone windows application (no need to install anything).

Downloads: 16 This Week

Last Update: 2025-07-01
See Project
8

Warlock-Studio

AI Suite for upscaling, interpolating & restoring images/videos

v6.0. Warlock-Studio is a Windows application that uses Real-ESRGAN, BSRGAN, IRCNN, GFPGAN, RealESRNet, RealESRAnime and RIFE Artificial Intelligence models to upscale, restore faces, interpolate frames and reduce noise in images and videos. the application supports GPU acceleration (including multi-GPU setups) and offers batch processing for large workloads. It includes drag-and-drop handling for single or multiple files, optional pre-resize functions, and an automatic tiling system...

Downloads: 25 This Week

Last Update: 2026-02-16
See Project
9

VALL-E

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)

We introduce a language modeling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech which is hundreds of times larger than existing systems....

Downloads: 0 This Week

Last Update: 2023-04-14
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

EnCodec

State-of-the-art deep learning based audio codec

Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. The model can...

Downloads: 0 This Week

Last Update: 2025-10-12
See Project
11

SimpleVideoEncoder

Simple video encoder

Simple video encoder is GUI for ffmpeg designed to encode video files. The application is designed so that the process of starting the encoding of one or more videofiles takes 2-3 clicks.

Downloads: 3 This Week

Last Update: 2022-09-27
See Project
12

BEE free TECH

[ Beautiful, Effective, Efficient, Freedom, Technology ]

BEE free 20.04 is a community-driven Linux distribution based on Ubuntu 20.04 .

6 Reviews

Downloads: 3 This Week

Last Update: 2020-12-02
See Project
13

FastoCloud PRO

IPTV/NVR/CCTV/Video cloud https://fastocloud.com

IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch ...

Downloads: 2 This Week

Last Update: 2020-06-20
See Project
14

Simple Video Trimmer

Preview video with VLC; use in/out hotkeys; trim with FFMPEG.

Written in Python, this program uses VLC media player to preview a video, and then it uses FFMPEG with "copy" codec to quickly cut out a segment from the video. VLC seek/jump hotkeys, as well as in/out hotkeys are supported. Currently, it's just in Python (no EXE), but it should be easy to run it from Python. From the command prompt, install the dependencies: pip install wxPython pip install boto pip install selenium pip install google-api-python-client Then start the program by:...

Downloads: 0 This Week

Last Update: 2018-05-19
See Project
15

convertToMP4

Easily convert to MP4 with H264 and without all the codec hassle

Convert existing media data (movies or images) to an mp4 movie, with the high performane h264 codec. Usually it takes quite a lot to get the video running as you wish. As convertion engine, Mencoder will be used. This script is intended to ease the creation of videos. It is suitable for the beginner or the lazy advanced people. It will not suite the professional wanting to have full control over all codec parameters. Consider donating to this project:...

Downloads: 1 This Week

Last Update: 2017-11-02
See Project
16

LibreEngineering

LibreEngineering - suite of instrumentation, electrical, mechanical, process engineering calculation and design programs and other tools. Licensed under GPL3. Written in Python with Qt toolkit.

Downloads: 0 This Week

Last Update: 2013-08-28
See Project
17

EnKoDeur-Mixeur

EnKoDeur-Mixeur (EKD) is an open source software which makes videos, pictures and audio post-production. It can be also used to convert videos in many formats. It is written in python and use the PyQt4 bindings.

1 Review

Downloads: 1 This Week

Last Update: 2013-04-30
See Project
18

QtAP

The Qt Audio Processor is an ultimate audio files processing software, including ripping, converting, tagging and burning to, from and between every available audio codec.

Downloads: 0 This Week

Last Update: 2017-12-31
See Project
19

Rippy

Rippy is a script designed to make ripping DVDs easier. It uses mplayer and mencoder to transcode a video to another format. Features: automatic bitrate calculation based on desired target size; automatic crop detection; mp3 audio with resampling;

Downloads: 0 This Week

Last Update: 2017-08-31
See Project
20

MKV Demux All

This is a program with one, specific purpose: to provide the ability to demux ALL of the streams in any number of Matroska container (mkv) files automatically.

5 Reviews

Downloads: 8 This Week

Last Update: 2015-03-22
See Project
21

giligooju

parallelized H.264 codec.

Downloads: 0 This Week

Last Update: 2013-04-19
See Project
22

playgc

a simple video to audio converter

Downloads: 0 This Week

Last Update: 2022-11-28
See Project
23

Safename

A codec for safe filenames.

1 Review

Downloads: 0 This Week

Last Update: 2012-07-09
See Project
24

xvidcfg

This toolkit creates codec configuration files. The generated configuration file is meant to be read by transcodes plugins. Target codecs will be XviD and libavcodec, but further extension are planned.

Downloads: 0 This Week

Last Update: 2016-11-19
See Project
25

Python YUV Player

The project is aimed at all the multimedia codec developers. It will play various data formats like yuv420/422, rgb888/565/etc. It will support conversion from one color space to another, scaling/zooming, fps control, grid display to identify MB boundary

Downloads: 0 This Week

Last Update: 2015-10-13
See Project