Showing 151 open source projects for "audio codec"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    Audiogen Codec

    Audiogen Codec

    48khz stereo neural audio codec for general audio

    AGC (Audiogen Codec) is a convolutional autoencoder based on the DAC architecture, which holds SOTA. We found that training with EMA and adding a perceptual loss term with CLAP features improved performance. These codecs, being low compression, outperform Meta's EnCodec and DAC on general audio as validated from internal blind ELO games. We trained (relatively) very low compression codecs in the pursuit of solving a core issue regarding general music and audio generation, low acoustic quality, and audible artifacts, which hinder industry use for these models. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Wrapper for VideoStation

    Wrapper for VideoStation

    Synology VideoStation and DLNA FFmpeg Wrapper with AAC, DTS, EAC3

    ...The project also includes options for configuring audio codec priorities and enabling multi-channel output such as 5.1 audio. It is compatible with modern Synology DSM versions and integrates directly into existing media server workflows. Overall, it enhances media server flexibility by removing codec restrictions and improving playback support.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    HeartMuLa

    HeartMuLa

    A Family of Open Sourced Music Foundation Models

    ...At the center is HeartMuLa, a music language model that generates music conditioned on inputs like lyrics and tags, with multilingual support that broadens the range of lyric-driven use cases. The project also includes HeartCodec, a music codec optimized for high reconstruction fidelity, enabling efficient tokenization and reconstruction workflows that are critical for training and generation pipelines. For text extraction from audio, it provides HeartTranscriptor, a Whisper-based model tuned specifically for lyrics transcription, which helps bridge generated or recorded audio back into structured text. ...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 4
    WavTokenizer

    WavTokenizer

    SOTA discrete acoustic codec models with 40/75 tokens per second

    WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 5
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and the other one to the user. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    rtmp-rtsp-stream-client-java

    rtmp-rtsp-stream-client-java

    Library to stream in rtmp and rtsp for Android. All code in Java

    Library for streaming in RTMP and RTSP. All code in Java.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 8
    go2rtc

    go2rtc

    Ultimate camera streaming application

    go2rtc is a lightweight, zero-dependency streaming server designed to unify and convert video streams across a wide range of protocols and devices, particularly in smart home and surveillance environments. Written in Go, it provides real-time streaming capabilities with extremely low latency by supporting protocols such as RTSP, WebRTC, RTMP, HTTP, and HomeKit, while also enabling seamless transcoding using FFmpeg when needed. The application can ingest streams from IP cameras, USB devices,...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 9
    ffmpeg-commander

    ffmpeg-commander

    A web-based GUI for quickly generating common FFmpeg command-line

    ffmpeg-commander is a web-based graphical interface that simplifies the creation of FFmpeg commands for common video and audio encoding tasks. It provides a user-friendly environment where users can configure encoding options without needing to memorize complex command-line syntax. Built with modern web technologies, it generates FFmpeg commands dynamically based on user input. The tool focuses on common workflows such as format conversion, compression, and codec selection. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    KSPlayer

    KSPlayer

    A video player for iOS, macOs, tvOS, visionOS

    KSPlayer is a cross-platform multimedia playback framework designed for Apple ecosystems, including iOS, macOS, tvOS, and visionOS. It is built on top of AVPlayer and FFmpeg, combining native system capabilities with extended codec support for a wide range of media formats. The framework provides high-performance playback with support for advanced features such as HDR video, multi-audio streams, and subtitle rendering. It supports both local media files and network streams, including live streaming scenarios with low latency. KSPlayer is designed to integrate seamlessly with SwiftUI, UIKit, and AppKit, enabling developers to build modern user interfaces around its playback engine. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    opencore-amr

    Audio codecs extracted from Android Open Source Project

    Library of OpenCORE Framework implementation of Adaptive Multi Rate Narrowband and Wideband (AMR-NB and AMR-WB) speech codec. Library of VisualOn implementation of Adaptive Multi Rate Wideband (AMR-WB) encoder and Advanced Audio Coding (AAC) encoder. Modified library of Fraunhofer AAC decoder and encoder.
    Leader badge
    Downloads: 8,313 This Week
    Last Update:
    See Project
  • 12
    MMC is a commander-style media player for Windows, with native, hw accelerated video playing and translucent gui. Mpxplay is a console audio player for DOS and Win32 operating systems. x264vfw, x265vfw and xAV1vfw are video for windows encoder and decoder codecs, useful with VirtualDub.
    Leader badge
    Downloads: 275 This Week
    Last Update:
    See Project
  • 13
    VidCoder

    VidCoder

    A Blu-ray, DVD and video file transcoder for Windows

    VidCoder is a Windows-based open-source video transcoding and ripping tool that provides a graphical interface built around standard command-line multimedia tools. It lets users convert video files (or rip DVDs/Blu-rays, when supported) into modern formats and codecs, making it useful for people who want to compress, re-encode, or transcode video content without dealing directly with low-level encoder settings. Because VidCoder integrates and automates the invocation of complex backend...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    BlackBelt CodecPack

    BlackBelt CodecPack

    A clean, lean CoDec Pack. FFDShow and LAV Combined.

    Contains support for popular formats. Works especially well with MediaPortal. LAV, ffdshow - why choose between when you can have both in one pack ! WMV/WMA, DivX, AVI, ASF, FLV, Ogg FLAC, HEV1, x264, x265 etc. NO SPYWARE, NO ADWARE, NO TOOLBARS, NO PLAYER - JUST PURE CODECS Windows XP / Vista / 7 / 8 / 10 - 32/64 bit.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3, OPUS, OGG - Editing codecs: DNxHD, DNxHR, Apple ProRes, QT Animation, GoPro CineForm, Uncompressed YUV - Output codecs: H.264, H.265, VP8, VP9, AV1, OGV - Broadcast codecs: XDCAM HD422, AVC-Intra 100, XAVC, HAP - Old codecs: DV PAL, MJPEG, Xvid, WMV, MPEG - Archiving codec: FFV1 - Images creation: JPEG, Image - Burn & Rip: DVD, Blu-ray, DVD RIP - Analysis: Loudness & True Peak, Audio normalization, Cut detection, Black detection, Media, VMAF - Download: Web video
    Leader badge
    Downloads: 438 This Week
    Last Update:
    See Project
  • 16
    Tuniac

    Tuniac

    Tuniac Media Player

    Tuniac is an iTunes style media player/manager for Windows. Advanced playlist editor, search as you type and queue support. Supports: MPEG-1 Audio (mp3, mp2, mp1) FLAC (flac, fla, oga, ogg) Advanced Audio Coding (aac, m4a, m4b, mp4) Apple Lossless Audio Codec aka ALAC (m4a) Windows Media Audio (wma) Vorbis (ogg) Opus (opus) WavPack (wv) TAK Audio (tak) TrueAudio (tta) Monkeys Audio (ape, mac) Musepack (mpc, mp+, mpp) OptimFrog (ofr, ofs) Shorten (shn) Dolby Digital (ac3) DSD (dff, dfs) PCM (wav, aif) CD Audio (cda) Speex (spx) MOD Formats (mod, mo3, xm, it, s3m, mtm) Game Audio Formats (adx, umx) MIDI (mid) Supporting radio streaming of most the above formats.
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 17

    JosePlayer

    Play audio and video files and folders, CDs and DVDs

    It is an awesome multimedia player that plays any multimedia content. You only need to install a free directshow package codec to fully enjoy this media player.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Linkify
    Linkify is a radio client for Windows. The program allows connecting to digital networks, transmitting and receiving audio on selected talkgroups. The program works on a PTT (Push-To-Talk) basis — holding the assigned key enables transmission, releasing it stops transmission, similar to a real radio. __________________________________________________________________________________ Linkify to klient radiowy dla systemu Windows. Program umożliwia łączenie się z...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Apprentice Video

    Apprentice Video

    it's a video player, also works for music and pictures

    This player stands on the giant shoulders of FFmpeg. Audio rendering is accomplished via portaudio v19. Video rendering is via OpenGL, using fragment programs when possible. User interface is implemented with Qt 4/5/6. ASS/SSA subtitle rendering is implemented with libass. This player provides several performance options to enable adequate video playback on slow hardware: * skip loop filter * skip non-reference frames * skip color converter * reduce playback speed to accommodate slow video decoding This player supports playback of HDR video on non-HDR displays: * colorspace transform to BT.709 colorspace via an auto-generated 3D LUT * tone mapping from HDR to SDR (BT. 709) This player supports playback of MPEG-TS files containing multiple programs, timeline anomalies, and codec changes. ...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 20
    voxshare_gui

    voxshare_gui

    *VoxShare* is a simple Python-based push-to-talk multicast voice chat

    VoxShare is a simple Python-based push-to-talk multicast voice chat application with a sleek modern GUI built using CustomTkinter. Provided as python source code or compiled standalone windows application (no need to install anything).
    Downloads: 8 This Week
    Last Update:
    See Project
  • 21
    Ant Movie Catalog

    Ant Movie Catalog

    Free program made to manage your collection of movies

    ...Import information from Internet (using scripts); by default it includes scripts for IMDB (US), DVDFR (FR), Allociné (FR) and lots of others. User-customizable links to do a search on movie websites. Information importation from various media files (audio & video codec, bitrates, resolution, framerate, size). Scripting technology, using Object Pascal language, allowing to modify catalog: Find & replace, moving field values, ... Printing, using customizable templates. Export to other formats: HTML (based on a template that you can modify or create yourself), SQL commands (to re-import data in a DBMS such as MySQL), CSV (text files, can be used as tables with Microsft Excel for example). ...
    Downloads: 26 This Week
    Last Update:
    See Project
  • 22
    blackvideo-mini-player

    blackvideo-mini-player

    A standalone lightweight auxiliary CLI video player for BlackVideo.

    Lightweight cross-platform video player (Ada + SDL2 + FFmpeg). Support player for the BlackVideo. Works standalone via CLI or right-click on any video file. Usage Method 1 — Command Line Step 1. Unzip blackvideo-mini-player-v2.3.0.win.zip Step 2. Open the build\ folder, then type cmd directly in the address bar and press Enter — this opens a terminal already in that folder. Alternatively: open Command Prompt anywhere and use cd with the copied path: cd...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    Aeyae Remux

    Aeyae Remux

    GOP-based video editor (remuxer)

    ...Aeyae Remux can be used to stitch together files... within reason. Since it is only a remuxer -- stitched files must be of compatible format. This means stitched files must have the same audio/video codecs, and codec properties (width, height, sample rate, etc...). That said... Aeyae Remux currently doesn't check or enforce this restriction, so beware -- garbage input will likely produce garbage output. Linux binaries are provided as AppImage, built on Ubuntu 14.04. 8 GB of RAM (but more is better) is highly recommended if working with source files longer than 2 hours.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    ViaVoip

    ViaVoip

    A portable peer to peer voice-chat/walkie-talkie.

    ViaVoip is a simple Voice Over IP application that can be used when you need to talk, chat, or send files through the internet, but you can't or don't want to make use of any third party services. Its peer to peer design allows the two end points to connect directly to each other, without any central server nor account registration. It runs on Windows, Linux, Mac OS X and Android, and is portable, that is you don't need any setup, just get a copy and run it from any storage...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 25

    Virtualdub Batch Video DeShake v26.0204

    Batch to compress [and deshake] all videos [or images] in folder

    Installation: Execute "DeShakInst.BAT" VirtualDub2 44282; AviSynth+ 3.7.5 updated to C:\DVD DESHAK.BAT updated to C:\UT and added to PATH Usage: DESHAK task[s] [parameters] Tasks: tp1: deshake pass1 LOG generation for 2nd pass tp2: deshake pass2 and compress video and audio to MP3 tcomp: compress (no deshake) twav: extract WAV and/or uses external WAV audio Parameters (more in help): vEXT: video extension (ie: vmov), default: vAVI qN: h264 quality 1-9 (9=lossless), def: q3 (crf23) aN: mp3 quality 1-5, def: a3 (192k) * generates: ZZoriginalname.AVI * some settings at begining ie: vdPath Min Requirements: XP; Win7x64 for aviSynth video NoiseReduction Klite Mega Codec Pack (with LAME encoder) Other Utilities: LOG2CHAPS.BAT generate _OGG.txt chapters @ scene change VID2AUD.BAT extract Audios VID2MKV.BAT multiplex vid+aud+chapters VIDJOIN.BAT merges videos to MKV
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next