Showing 359 open source projects for "encoder"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    Multimodal

    Multimodal

    TorchMultimodal is a PyTorch library

    ...It includes a collection of ready model classes—like ALBEF, CLIP, BLIP-2, COCA, FLAVA, MDETR, and Omnivore—that serve as reference implementations you can adopt or adapt. The design emphasizes composability: you can mix and match encoder, fusion, and decoder components rather than starting from monolithic models. The repository also includes example scripts and datasets for common multimodal tasks (e.g. retrieval, visual question answering, grounding) so you can test and compare models end to end. Installation supports both CPU and CUDA, and the codebase is versioned, tested, and maintained.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    VGGT

    VGGT

    [CVPR 2025 Best Paper Award] VGGT

    VGGT is a transformer-based framework aimed at unifying classic visual geometry tasks—such as depth estimation, camera pose recovery, point tracking, and correspondence—under a single model. Rather than training separate networks per task, it shares an encoder and leverages geometric heads/decoders to infer structure and motion from images or short clips. The design emphasizes consistent geometric reasoning: outputs from one head (e.g., correspondences or tracks) reinforce others (e.g., pose or depth), making the system more robust to challenging viewpoints and textures. The repo provides inference pipelines to estimate geometry from monocular inputs, stereo pairs, or brief sequences, together with evaluation harnesses for common geometry benchmarks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    NVIDIA Isaac GR00T

    NVIDIA Isaac GR00T

    NVIDIA Isaac GR00T N1.5 is the world's first open foundation model

    ...The vision-language model remains frozen during both pretraining and finetuning, preserving language understanding and improving generalization. Streamlined MLP connection between vision encoder and LLM with added layer normalization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Internet Friendly Media Encoder

    Internet Friendly Media Encoder

    Simple, Flexible & Powerful H.265/HEVC & H266/VVC video encoder!

    A versatile, future-proof, and expandable multimedia encoder designed for ease of use. Internet Friendly Media Encoder supports adding subtitles and attachments, and can function as a remuxing tool with a Copy Stream option. It allows you to combine multiple video, audio, subtitle, and attachment streams into a single file, remove unwanted streams, or incorporate subtitles from another video without extracting.
    Downloads: 14 This Week
    Last Update:
    See Project
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • 5
    Step-Audio 2

    Step-Audio 2

    Multi-modal large language model designed for audio understanding

    Step-Audio2 is an advanced, end-to-end multimodal large language model designed for high-fidelity audio understanding and natural speech conversation: unlike many pipelines that separate speech recognition, processing, and synthesis, Step-Audio2 processes raw audio, reasons about semantic and paralinguistic content (like emotion, speaker characteristics, non-verbal cues), and can generate contextually appropriate responses — including potentially generating or transforming audio output. It integrates a latent-space audio encoder, discrete acoustic tokens, and reinforcement-learning–based training (CoT + RL) to enhance its ability to capture and reproduce voice styles, intonations, and subtle vocal cues. Moreover, Step-Audio2 supports tool-calling and retrieval-augmented generation (RAG), allowing it to access external knowledge sources or audio/text databases, thus reducing hallucinations and improving coherence in complex dialogues.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    HunyuanOCR

    HunyuanOCR

    OCR expert VLM powered by Hunyuan's native multimodal architecture

    HunyuanOCR is an open-source, end-to-end OCR (optical character recognition) Vision-Language Model (VLM) developed by Tencent‑Hunyuan. It’s designed to unify the entire OCR pipeline, detection, recognition, layout parsing, information extraction, translation, and even subtitle or structured output generation, into a single model inference instead of a cascade of separate tools. Despite being fairly lightweight (about 1 billion parameters), it delivers state-of-the-art performance across a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    PI-Based Image Encoder / Converter

    PI-Based Image Encoder / Converter

    Python code able to convert / compress image to PI (3.14, π) Indexes

    Image processing tool that encodes pixel data as indices within the first 16.7 million digits of PI (Ď€). Features high-performance Numba-accelerated search and a signature 'film-grain' aesthetic upon reconstruction. ZIP also include 16 MB file with 16,7 mil numbers of PI Benchmark(Single-Thread): Hardware & Environment Apple Silicon: Apple M2 (Mac mini/MacBook) x86_64 Platform: Intel Core Ultra 5 225F (Arrow Lake, 10 Cores) OS 1: Fedora 43 (GNOME) OS 2: Windows 11 Pro...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Corne keyboard
    crkbd is the firmware and PCB design for the Corne split mechanical keyboard (aka "Corne"), maintained by foostan and the community. It provides QMK/VIA/Vial firmware support, RGB underglow, multiple layouts, and flexible hardware customization.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    FFmpeg Batch AV Converter

    FFmpeg Batch AV Converter

    Free all in one audio/video ffmpeg batch encoder

    FFmpeg Batch AV Converter is a free universal audio and video encoder for Windows and Linux (via Wine), that allows to use the full potential of ffmpeg command line with a few mouse clicks in a convenient GUI with drag and drop, progress information. Some fancy wizards make things easy for non-experts. Thanks to its multi-file encoding feature, it may be the fastest a/v batch encoder available, since it maximizes system resources usage by launching as many simultaneous processes up to user cpu thread count. ...
    Leader badge
    Downloads: 1,264 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10
    ToolSparkr

    ToolSparkr

    Free online developer tools JSON formatter, Base64 encoder, and more

    ToolSparkr - 35+ free browser-based developer tools. No signup required, no data uploaded to servers. Everything runs in your browser. Tools: JSON Formatter & Validator, Base64 Encoder/Decoder, Password Generator, QR Code Creator, MD5/SHA Hash Generator, Color Converter, URL Encoder/Decoder, UUID Generator, DNS Lookup, WHOIS Lookup, IP Lookup, HTTP Status Checker, HTML Entity Encoder, Morse Code Converter, CSS Minifier, Bcrypt Generator & more. Built for developers, designers, marketers, and power users who need fast, reliable utilities without switching between multiple websites. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11

    fbls

    A binary encoder and decoder.

    As of its early beta stages, its not useful for much. If you could give it and download and try what i have it would mean a lot!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    GoFileEncoder

    GoFileEncoder

    A simple XOR file encoder.

    GoFileEncoder is a little file encoder that uses the XOR encryption. It is coded in Golang, and is compiled for several OSs (Windows®, Linux®, and soon macOS®).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    DeepSeek VL

    DeepSeek VL

    Towards Real-World Vision-Language Understanding

    ...It enables understanding and generation across visual and textual modalities—meaning it can process an image + a prompt, answer questions about images, caption, classify, or reason about visuals in context. The model is likely used internally as the visual encoder backbone for agent use cases, to ground perception in downstream tasks (e.g. answering questions about a screenshot). The repository includes model weights (or pointers to them), evaluation metrics on standard vision + language benchmarks, and configuration or architecture files. It also supports inference tools for forwarding image + prompt through the model to produce text output. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    ws

    ws

    Tiny WebSocket library for Go

    ...Example applications using ws are developed in separate repository ws-examples. We can apply the same pattern to read and write structured responses through a JSON encoder and decoder.Zero-copy upgrade helps to avoid unnecessary allocations and copying while handling HTTP Upgrade request.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    ffxvid

    ffxvid

    Small script to encode to Xvid video

    ffxvid is a TUI (Terminal User Interface) Bash shell program script written for the GNU/Linux and BSD operating systems for encoding video files to the MPEG4 Part 2/Xvid video format using the FFmpeg encoder and the libxvid library. Targeted primarily at encoding of full-length feature films and film series but it can be used for other content as well.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    q4rescue

    q4rescue

    A live linux Rescue toolkit/Emergency OS - based on q4os Trinity

    A live linux system rescue toolkit based on q4os Trinity available as a bootable iso for administrating, repairing and cloning/restoring your system and data. Check wiki for full description : https://sourceforge.net/p/q4rescue/wiki/ Main tools: -Foxclone -Rescuezilla -Clonezilla -DDrescue-gui -qtfsarchiver -G4L -Apart -Testdisk -Photorec -Boot Repair -WoeUSB -Q4OS imager -UNetbootin -usbimager -Kdirstats -Kdiskmark -Rclone & Rclone...
    Downloads: 41 This Week
    Last Update:
    See Project
  • 17
    JSONLab

    JSONLab

    JSONLab: compact, portable, robust JSON/binary-JSON encoder

    JSONLab is a free and open-source JSON/UBJSON/MessagePack encoder and decoder written in the native MATLAB language. It can be used to convert a MATLAB data structure (array, struct, cell, struct array, cell array, and objects) into JSON/UBJSON/MessagePack formatted strings and files, or to parse a JSON/UBJSON/MessagePack file into MATLAB data structure. JSONLab supports nearly all versions of MATLAB and GNU Octave (a free MATLAB clone).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    mTRF-Toolbox

    mTRF-Toolbox

    A MATLAB package for modelling multivariate stimulus-response data

    mTRF-Toolbox is a MATLAB package for modelling multivariate stimulus-response data, suitable for neurophysiological data such as MEG, EEG, sEEG, ECoG and EMG. It can be used to model the functional relationship between neuronal populations and dynamic sensory inputs such as natural scenes and sounds, or build neural decoders for reconstructing stimulus features and developing real-time applications such as brain-computer interfaces (BCIs). Toolbox Paper: ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    SafeUtils

    SafeUtils

    110+ developer tools as native MacOS, Linux & Windows desktop apps.

    Tools: https://safeutils.com/barcode-generator https://safeutils.com/color-picker https://safeutils.com/qr-code-generator https://safeutils.com/qr-code-scanner https://safeutils.com/word-counter https://safeutils.com/base-64-decoder https://safeutils.com/diff-checker https://safeutils.com/hex-to-ascii https://safeutils.com/json-formatter https://safeutils.com/lorem-ipsum-generator https://safeutils.com/random-generator https://safeutils.com/time-converter https://safeutils.com/xml-formatter https://safeutils.com/ascii-to-binary https://safeutils.com/ascii-to-hex https://safeutils.com/base-64-encoder https://safeutils.com/binary-to-ascii https://safeutils.com/case-converter https://safeutils.com/csv-to-json https://safeutils.com/decimal-to-ascii https://safeutils.com/html-formatter https://safeutils.com/html-preview https://safeutils.com/html-to-markdown https://safeutils.com/id-generator https://safeutils.com/json-to-csv https://safeutils.com/json-to-xml
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Video of Death
    Video of Death is a nonlinear video editing software. You must install FFmpeg. 5-9-26: Found a bit of a bug, I'm not going to chase. If there is no visual in the timeline, audio won't play by it's self. 5-15-26: Fixed video/audio sync bug. Preview sync isn't perfect, but exported videos have perfect video/audio sync. 5-22-26: Had a major issue on exports, that didn't show in testing. Fixed. Also fixed audio sync in preview, as well as source window functions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    StarFishDAQ

    StarFishDAQ

    A general purpose data acquisition system and development platform.

    The StarFishDAQ (or SFDQ for short) is a platform for developing MCU based hardware without having to start from scratch. It consists of hardware, firmware and software components. The SFDQ allows those projects that would otherwise use an off-the-shelf DAQ, PLC or MCU SBC to start with a stack of hardware, firmware and software. This allows more control, customization, and future applicability without sinking a ton of effort. The SFDQ can be used simply and quickly, as-is in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    StyleTTS 2

    StyleTTS 2

    Towards Human-Level Text-to-Speech through Style Diffusion

    ...The repository includes training scripts, configuration files, and pre-trained auxiliary modules such as a text aligner, pitch extractor, and PL-BERT-based linguistic encoder.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23

    Virtualdub Batch Video DeShake v26.0204

    Batch to compress [and deshake] all videos [or images] in folder

    Installation: Execute "DeShakInst.BAT" VirtualDub2 44282; AviSynth+ 3.7.5 updated to C:\DVD DESHAK.BAT updated to C:\UT and added to PATH Usage: DESHAK task[s] [parameters] Tasks: tp1: deshake pass1 LOG generation for 2nd pass tp2: deshake pass2 and compress video and audio to MP3 tcomp: compress (no deshake) twav: extract WAV and/or uses external WAV audio Parameters (more in help): vEXT: video extension (ie: vmov), default: vAVI qN: h264 quality 1-9 (9=lossless), def: q3 (crf23) aN: mp3 quality 1-5, def: a3 (192k) * generates: ZZoriginalname.AVI * some settings at begining ie: vdPath Min Requirements: XP; Win7x64 for aviSynth video NoiseReduction Klite Mega Codec Pack (with LAME encoder) Other Utilities: LOG2CHAPS.BAT generate _OGG.txt chapters @ scene change VID2AUD.BAT extract Audios VID2MKV.BAT multiplex vid+aud+chapters VIDJOIN.BAT merges videos to MKV
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    wxLame

    wxLame

    wxLame is a free front-end for the LAME MP3 encoder.

    Author website: https://cfgnunes.com Website: https://cfgnunes.com/wxlame GIT repository: https://github.com/cfgnunes/wxlame
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    JAVE2

    JAVE2

    The JAVE (Java Audio Video Encoder) library is Java wrapper

    JAVE2 is a Java-based wrapper library for FFmpeg that enables developers to perform audio and video encoding operations through a structured and object-oriented API. It simplifies interaction with FFmpeg by abstracting command-line complexity into reusable Java components. The library supports tasks such as transcoding, format conversion, and media extraction, making it suitable for backend multimedia services. It provides configuration options for codecs, bitrates, and container formats,...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo