Showing 479 open source projects for "encoder"

  • 1
    Shutter Encoder

    A professional video compression tool accessible to all

    Shutter Encoder is a cross-platform video and audio processing application designed to provide professional-grade encoding and conversion tools through an accessible graphical interface. Built primarily on FFmpeg, it offers a wide range of media operations including transcoding, compression, format conversion, and editing. The software supports numerous codecs and formats, enabling users to prepare media for broadcasting, streaming, or archiving.
    Downloads: 11 This Week
    See Project
  • 2
    Logstash Logback Encoder

    Logback JSON encoder and appenders

    Provides Logback encoders, layouts, and appenders to log in JSON and other formats supported by Jackson. Supports both regular LoggingEvents (logged through a Logger) and AccessEvents (logged via logback-access). Originally written to support output in Logstash's JSON format, but has evolved into a highly configurable, general-purpose, structured logging mechanism for JSON and other Jackson data forms. The structure of the output, and the data it contains, is fully configurable. The general...
    Downloads: 3 This Week
    See Project
  • 3
    simplejson

    simplejson is a simple, fast, extensible JSON encoder/decoder

    ...The encoder can be specialized to provide serialization in any kind of situation, without any special support by the objects to be serialized (somewhat like pickle). This is best done with the default kwarg to dumps.
    Downloads: 12 This Week
    See Project
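A minimal sketch of the `default` hook mentioned in the simplejson entry. It falls back to the standard-library `json` module, whose `dumps` accepts the same `default` keyword, when simplejson is not installed. The `Point` class and `encode_extra` function are hypothetical names used only for illustration.

```python
try:
    import simplejson as json  # preferred if installed
except ImportError:
    import json  # stdlib module; dumps() takes the same `default` kwarg

from datetime import date


class Point:
    """Hypothetical domain type with no JSON support of its own."""

    def __init__(self, x, y):
        self.x = x
        self.y = y


def encode_extra(obj):
    """Called by dumps() only for objects it cannot serialize natively."""
    if isinstance(obj, Point):
        return {"x": obj.x, "y": obj.y}
    if isinstance(obj, date):
        return obj.isoformat()
    raise TypeError(f"{type(obj).__name__} is not JSON serializable")


payload = {"origin": Point(0, 0), "created": date(2024, 1, 2)}
encoded = json.dumps(payload, default=encode_extra, sort_keys=True)
print(encoded)
```

The objects being serialized need no special methods; all of the type-specific logic lives in the one function passed as `default`.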
  • 4
    Pytorch-toolbelt

    PyTorch extensions for fast R&D prototyping and Kaggle farming

    ...Extras for Catalyst library (Visualization of batch predictions, additional metrics). By design, both encoder and decoder produce a list of tensors, from fine (high-resolution, indexed 0) to coarse (low-resolution) feature maps. Access to all intermediate feature maps is beneficial if you want to apply deep supervision losses to them or build an encoder-decoder for an object detection task.
    Downloads: 0 This Week
    See Project
  • 5
    ModernBERT

    Bringing BERT into modernity via both architecture changes and scaling

    ModernBERT is an open-source research project that modernizes the classic BERT encoder architecture by incorporating recent advances in transformer design, training techniques, and efficiency improvements. The goal of the project is to bring BERT-style models up to date with the capabilities of modern large language models while preserving the strengths of bidirectional encoder architectures used for tasks such as classification, retrieval, and semantic search.
    Downloads: 2 This Week
    See Project
  • 6
    rav1e

    The fastest and safest AV1 encoder

    rav1e is an open-source implementation of an encoder for the AV1 video codec, developed in Rust (with some assembly) by the community around Xiph Foundation. Its design philosophy is to start from a correct, minimal, and fast AV1 encoder — sacrificing some encoding speed/efficiency of reference encoders in exchange for simplicity, stability, and compilability across platforms — and then gradually improve.
    Downloads: 0 This Week
    See Project
  • 7
    UltraJSON

    Ultra fast JSON decoder and encoder written in C with Python bindings

    UltraJSON is an ultra-fast JSON encoder and decoder written in pure C with bindings for Python 3.7+. It may be used as a drop-in replacement for most other JSON parsers for Python. The encode_html_chars option enables special encoding of "unsafe" HTML characters into safer Unicode sequences. The ensure_ascii option limits output to ASCII and escapes all extended characters above 127; its default is True. If your end format supports UTF-8, setting ensure_ascii to false is highly recommended to save space.
    Downloads: 0 This Week
    See Project
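To illustrate the space saving that the UltraJSON entry attributes to turning off ASCII-only output, here is a small sketch. As an assumption made so that it runs without ujson installed, it uses the standard-library `json` module, which has an `ensure_ascii` option with the same meaning. The sample dictionary is made up.

```python
import json

text = {"city": "Zürich", "greeting": "こんにちは"}

ascii_only = json.dumps(text, ensure_ascii=True)   # non-ASCII escaped as \uXXXX
utf8_ready = json.dumps(text, ensure_ascii=False)  # characters kept as-is

# Compare the size of each result once written out as UTF-8 bytes.
ascii_size = len(ascii_only.encode("utf-8"))
utf8_size = len(utf8_ready.encode("utf-8"))
print(ascii_size, utf8_size)
```

Both strings decode to the same data; only the on-the-wire size differs, and only when the payload contains non-ASCII characters.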
  • 8
    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is a video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a Swiss Army knife for any video editor. Link to website & downloads: https://www.shutterencoder.com
    - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts
    - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3, OPUS, OGG
    - Editing codecs: DNxHD, DNxHR, Apple ProRes, QT Animation, GoPro CineForm, Uncompressed YUV
    - Output codecs: H.264, H.265, VP8, VP9, AV1, OGV
    - Broadcast codecs: XDCAM HD422, AVC-Intra 100, XAVC, HAP
    - Old codecs: DV PAL, MJPEG, Xvid, WMV, MPEG
    - Archiving codec: FFV1
    - Images creation: JPEG, Image
    - Burn & Rip: DVD, Blu-ray, DVD RIP
    - Analysis: Loudness & True Peak, Audio normalization, Cut detection, Black detection, Media, VMAF
    - Download: Web video
    Downloads: 60 This Week
    See Project
  • 9
    go-json

    Fast JSON encoder/decoder compatible with encoding/json for Go.
    Downloads: 0 This Week
    See Project
  • 10
    rtmp-rtsp-stream-client-java

    Library to stream in RTMP and RTSP for Android. All code in Java

    Library for streaming in RTMP and RTSP. All code in Java.
    Downloads: 0 This Week
    See Project
  • 11
    Typia

    Super-fast/easy runtime validations and serializations

    Super-fast/easy runtime validations and serializations through transformation.
    Downloads: 15 This Week
    See Project
  • 12
    FastVLM

    This repository contains the official implementation of FastVLM

    ...The repository documents model variants, showcases head-to-head numbers against known baselines, and explains how the encoder integrates with common LLM backbones. Apple’s research brief frames FastVLM as targeting real-time or latency-sensitive scenarios, where lowering visual token pressure is critical to interactive UX. In short, it’s a practical recipe to make VLMs fast without exotic token-selection heuristics.
    Downloads: 0 This Week
    See Project
  • 13
    Janus

    Unified Multimodal Understanding and Generation Models

    Janus is a sophisticated open-source project from DeepSeek AI that aims to unify both visual understanding and image generation in a single model architecture. Rather than having separate systems for “look and describe” and “prompt and generate”, Janus uses an autoregressive transformer framework with a decoupled visual encoder—allowing it to ingest images for comprehension and to produce images from text prompts with shared internal representations. The design tackles long-standing conflicts in multimodal models: namely that the visual encoder has to serve both analysis (understanding) and synthesis (generation) roles. By splitting those pathways but keeping one unified core transformer, Janus maintains flexibility and achieves strong performance across tasks previously requiring distinct architectures. ...
    Downloads: 0 This Week
    See Project
  • 14
    Encoder of Death
    Encoder of Death is a video/audio file encoding app.
    - Batch Processing: Encode multiple files simultaneously in a queue.
    - Audio/Video Conversion: Convert between video formats or extract audio from video.
    - Format Support (audio): MP3, WAV, FLAC, AIFF, AAC, M4A, OGG
    - Format Support (video): MP4, MKV, AVI, MOV, WebM, FLV, WMV, MPEG, MPG, M4V, 3GP
    1-27-25: I addressed the issue of FFmpeg not being bundled with the executables.
    Downloads: 2 This Week
    See Project
  • 15
    MedGemma

    Collection of Gemma 3 variants that are trained for performance

    ...It includes multiple variants such as a 4 billion-parameter multimodal model that can process both medical images and text and a 27 billion-parameter text-only (and multimodal) model that offers deeper clinical reasoning and understanding at higher capacity, making it suitable for complex tasks like medical question answering, summarization of clinical notes, or generating reports from radiology images. The multimodal versions use a SigLIP-based image encoder pre-trained on diverse de-identified medical imaging data.
    Downloads: 3 This Week
    See Project
  • 16
    Real-Time Voice Cloning

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    Real-Time Voice Cloning is an influential deep-learning repository that demonstrates how to clone a voice from just a few seconds of audio and then generate arbitrary speech in that voice in near real time. It implements the SV2TTS pipeline (“Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis”) in three stages: a speaker encoder, a synthesizer, and a vocoder. In the first stage, short audio clips are converted into a fixed-dimensional speaker embedding that captures voice characteristics; this embedding is then used by a Tacotron-style synthesizer to generate spectrograms from text, which a WaveRNN-based vocoder finally turns into audio. The repo includes both a command-line demo and a graphical “toolbox” application where you can load reference voices, type text, and hear the synthesized results interactively. ...
    Downloads: 13 This Week
    See Project
  • 17
    IndexTTS2

    Industrial-level controllable zero-shot text-to-speech system

    IndexTTS is a modern, zero-shot text-to-speech (TTS) system engineered to deliver high-quality, natural-sounding speech synthesis with few requirements and strong voice-cloning capabilities. It builds on state-of-the-art models such as XTTS and other modern neural TTS backbones, improving them with a conformer-based speech conditional encoder and upgrading the decoder to a high-quality vocoder (BigVGAN2), leading to clearer and more natural audio output. The system supports zero-shot voice cloning — meaning it can mimic a target speaker’s voice from a short reference sample — making it versatile for multi-voice uses. Compared to many open-source TTS tools, IndexTTS emphasizes efficiency and controllability: it offers faster inference, simpler training pipelines, and controllable speech parameters (like duration, pitch, and prosody), which is critical for production use.
    Downloads: 5 This Week
    See Project
  • 18

    Text Encoder

    Encode Plain Text to Hex, Base64, or UUEncode

    Simple but fully functional text encoder, encode plain text to Hex, Base64, or UUEncode. Easy to use and completely portable. Linux and Windows binaries included. (64 bit)
    Downloads: 0 This Week
    See Project
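The three encodings named in the Text Encoder entry can be sketched with Python's standard library. This is an illustration of the encodings themselves, not of how the listed tool is implemented; the sample string is made up.

```python
import base64
import binascii

plain = "Hello, encoder!".encode("utf-8")

hex_text = binascii.hexlify(plain).decode("ascii")
b64_text = base64.b64encode(plain).decode("ascii")
# b2a_uu encodes a single line of at most 45 bytes and appends a newline.
uu_line = binascii.b2a_uu(plain).decode("ascii")

print(hex_text)
print(b64_text)
print(uu_line, end="")
```

Each encoding is reversible with the matching decoder: `binascii.unhexlify`, `base64.b64decode`, and `binascii.a2b_uu`.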
  • 19

    laravel-encoder

    The Laravel Encoder package provides a robust and secure way to encode

    The Laravel Encoder package provides a robust and secure way to encode and decode IDs & Strings using customizable Base encoding mechanisms (Base62). With support for variable-length encoding, mappers for added security, and seamless integration with Laravel, this package is ideal for obfuscating sensitive data or creating URL-safe identifiers.
    Downloads: 0 This Week
    See Project
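As a rough illustration of the Base62 ID encoding that the laravel-encoder entry describes, here is a minimal Python sketch. It is not the package's API: the function names `encode_id` and `decode_id` and the alphabet order are assumptions, and the package's mappers and variable-length options are not modeled.

```python
import string

# Assumed alphabet order: digits, lowercase, uppercase (62 symbols in total).
ALPHABET = string.digits + string.ascii_lowercase + string.ascii_uppercase
BASE = len(ALPHABET)


def encode_id(number: int) -> str:
    """Convert a non-negative integer ID to a Base62 string."""
    if number < 0:
        raise ValueError("only non-negative IDs are supported")
    if number == 0:
        return ALPHABET[0]
    digits = []
    while number:
        number, remainder = divmod(number, BASE)
        digits.append(ALPHABET[remainder])
    return "".join(reversed(digits))


def decode_id(token: str) -> int:
    """Convert a Base62 string back to the integer ID."""
    number = 0
    for char in token:
        number = number * BASE + ALPHABET.index(char)
    return number


token = encode_id(123456789)
print(token, decode_id(token))
```

Because the alphabet contains only letters and digits, the resulting tokens need no escaping when placed in a URL path.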
  • 20
    nghttp2

    HTTP/2 C Library and tools

    ...Since then we have updated the nghttp2 library constantly to the latest specification, and nghttp2 is now one of the most mature HTTP/2 implementations. HTTP/2 utilizes a header compression method called HPACK. An HPACK encoder and decoder are available as a public API. The nghttp2 library itself is a bit low-level. An experimental high-level C++ API is also available. We have a Python binding of this library, but we have not covered everything yet.
    Downloads: 1 This Week
    See Project
  • 21
    FireRedASR

    Open-source industrial-grade ASR models

    FireRedASR is an industrial-grade family of open-source automatic speech recognition models designed to provide high-precision speech-to-text performance across languages including Mandarin, English, and various Chinese dialects, achieving new state-of-the-art benchmarks on public test sets. The project includes multiple model variants to meet different application needs, such as high-accuracy end-to-end interaction using an encoder-adapter-LLM framework and efficient real-time recognition using attention-based encoder-decoder architectures, giving developers flexibility in balancing performance and resource constraints. FireRedASR not only excels in traditional speech recognition tasks but also demonstrates strong capability in challenging scenarios like singing lyrics recognition, where accurate transcription is often difficult for conventional models.
    Downloads: 0 This Week
    See Project
  • 22
    Poison

    An incredibly fast, pure Elixir JSON library

    Poison is a fast and lightweight JSON library for Elixir focused on performance and idiomatic APIs. It provides straightforward encode and decode functions, along with a protocol-based encoder that lets you customize how your structs become JSON. Developers can derive or implement Poison.Encoder for domain types, control which fields are included, and map complex values into JSON-friendly forms. On the decoding side, it supports options for key handling and flexible parsing of JSON into Elixir maps, lists, and primitive values. ...
    Downloads: 0 This Week
    See Project
  • 23
    Bootstrap Your Own Latent (BYOL)

    Usable Implementation of "Bootstrap Your Own Latent" self-supervised

    Practical implementation of an astoundingly simple method for self-supervised learning that achieves a new state-of-the-art (surpassing SimCLR) without contrastive learning or having to designate negative pairs. This repository offers a module that can easily wrap any image-based neural network (residual network, discriminator, policy network) to immediately start benefitting from unlabelled image data. There is now new evidence that batch normalization is key to making this technique...
    Downloads: 0 This Week
    See Project
  • 24
    Perception Models

    State-of-the-art Image & Video CLIP, Multimodal Large Language Models

    Perception Models is a state-of-the-art framework developed by Facebook Research for advanced image and video perception tasks. It introduces two primary components: the Perception Encoder (PE) for visual feature extraction and the Perception Language Model (PLM) for multimodal decoding and reasoning. The PE module is a family of vision encoders designed to excel in image and video understanding, surpassing models like SigLIP2, InternVideo2, and DINOv2 across multiple benchmarks. Meanwhile, PLM integrates with PE to power vision-language modeling, achieving results competitive with leading multimodal systems such as QwenVL2.5 and InternVL3, all while being fully reproducible with open data. ...
    Downloads: 1 This Week
    See Project
  • 25
    vJEPA-2

    PyTorch code and models for VJEPA2 self-supervised learning from video

    VJEPA2 is a next-generation self-supervised learning framework for video that extends the “predict in representation space” idea from i-JEPA to the temporal domain. Instead of reconstructing pixels, it predicts the missing high-level embeddings of masked space-time regions using a context encoder and a slowly updated target encoder. This objective encourages the model to learn semantics, motion, and long-range structure without the shortcuts that pixel-level losses can invite. The architecture is designed to scale: spatiotemporal ViT backbones, flexible masking schedules, and efficient sampling let it train on long clips while remaining stable. ...
    Downloads: 1 This Week
    See Project
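One common way to realize a "slowly updated" target encoder, as the vJEPA-2 entry puts it, is an exponential moving average (EMA) of the context encoder's weights. The listing does not say which update rule the project uses, so the EMA rule, the momentum value, and the flat weight lists below are all assumptions made for illustration.

```python
def ema_update(target, context, momentum=0.99):
    """Move each target weight a small step toward the context weight."""
    return [momentum * t + (1.0 - momentum) * c for t, c in zip(target, context)]


context_weights = [1.0, 2.0, 3.0]  # stand-in for the trained context encoder
target_weights = [0.0, 0.0, 0.0]   # stand-in for the target encoder

for step in range(3):
    target_weights = ema_update(target_weights, context_weights)
    print(step, target_weights)
```

With a momentum close to 1, the target weights trail the context weights by many steps, which is what makes the target change slowly.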