Showing 359 open source projects for "encoder"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    EnCodec

    EnCodec

    State-of-the-art deep learning based audio codec

    ...Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. The model can operate in real time and supports variable bandwidths, bitrates, and multi-band audio. Encodec has applications in speech and music compression, generative modeling, and efficient data transmission for communication systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    MAE (Masked Autoencoders)

    MAE (Masked Autoencoders)

    PyTorch implementation of MAE

    ...It trains a Vision Transformer (ViT) by randomly masking a high percentage of image patches (typically 75%) and reconstructing the missing content from the remaining visible patches. This forces the model to learn semantic structure and global context without supervision. The encoder processes only the visible patches, while a lightweight decoder reconstructs the full image—making pretraining computationally efficient. After pretraining, the encoder serves as a powerful backbone for downstream tasks like image classification, segmentation, and detection, achieving top performance with minimal fine-tuning. The repository provides pretrained models, fine-tuning scripts, evaluation protocols, and visualization tools for reconstruction quality and learned features.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    DeepLabv3 Plus

    DeepLabv3 Plus

    Encoder-Decoder with Atrous Separable Convolution

    DeepLabv3 Plus is a PyTorch implementation of DeepLabv3+ for semantic segmentation. It implements the encoder-decoder architecture with atrous separable convolution and provides a practical workflow for training, prediction, and mIoU evaluation. The repository supports VOC-style segmentation datasets and includes utilities for annotation generation, JSON dataset conversion, model summary inspection, prediction, and metric calculation. It provides pretrained weight workflows for MobileNetV2 and Xception backbones and notes that the correct backbone should be selected during training and prediction. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Deep learning time series forecasting

    Deep learning time series forecasting

    Deep learning PyTorch library for time series forecasting

    ...Historically, this repository provided open-source benchmarks and codes for flash flood and river flow forecasting. Full transformer (SimpleTransformer in model_dict): The full original transformer with all 8 encoder and decoder blocks. Requires passing the target in at inference.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Reformer PyTorch

    Reformer PyTorch

    Reformer, the efficient Transformer, in Pytorch

    This is a Pytorch implementation of Reformer. It includes LSH attention, reversible network, and chunking. It has been validated with an auto-regressive task (enwik8).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Mocking Bird

    Mocking Bird

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    ...It builds on deep-learning based TTS / voice-cloning technology (in the lineage of projects such as Real-Time-Voice-Cloning), but extends it with support for Mandarin Chinese and multiple Chinese speech datasets — broadening its applicability beyond English. The codebase is implemented in Python (with PyTorch) and includes modules for encoder, synthesizer, vocoder, preprocessing, and inference, as well as demo scripts and a web-server interface for easier experimentation or deployment. MockingBird supports both using pretrained models and training your own synthesizer (with custom datasets), giving flexibility for voice-cloning or custom-voice synthesis depending on your needs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    fairseq-lua

    fairseq-lua

    Facebook AI Research Sequence-to-Sequence Toolkit

    fairseq-lua is the original Lua/Torch7 version of Facebook AI Research’s sequence modeling toolkit, designed for neural machine translation (NMT) and sequence generation. It introduced early attention-based architectures and training pipelines that later evolved into the modern PyTorch-based fairseq. The framework implements sequence-to-sequence models with attention, beam search decoding, and distributed training, providing a research platform for exploring translation, summarization, and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    file-splitter-rejoiner

    file splitter and rejoiner

    /* * * Freeware * Open Source * 2 tools in one application * using .Net 4.8 * (1) Simple files splitter and rejoiner tool using memory buffer * (2) Simple files base64 encoder and decoder using random sized Stream GB/TB+ data sizes * A good tool for an essentials inventory * Just when required. * Simple precise short and straightforward coding * Tested bugs free and perfect when I developed and released it. * * Developer: Tushar Jain * Release Time: 09:33 PM * Release Date: Friday, 23 April 2021 * * */
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    AliceMind

    AliceMind

    ALIbaba's Collection of Encoder-decoders from MinD

    This repository provides pre-trained encoder-decoder models and its related optimization techniques developed by Alibaba's MinD (Machine IntelligeNce of Damo) Lab. Pre-trained models for natural language understanding (NLU). We extend BERT to a new model, StructBERT, by incorporating language structures into pre-training. Specifically, we pre-train StructBERT with two auxiliary tasks to make the most of the sequential order of words and sentences, which leverage language structures at the word and sentence levels, respectively. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10

    mp3guessenc

    What's inside your mp3 files REALLY ?

    mp3guessenc is based upon the original project by Naoki Shibata. This command line utility was born to detect the encoder used for a mpeg layer III file but now it can do more and scan any mpeg audio file (any layer) and print a LOT of useful information. Latest modifications include fixes, new features and code optimizations.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    TransPose

    TransPose

    PyTorch Implementation for "TransPose, Keypoint localization

    TransPose is a human pose estimation model based on a CNN feature extractor, a Transformer Encoder, and a prediction head. Given an image, the attention layers built in Transformer can efficiently capture long-range spatial relationships between keypoints and explain what dependencies the predicted keypoints locations highly rely on.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Denoiser

    Denoiser

    Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)

    Denoiser is a real-time speech enhancement model operating directly on raw waveforms, designed to clean noisy audio while running efficiently on CPU. It uses a causal encoder-decoder architecture with skip connections, optimized with losses defined both in the time domain and frequency domain to better suppress noise while preserving speech. Unlike models that operate on spectrograms alone, this design enables lower latency and coherent waveform output. The implementation includes data augmentation techniques applied to the raw waveforms (e.g. noise mixing, reverberation) to improve model robustness and generalization to diverse noise types. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    TTS

    TTS

    Deep learning for text to speech

    ...Notebooks for extensive model benchmarking. Modular (but not too much) code base enabling easy testing for new ideas. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings efficiently. Vocoder models (MelGAN, Multiband-MelGAN, GAN-TTS, ParallelWaveGAN, WaveGrad, WaveRNN). If you are only interested in synthesizing speech with the released TTS models, installing from PyPI is the easiest option.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Abdal Base64 Encoder Decoder

    Abdal Base64 Encoder Decoder

    Base64 Encoder Decoder Tools

    Abdal Base64 Encoder Decoder tool can perform encryption and decryption process without any restrictions and completely free of charge, but be careful that this tool should not be used for encrypting very sensitive information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    ALAE

    ALAE

    Adversarial Latent Autoencoders

    ...The project implements the architecture introduced in the CVPR research paper on Adversarial Latent Autoencoders, which focuses on improving generative modeling by learning latent representations aligned with adversarial training objectives. Unlike traditional GANs that directly generate images from random noise, ALAE uses an encoder-decoder architecture that maps images into a structured latent space and then reconstructs them through adversarial training. This design allows the model to learn interpretable latent representations that can be manipulated to control generated image attributes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    clzw

    clzw

    Implementation of LZW compression algorithm in C

    Simple, fast implementation of LZW (Lempel–Ziv–Welch) data compression algorithm in C. - Console encoder/decoder tools - OS independent - Could be used in embedded projects - Works with raw code-stream LZW features: - Hardcoded dictionary size - Variable code size - Code search is performed by hash table and embedded in dictionary linked lists (encoder) - No dynamic memory allocation
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Imagination
    .... *** https://github.com/colossus73/imagination *** Imagination is a lightweight and simple DVD slide show maker written in C language and built with the GTK+3 toolkit and Cairo featuring 69 transitions. It requires the ffmpeg encoder and libsox to export the slideshow.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 18
    SentEval

    SentEval

    A python tool for evaluating the quality of sentence embeddings

    SentEval is a standardized toolkit for evaluating sentence embeddings across a wide spectrum of downstream tasks and probing tests. It defines a simple interface—provide an encoder function from sentences to vectors—and then runs consistent training/evaluation loops for tasks like sentiment, entailment, paraphrase, and semantic textual similarity. The suite also contains linguistic probing tasks that illuminate what properties embeddings capture, such as tense, word order, or syntactic structure. Datasets are wrapped with unified preprocessing and metrics so results are comparable across papers and implementations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Multilingual Speech Synthesis

    Multilingual Speech Synthesis

    An implementation of Tacotron 2 that supports multilingual experiments

    ...The first shares the whole encoder and uses an adversarial classifier to remove speaker-dependent information from the encoder. The second has separate encoders for each language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    tkmpeg

    TK MPEG 1 encoder. Tcl/Tk 8.5/8.6 TEA Compatable.

    TK MPEG 1 encoder. Tcl/Tk 8.5/8.6 TEA Compatable. Based on ezMPEG. http://sourceforge.net/projects/ezmpeg
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Fox Encoder

    Multipurpose Encoder/decoder

    Encode / Decode (from and to) text, hex, base64. Also supports hash function including MD4, MD5, SHA1, SHA2, SHA3, Keccak including most lengths (224, 256, 384 and 512),
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    collect_elemental

    Collects necessary TechSupport data for AWS Elemental Users

    This script provides AWS Elemental TechSupport with all the necessary logs and data specific to troubleshooting issues. Run this on any conductor, statmux, or encoder. It collects the following: * System Logs (/var/log) * Web (/opt/elemental_se/web/log) * Database logs (/data/pgsql/logs) * Database Backups (/home/elemental/database_backups) * Elemental Core Dumps (/data/server/cores) * System Core Dumps (/var/crash) * IPMI data (ipmi sel list sensor) * Elemental Node Info (/etc/motd /opt/elemental_se/eme.lic conductor.lic ui.lic versions.txt) * Event IDs
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    DETR

    DETR

    End-to-end object detection with transformers

    ...Unlike traditional computer vision techniques, DETR approaches object detection as a direct set prediction problem. It consists of a set-based global loss, which forces unique predictions via bipartite matching, and a Transformer encoder-decoder architecture. Given a fixed small set of learned object queries, DETR reasons about the relations of the objects and the global image context to directly output the final set of predictions in parallel. Due to this parallel nature, DETR is very fast and efficient.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    BitShade

    BitShade

    AES Cipher, Base64 encoder, SHA calculator

    BitShade is a graphic utility to encrypt/decrypt with AES, base64 encode/decode, calculate the SHA checksum. It is cross platform, written in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Zopfli

    Zopfli

    Zopfli Compression Algorithm is a compression library

    Zopfli is a compression library and command-line tool that produces exceptionally small DEFLATE, zlib, and gzip streams by spending more CPU time to search for better encodings. It keeps strict compatibility with the ubiquitous DEFLATE format, so outputs can be decompressed by any standard tool or browser. The encoder performs exhaustive block splitting and greedy but thorough match searching to shave extra bytes off assets, which is ideal for web content and firmware where size matters more than compression time. A companion utility, zopflipng, targets PNGs by trying alternate filter strategies and recompressing IDAT chunks, often achieving additional savings without changing image quality. ...
    Downloads: 2 This Week
    Last Update:
    See Project
Auth0 Logo