VITS2 backbone with multilingual-bert
A high quality MP3 encoder
This package provides the ability to encode golang structs
Generate 3D objects conditioned on text or images
Batch audio encoding script for Linux/BSD
A C++ library for AVR and NodeMCU
Led dimming code for rotary encoder with turbo
An open-source framework for training large multimodal models
JBIG2 Encoder
Basaran, an open-source alternative to the OpenAI text completion API
Meta-Transformer for Unified Multimodal Learning
QRCode Encoder written in Pure Go
Neural machine translation and sequence learning using TensorFlow
Arduino Focuser, fully ASCOM complaint
Official codebase for I-JEPA
Singing voice change based on whisper, lora for singing voice clone
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
Transformer related optimization, including BERT, GPT
CPT: A Pre-Trained Unbalanced Transformer
A Very Low-Bitrate Codec for Speech Compression
Haptic input knob with software-defined endstops and virtual detents
Text-conditional image generation model based on OpenAI's unCLIP
A latent text-to-image diffusion model
A High Performance Library for Sequence Processing and Generation