PyTorch extensions for fast R&D prototyping and Kaggle farming
simplejson is a simple, fast, extensible JSON encoder/decoder
Robust Speech Recognition via Large-Scale Weak Supervision
A SOTA open-source image editing model
A simple but complete full-attention transformer
Accurate × Fast × Comprehensive
Data manipulation and transformation for audio signal processing
LLM training code for MosaicML foundation models
The unofficial python package that returns response of Google Bard
AV1 Image File Format Specification - ISO-BMFF/HEIF derivative
Open-source industrial-grade ASR models
Industrial-level controllable zero-shot text-to-speech system
TorchMultimodal is a PyTorch library
End-to-end speech processing toolkit
Segmentation models with pretrained backbones. PyTorch
A Conversational Speech Generation Model
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Consistency Distilled Diff VAE
Basaran, an open-source alternative to the OpenAI text completion API
Neural machine translation and sequence learning using TensorFlow
Implementation of NÜWA, attention network for text to video synthesis
Text-conditional image generation model based on OpenAI's unCLIP
CPT: A Pre-Trained Unbalanced Transformer
State-of-the-art deep learning based audio codec
Singing Voice Synthesis via Shallow Diffusion Mechanism