PyTorch extensions for fast R&D prototyping and Kaggle farming
Robust Speech Recognition via Large-Scale Weak Supervision
Provides code for running inference with the SegmentAnything Model
A Family of Open Foundation Models for Code Intelligence
Accurate × Fast × Comprehensive
A simple but complete full-attention transformer
Industrial-level controllable zero-shot text-to-speech system
Fast inference engine for Transformer models
Data manipulation and transformation for audio signal processing
End-to-end speech processing toolkit
LLM training code for MosaicML foundation models
Pretrained time-series foundation model developed by Google Research
Multimodal model achieving SOTA performance
OpenAI swift async text to image for SwiftUI app using OpenAI
A MATLAB package for modelling multivariate stimulus-response data
The unofficial python package that returns response of Google Bard
A Conversational Speech Generation Model
DeepSeek LLM: Let there be answers
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Consistency Distilled Diff VAE
Basaran, an open-source alternative to the OpenAI text completion API
Neural machine translation and sequence learning using TensorFlow
Transformer related optimization, including BERT, GPT
Implementation of NÜWA, attention network for text to video synthesis
Text-conditional image generation model based on OpenAI's unCLIP