PyTorch extensions for fast R&D prototyping and Kaggle farming
Robust Speech Recognition via Large-Scale Weak Supervision
Provides code for running inference with the Segment Anything Model (SAM)
A Family of Open Foundation Models for Code Intelligence
A SOTA open-source image editing model
A simple but complete full-attention transformer
Multimodal model achieving SOTA performance
Accurate × Fast × Comprehensive
Fast inference engine for Transformer models
Data manipulation and transformation for audio signal processing
LLM training code for MosaicML foundation models
Unofficial Python package that returns responses from Google Bard
Open-source industrial-grade ASR models
Industrial-level controllable zero-shot text-to-speech system
Pretrained time-series foundation model developed by Google Research
Swift async text-to-image generation for SwiftUI apps using OpenAI
End-to-end speech processing toolkit
DeepSeek LLM: Let there be answers
A MATLAB package for modelling multivariate stimulus-response data
A Conversational Speech Generation Model
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Consistency Distilled Diff VAE
Basaran, an open-source alternative to the OpenAI text completion API
Neural machine translation and sequence learning using TensorFlow
Implementation of NÜWA, an attention network for text-to-video synthesis