Robust Speech Recognition via Large-Scale Weak Supervision
Provides code for running inference with the SegmentAnything Model
A Family of Open Foundation Models for Code Intelligence
A standalone, portable generic Ada package for decoding images
Accurate × Fast × Comprehensive
Industrial-level controllable zero-shot text-to-speech system
A HEVC/H.265 Web Player
End-to-end speech processing toolkit
Boilerplate-free Kotlin config library for loading configuration files
TorchMultimodal is a PyTorch library
A Foundation Model for the Language of Financial Markets
An incredibly fast, pure Elixir JSON library
Pretrained time-series foundation model developed by Google Research
Multimodal model achieving SOTA performance
AV1 Image File Format Specification - ISO-BMFF/HEIF derivative
Audio codecs extracted from Android Open Source Project
Free online developer tools JSON formatter, Base64 encoder, and more
A Conversational Speech Generation Model
110+ developer tools as native MacOS, Linux & Windows desktop apps.
DeepSeek LLM: Let there be answers
Blazing fast and correct x86/x64 disassembler, assembler, decoder, etc
An Ada 2012 library for reading and writing PNG image files
Transformer related optimization, including BERT, GPT
A High Performance Library for Sequence Processing and Generation