Multi-modal large language model designed for audio understanding
Contexts Optical Compression
Language modeling in a sentence representation space
High-Fidelity and Controllable Generation of Textured 3D Assets
Code for the paper Hybrid Spectrogram and Waveform Source Separation
800,000 step-level correctness labels on LLM solutions to MATH problem
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Large-scale autoregressive pixel model for image generation by OpenAI
Large-scale xAI model for local inference with SGLang, Grok-2.5
VaultGemma: 1B DP-trained Gemma variant for private NLP tasks