Claude Code action for GitHub PRs
PyTorch implementation of JiT
Instructions on how to use the Realtime API on Microcontrollers
4M: Massively Multimodal Masked Modeling
New set of lightweight state-of-the-art, open foundation models
Foundation Models for Time Series
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Code for the paper "Improved Techniques for Training GANs"
Fast uncensored Gemma model optimized for local chat and coding
685B model with improved agents and consistency