Analyze computation-communication overlap in V3/R1
An experimental version of DeepSeek model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Claude Code action for GitHub PRs
Foundation Models for Time Series
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Unified Multimodal Understanding and Generation Models
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Pretrained time-series foundation model developed by Google Research
Generate Any 3D Scene in Seconds
ICLR2024 Spotlight: curation/training code, metadata, distribution
A PyTorch library for implementing flow matching algorithms
Foundational Models for State-of-the-Art Speech and Text Translation
Advancing Formal Mathematical Reasoning via Reinforcement Learning
FlashMLA: Efficient Multi-head Latent Attention Kernels
Compact English sentence embedding model for semantic search tasks
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video
CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
Open, non-commercial SDXL model for quality image generation