GLM-5: From Vibe Coding to Agentic Engineering
Analyze computation-communication overlap in V3/R1
A multimodal model for brain response prediction
An experimental version of DeepSeek model
Tool for exploring and debugging transformer model behaviors
Bidirectional token-classification model for identifiable info
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Multimodal 7B model for image, video, and text understanding tasks
Compact 3B-param multimodal model for efficient on-device reasoning