GLM-5: From Vibe Coding to Agentic Engineering
A multimodal model for brain response prediction
Analyze computation-communication overlap in V3/R1
Tool for exploring and debugging transformer model behaviors
An experimental version of DeepSeek model
Bidirectional token-classification model for identifiable info
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Multimodal 7B model for image, video, and text understanding tasks
Compact 3B-param multimodal model for efficient on-device reasoning