Grok-1
Open-source, high-performance Mixture-of-Experts large language model
...Due to its substantial size, utilizing Grok-1 requires a machine with significant GPU memory. The repository's MoE layer implementation prioritizes correctness over efficiency, avoiding the need for custom kernels.
This is a full repo snapshot ZIP file of the Grok-1 code.