Grok-1 is a 314-billion-parameter Mixture-of-Experts (MoE) large language model developed by xAI. Designed to optimize computational efficiency, it activates only 25% of its weights for each input token. In March 2024, xAI released Grok-1's model weights and architecture under the Apache 2.0 license, making them openly accessible to developers. The accompanying GitHub repository provides JAX example code for loading and running the model. Due to its substantial size, utilizing Grok-1 requires a machine with significant GPU memory. The repository's MoE layer implementation prioritizes correctness over efficiency, avoiding the need for custom kernels.
This is a full repo snapshot ZIP file of the Grok-1 code.
Features
- 314-billion-parameter Mixture-of-Experts (MoE) architecture
- Efficient computation, activating only 25% of parameters per token
- Fully open-source under the Apache 2.0 license
- JAX-based implementation with example code provided
- Scalable and modular, designed for research and development
- Requires high GPU memory for inference and fine-tuning
- Correctness-focused MoE layer without custom kernels
- Optimized for natural language understanding and generation
License
Apache License V2.0Follow Grok-1
Other Useful Business Software
MongoDB Atlas runs apps anywhere
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Rate This Project
Login To Rate This Project
User Reviews
-
Really great model