NitroGen is a foundation model for generalist gaming agents developed under the MineDojo initiative, aimed at training a vision-action AI that can play and interact with a wide variety of games by taking pixel inputs and predicting gamepad actions. As an open research model, NitroGen is trained on extensive gameplay data spanning thousands of hours and hundreds of games to instill broad, generalizable gaming competency rather than skill at a single title. This approach enables the model to control agents in different game genres and contexts, performing tasks that range from complex exploration and combat to fine-grained control in platformers, demonstrating adaptability across unseen environments. The project draws on MineDojo’s broader ecosystem for embodied AI, where multi-modal inputs and richly diverse benchmarks help push toward generalist AI capable of interactive decision making.
Features
- Vision-to-action foundation model
- Trained across many game environments
- Generalist gameplay capabilities
- Open benchmarking suite
- Multi-modal input handling
- Research-oriented toolset