Ministral 3 8B Instruct 2512 is a balanced, efficient model in the Ministral 3 family that offers strong multimodal capabilities in a compact footprint. It combines an 8.4B-parameter language model with a 0.4B-parameter vision encoder, enabling both text reasoning and image understanding. This instruction-tuned FP8 variant is optimized for chat, instruction following, and structured outputs, which makes it well suited to everyday assistant tasks and lightweight agentic workflows. Designed for edge deployment, the model runs on a wide range of hardware and fits on a single 12 GB GPU, with the option of even smaller quantized configurations. Its multilingual support covers dozens of major languages, so it can be used across diverse global environments and applications. The model adheres reliably to system prompts, supports native function calling, and outputs clean JSON, giving it strong tool-use behavior.
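To make the function-calling and JSON claims concrete, here is a minimal sketch of how an application might describe a tool and validate a JSON tool call returned by a model. It assumes the widely used JSON-Schema "function" style of tool description; the exact request and response format depends on the serving stack. The tool name `get_weather`, the helper `parse_tool_call`, and the sample output string are hypothetical and are not taken from the model's documentation.

```python
import json

# Hypothetical tool description in the common JSON-Schema "function" style.
# The real format expected by a given serving stack may differ.
WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Return the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def parse_tool_call(raw: str, tool: dict) -> dict:
    """Parse a JSON tool call and check it against the tool's required arguments."""
    call = json.loads(raw)
    spec = tool["function"]
    if call.get("name") != spec["name"]:
        raise ValueError(f"unexpected tool: {call.get('name')!r}")
    args = call.get("arguments", {})
    # Some stacks return arguments as a JSON-encoded string; accept both forms.
    if isinstance(args, str):
        args = json.loads(args)
    missing = [key for key in spec["parameters"]["required"] if key not in args]
    if missing:
        raise ValueError(f"missing arguments: {missing}")
    return args


# Illustrative model output, not a captured response.
example_output = '{"name": "get_weather", "arguments": {"city": "Paris"}}'
print(parse_tool_call(example_output, WEATHER_TOOL))
```

Validating the call before executing it keeps a malformed or unexpected tool request from reaching application code, which matters most in unattended agentic workflows.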
Features
- 8.4B language model paired with a 0.4B vision encoder for multimodal tasks
- FP8 instruct-tuned weights optimized for chat and instruction use cases
- Runs locally on a single 12 GB GPU and uses even less memory when quantized further
- Supports dozens of languages, such as English, Spanish, German, Chinese, and Arabic
- Strong system-prompt adherence for predictable instruction execution
- Native function calling and JSON output for agentic workflows
- Large 256k-token context window for extended reasoning and document processing
- Edge-optimized design suitable for embedded, offline, and resource-constrained environments
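The single-GPU claim is consistent with a back-of-the-envelope estimate of weight storage. The sketch below assumes every parameter (language model plus vision encoder) is stored at the stated precision, and it counts weights only: the KV cache, activations, and runtime overhead are ignored, and they grow with context length. The helper `weight_memory_gb` and the 4-bit row are hypothetical illustrations, not published figures.

```python
def weight_memory_gb(params_billions: float, bits_per_param: float) -> float:
    """Approximate weight storage in decimal GB (1 GB = 1e9 bytes)."""
    return params_billions * bits_per_param / 8


# 8.4B language model + 0.4B vision encoder, as stated in the description.
TOTAL_PARAMS_B = 8.4 + 0.4
GPU_MEMORY_GB = 12

# Assumption: all weights use the same precision; 4-bit is an illustrative option.
for label, bits in [("FP8", 8), ("4-bit (hypothetical)", 4)]:
    weights = weight_memory_gb(TOTAL_PARAMS_B, bits)
    headroom = GPU_MEMORY_GB - weights
    print(f"{label}: ~{weights:.1f} GB weights, ~{headroom:.1f} GB headroom")
```

At one byte per parameter, the FP8 weights come to roughly 8.8 GB, which leaves about 3 GB on a 12 GB card for everything else. Long prompts that approach the full context window would likely need more headroom than that, so the practical context length on such a card depends on the runtime and its cache settings.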