ERNIE-4.5-VL-28B-A3B-Paddle is a multimodal Mixture-of-Experts (MoE) chat model designed for complex image-text tasks. It has 28 billion total parameters, of which 3 billion are activated per token. Built on PaddlePaddle, it excels at tasks such as visual question answering, description generation, and multimodal reasoning. It employs a heterogeneous MoE architecture that supports both thinking and non-thinking inference modes. The model benefits from advanced pretraining and post-training strategies,...
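To illustrate why an MoE model can activate only a fraction of its parameters per token, here is a minimal sketch of generic top-k expert routing in plain Python. It is not ERNIE's actual router: the expert count, the value of k, the toy experts, and the router scores are all hypothetical and chosen only for illustration. The only figures taken from the description are the 28 billion total and 3 billion activated parameters.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def top_k_route(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are renormalized so they sum to 1 over the chosen experts.
    """
    probs = softmax(router_logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in chosen)
    return [(i, probs[i] / norm) for i in chosen]

def moe_forward(x, experts, router_logits, k):
    """Run only the selected experts and mix their outputs by router weight."""
    routes = top_k_route(router_logits, k)
    return sum(w * experts[i](x) for i, w in routes)

# Hypothetical toy setup: 8 experts, 2 active per token (assumed values).
NUM_EXPERTS = 8
TOP_K = 2
# Each toy "expert" just scales its input; real experts are feed-forward blocks.
experts = [lambda x, s=s: s * x for s in range(1, NUM_EXPERTS + 1)]
# Hypothetical router scores for a single token.
router_logits = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0, -0.5, 1.0]

routes = top_k_route(router_logits, TOP_K)
output = moe_forward(1.0, experts, router_logits, TOP_K)

# Fraction of parameters active per token, from the figures in the description.
active_fraction = 3 / 28

print("selected experts:", [i for i, _ in routes])
print("output:", round(output, 3))
print("active fraction: {:.1%}".format(active_fraction))
```

In this toy run only two of the eight experts are evaluated for the token, which is the same idea behind roughly 3 of 28 billion parameters (about 10.7%) being used per token.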