Hunyuan-Vision-1.5
HunyuanVision is a cutting-edge vision-language model developed by Tencent’s Hunyuan team. It uses a mamba-transformer hybrid architecture to deliver strong performance and efficient inference in multimodal reasoning tasks. The version Hunyuan-Vision-1.5 is designed for “thinking on images,” meaning it not only understands vision+language content, but can perform deeper reasoning that involves manipulating or reflecting on image inputs, such as cropping, zooming, pointing, box drawing, or drawing on the image to acquire additional knowledge. It supports a variety of vision tasks (image + video recognition, OCR, diagram understanding), visual reasoning, and even 3D spatial comprehension, all in a unified multilingual framework. The model is built to work seamlessly across languages and tasks and is intended to be open sourced (including checkpoints, technical report, inference support) to encourage the community to experiment and adopt.
Learn more
HunyuanOCR
Tencent Hunyuan is a large-scale, multimodal AI model family developed by Tencent that spans text, image, video, and 3D modalities, designed for general-purpose AI tasks like content generation, visual reasoning, and business automation. Its model lineup includes variants optimized for natural language understanding, multimodal vision-language comprehension (e.g., image & video understanding), text-to-image creation, video generation, and 3D content generation. Hunyuan models leverage a mixture-of-experts architecture and other innovations (like hybrid “mamba-transformer” designs) to deliver strong performance on reasoning, long-context understanding, cross-modal tasks, and efficient inference. For example, the vision-language model Hunyuan-Vision-1.5 supports “thinking-on-image”, enabling deep multimodal understanding and reasoning on images, video frames, diagrams, or spatial data.
Learn more
Tencent Yuanbao
Tencent Yuanbao is an AI-powered assistant that has quickly become popular in China, leveraging advanced large language models, including Tencent's proprietary Hunyuan model, and integrating with DeepSeek. The application excels in areas like Chinese language processing, logical reasoning, and efficient task execution. Yuanbao's popularity has surged in recent months, even surpassing competitors such as DeepSeek to top the Apple App Store download charts in China. A key driver of its growth is its deep integration into the Tencent ecosystem, particularly within WeChat, further enhancing its accessibility and functionality. This rapid rise highlights Tencent's growing ambition in the competitive AI assistant market.
Learn more
Hunyuan3D 2.0
Tencent Hunyuan 3D is an AI-powered platform developed by Tencent that specializes in generating 3D content. Leveraging advanced artificial intelligence technology, the platform allows users to create realistic and dynamic 3D models and animations efficiently. It is designed for industries such as gaming, virtual reality, and digital media, offering a streamlined solution for high-quality 3D asset creation.
Learn more