Hermes 4 405B FP8 is a cutting-edge large language model developed by Nous Research, built on Llama-3.1-405B and optimized for frontier reasoning and alignment. It introduces a hybrid reasoning mode with explicit <think> segments, enabling the model to deliberate deeply when needed and switch to faster responses when desired. Post-training improvements include a vastly expanded corpus with ~60B tokens, boosting performance across math, code, STEM, logic, creativity, and structured outputs. The model is designed for schema adherence, producing valid JSON and repairing malformed outputs, making it highly suitable for tool use and function calling. Hermes 4 is engineered for superior steerability with reduced refusal rates, aligning responses to user values while preserving assistant quality. It achieves state-of-the-art results on RefusalBench, outperforming both closed and open models in balancing helpfulness with adaptability.

Features

  • Based on Llama-3.1-405B with 406B parameters
  • Hybrid reasoning mode with <think> deliberation segments
  • Trained on ~5M samples / ~60B tokens post-training corpus
  • State-of-the-art performance on RefusalBench benchmark
  • Schema adherence and structured JSON outputs with error repair
  • Supports function calling, tool use, and role-based chat formats
  • Improved steerability with reduced refusal rates and user-aligned values
  • Available in FP8, BF16, and GGUF quantized variants for flexible deployment

Project Samples

Project Activity

See All Activity >

Categories

AI Models

Follow Hermes 4

Hermes 4 Web Site

Other Useful Business Software
Retool your internal operations Icon
Retool your internal operations

Generate secure, production-grade apps that connect to your business data. Not just prototypes, but tools your team can actually deploy.

Build internal software that meets enterprise security standards without waiting on engineering resources. Retool connects to your databases, APIs, and data sources while maintaining the permissions and controls you need. Create custom dashboards, admin tools, and workflows from natural language prompts—all deployed in your cloud with security baked in. Stop duct-taping operations together, start building in Retool.
Build an app in Retool
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Hermes 4!

Additional Project Details

Registered

2025-09-01