...It enables agents to plan, reason, and communicate in multi-turn negotiations over shared resources, with the aim of maximizing the outcome each agent negotiates. The framework provides code for both supervised learning (training on human dialogue data) and reinforcement learning (via self-play and rollout-based planning). It also introduces a hierarchical latent model in which high-level intents are first clustered and then translated into coherent language, which improves dialogue diversity and goal consistency. ...
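
To make the idea of rollout-based planning more concrete, here is a minimal sketch in plain Python. It is not the framework's actual API: `plan_by_rollouts`, `toy_simulate`, the candidate utterances, and the reward values are all hypothetical stand-ins chosen for illustration. The assumed setup is that each candidate utterance is scored by simulating the rest of the dialogue several times and averaging the final reward, and the agent then says the highest-scoring candidate.

```python
import random
from statistics import mean


def plan_by_rollouts(candidates, simulate, num_rollouts=10, rng=None):
    """Return the candidate with the highest average simulated reward.

    `simulate(utterance, rng)` is an assumed callback that plays out the
    rest of the negotiation after `utterance` and returns a final reward.
    """
    if rng is None:
        rng = random.Random(0)  # fixed seed so the sketch is reproducible
    scores = {}
    for utterance in candidates:
        # Average over several simulated continuations of the dialogue.
        rewards = [simulate(utterance, rng) for _ in range(num_rollouts)]
        scores[utterance] = mean(rewards)
    best = max(scores, key=scores.get)
    return best, scores


def toy_simulate(utterance, rng):
    """Hypothetical simulator: a fixed expected reward plus small noise.

    A real system would instead sample a dialogue continuation from a
    learned model and score the resulting agreement.
    """
    expected_reward = {
        "I want the book": 6.0,
        "I need the hats": 4.0,
        "You can have everything": 1.0,
    }
    return expected_reward[utterance] + rng.uniform(-0.5, 0.5)


candidates = ["I want the book", "I need the hats", "You can have everything"]
best, scores = plan_by_rollouts(candidates, toy_simulate, num_rollouts=20)
print(best)
```

With these toy rewards the gap between candidates is larger than the noise, so the planner selects "I want the book". In a real setting, the number of rollouts trades off compute against the variance of each score estimate.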