Interaction Dynamics as a Reward Signal for LLMs
arxiv.org·2h
Flag this post

Title:Interaction Dynamics as a Reward Signal for LLMs

View PDF HTML (experimental)

Abstract:The alignment of Large Language Models (LLMs) for multi-turn conversations typically relies on reward signals derived from the content of the text. This approach, however, overlooks a rich, complementary source of signal: the dynamics of the interaction itself. This paper introduces TRACE (Trajectory-based Reward for Agent Collaboration Estimation), a novel reward signal derived from the geometric properties of a dialogue’s embedding trajectory–a concept we term ‘conversational geometry’. Our central finding is that a reward model trained only on these structural signals achieves a pairwise accuracy (68.20…

Similar Posts

Loading similar posts...