Observer, Not Player: Simulating Theory of Mind in LLMs through Game Observation
arxiv.org·4d
📐Language Theory


Abstract: We present an interactive framework for evaluating whether large language models (LLMs) exhibit genuine "understanding" in a simple yet strategic environment. As a running example, we focus on Rock-Paper-Scissors (RPS), which, despite its apparent simplicity, requires sequential reasoning, adaptation, and strategy recognition. Our system positions the LLM as an Observer whose task is to identify which strategies are being played and to articulate the reasoning behind this judgment. The purpose is not to test knowledge of Rock-Paper-Scissors itself, but to probe whether the model can exhibit mind-like reasoning about sequential behavior. To support systematic evalu…
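
To make the Observer task concrete, here is a minimal Python sketch of a non-LLM baseline: given a log of rounds, it scores how well each candidate strategy explains the observed player's moves and reports the best match. This is not the paper's framework. The strategy set (`always_rock`, `cycle`, `copy_opponent`, `counter_last`), the `History` format, and all function names are assumptions made up for illustration; the abstract does not say which strategies or scoring the authors use.

```python
from typing import Callable, Dict, List, Tuple

MOVES = ("R", "P", "S")
BEATS = {"R": "S", "P": "R", "S": "P"}       # key beats value
COUNTER = {v: k for k, v in BEATS.items()}   # COUNTER[m] is the move that beats m

# One entry per round: (observed player's move, opponent's move).
History = List[Tuple[str, str]]
Strategy = Callable[[History], str]


# Hypothetical candidate strategies; each maps the history so far to a move.
def always_rock(history: History) -> str:
    return "R"


def cycle(history: History) -> str:
    return MOVES[len(history) % 3]


def copy_opponent(history: History) -> str:
    return history[-1][1] if history else "R"


def counter_last(history: History) -> str:
    return COUNTER[history[-1][1]] if history else "R"


STRATEGIES: Dict[str, Strategy] = {
    "always_rock": always_rock,
    "cycle": cycle,
    "copy_opponent": copy_opponent,
    "counter_last": counter_last,
}


def simulate(strategy: Strategy, opponent_moves: List[str]) -> History:
    """Play `strategy` against a fixed list of opponent moves."""
    history: History = []
    for opp in opponent_moves:
        history.append((strategy(history), opp))
    return history


def score_strategies(rounds: History) -> Dict[str, float]:
    """Fraction of rounds in which each candidate predicts the observed move."""
    scores: Dict[str, float] = {}
    for name, strategy in STRATEGIES.items():
        hits = sum(strategy(rounds[:i]) == rounds[i][0] for i in range(len(rounds)))
        scores[name] = hits / len(rounds) if rounds else 0.0
    return scores


def identify(rounds: History) -> str:
    """Return the name of the candidate that best explains the observed moves."""
    scores = score_strategies(rounds)
    return max(scores, key=scores.get)


if __name__ == "__main__":
    observed = simulate(copy_opponent, list("PSSRPR"))
    print(score_strategies(observed))
    print(identify(observed))
```

A baseline like this only matches moves against a fixed candidate list. The Observer described in the abstract must additionally articulate the reasoning behind its judgment, which a matching score alone does not capture.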
