SPARTAN: A Sparse Transformer World Model Attending to What Matters
arxiv.org·10h
🧮Kolmogorov Bounds
Preview
Report Post

View PDF HTML (experimental)

Abstract:Capturing the interactions between entities in a structured way plays a central role in world models that flexibly adapt to changes in the environment. Recent works motivate the benefits of models that explicitly represent the structure of interactions and formulate the problem as discovering local causal structures. In this work, we demonstrate that reliably capturing these relationships in complex settings remains challenging. To remedy this shortcoming, we postulate that sparsity is a critical ingredient for the discovery of such local structures. To this end, we present the SPARse TrANsformer World model (SPARTAN), a Transformer-based world model that learns conte…

Similar Posts

Loading similar posts...