Domain Theory, Fixed Points, Continuations, Program Equivalence
Revisiting Landmarks: Learning from Previous Plans to Generalize over Problem Instances
arxiv.orgยท1d
Understanding LLMs: Insights from Mechanistic Interpretability
lesswrong.comยท2d
Time's arrow => decision theory
lesswrong.comยท3h
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
arxiv.orgยท1d
Loading...Loading more...