Poster
in
Workshop: Representational Alignment

From Interaction to Abstraction: Behavior and Neuroimaging Evidence That Reasoning Models Learn Games Like Humans

Botos Csaba ⋅ Sreejan Kumar ⋅ Austin Andrews ⋅ Rui Ponte Costa ⋅ Marcelo G Mattar ⋅ Momchil Tomov

Project Page [ OpenReview]

Abstract

Humans rapidly learn abstract knowledge when encountering novel environments and flexibly deploy this knowledge to guide efficient and intelligent action. Can modern AI systems learn and plan in a similar way? Leveraging a unique dataset of human gameplay with concurrent fMRI recordings, we evaluate model-free reinforcement learning agents, bayes-optimal model-based agents, and a frontier Large Reasoning Model, on two complementary dimensions: behavioral patterns and predictivity of human brain representations. Using encoding models, we assess how well each system’s internal representations predict brain activity in regions previously implicated in theory-based reasoning. We find that the Large Reasoning Model most closely matches human behavioral patterns during game discovery and predicts brain activity in theory-coding regions an order of magnitude better than both model-free and model-based alternatives. Our results shed light on the computational principles underlying human-like rapid learning and planning.

Chat is not available.