Metacognitive Sensitivity for Test-Time Dynamic Model Selection

Title:Metacognitive Sensitivity for Test-Time Dynamic Model Selection

Abstract:A key aspect of human cognition is metacognition - the ability to assess one’s own knowledge and judgment reliability. While deep learning models can express confidence in their predictions, they often suffer from poor calibration, a cognitive bias where expressed confidence does not reflect true competence. Do models truly know what they know? Drawing from human cognitive science, we propose a new framework for evaluating and leveraging AI metacognition. We introduce meta-d’, a psychologically-grounded measure of metacognitive sensitivity, to characterise how reliably a model’s confidence predicts…

Title:Metacognitive Sensitivity for Test-Time Dynamic Model Selection

View PDF HTML (experimental)

Abstract:A key aspect of human cognition is metacognition - the ability to assess one’s own knowledge and judgment reliability. While deep learning models can express confidence in their predictions, they often suffer from poor calibration, a cognitive bias where expressed confidence does not reflect true competence. Do models truly know what they know? Drawing from human cognitive science, we propose a new framework for evaluating and leveraging AI metacognition. We introduce meta-d’, a psychologically-grounded measure of metacognitive sensitivity, to characterise how reliably a model’s confidence predicts its own accuracy. We then use this dynamic sensitivity score as context for a bandit-based arbiter that performs test-time model selection, learning which of several expert models to trust for a given task. Our experiments across multiple datasets and deep learning model combinations (including CNNs and VLMs) demonstrate that this metacognitive approach improves joint-inference accuracy over constituent models. This work provides a novel behavioural account of AI models, recasting ensemble selection as a problem of evaluating both short-term signals (confidence prediction scores) and medium-term traits (metacognitive sensitivity).


Comments:	Accepted at the NeurIPS 2025 CogInterp Workshop
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2512.10451 [cs.LG]
	(or arXiv:2512.10451v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2512.10451 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Le Tuan Minh Trinh [view email] [v1] Thu, 11 Dec 2025 09:15:05 UTC (162 KB)

Title:Metacognitive Sensitivity for Test-Time Dynamic Model Selection

Title:Metacognitive Sensitivity for Test-Time Dynamic Model Selection

Submission history

Similar Posts