Detecting Distillation Data from Reasoning Models
arxiv.org·23h

Title:Detecting Distillation Data from Reasoning Models

View PDF HTML (experimental)

Abstract:Reasoning distillation has emerged as an efficient and powerful paradigm for enhancing the reasoning capabilities of large language models. However, reasoning distillation may inadvertently cause benchmark contamination, where evaluation data included in distillation datasets can inflate performance metrics of distilled models. In this work, we formally define the task of distillation data detection, which is uniquely challenging due to the partial availability of distillation data. Then, we propose a novel and effective method Token Probability Deviation (TBD), which leverages the probability patterns of the …

Similar Posts

Loading similar posts...