Grokked Models are Better Unlearners
arxiv.org·6d
💻Local LLMs
Preview
Report Post

Title:Grokked Models are Better Unlearners

View PDF HTML (experimental)

Abstract:Grokking-delayed generalization that emerges well after a model has fit the training data-has been linked to robustness and representation quality. We ask whether this training regime also helps with machine unlearning, i.e., removing the influence of specified data without full retraining. We compare applying standard unlearning methods before versus after the grokking transition across vision (CNNs/ResNets on CIFAR, SVHN, and ImageNet) and language (a transformer on a TOFU-style setup). Starting from grokked checkpoints consistently yields (i) more efficient forgetting (fewer updates to reach a target forget level…

Similar Posts

Loading similar posts...