Grokked Models are Better Unlearners

Title:Grokked Models are Better Unlearners

Abstract:Grokking-delayed generalization that emerges well after a model has fit the training data-has been linked to robustness and representation quality. We ask whether this training regime also helps with machine unlearning, i.e., removing the influence of specified data without full retraining. We compare applying standard unlearning methods before versus after the grokking transition across vision (CNNs/ResNets on CIFAR, SVHN, and ImageNet) and language (a transformer on a TOFU-style setup). Starting from grokked checkpoints consistently yields (i) more efficient forgetting (fewer updates to reach a target forget level…

Title:Grokked Models are Better Unlearners

View PDF HTML (experimental)

Abstract:Grokking-delayed generalization that emerges well after a model has fit the training data-has been linked to robustness and representation quality. We ask whether this training regime also helps with machine unlearning, i.e., removing the influence of specified data without full retraining. We compare applying standard unlearning methods before versus after the grokking transition across vision (CNNs/ResNets on CIFAR, SVHN, and ImageNet) and language (a transformer on a TOFU-style setup). Starting from grokked checkpoints consistently yields (i) more efficient forgetting (fewer updates to reach a target forget level), (ii) less collateral damage (smaller drops on retained and test performance), and (iii) more stable updates across seeds, relative to early-stopped counterparts under identical unlearning algorithms. Analyses of features and curvature further suggest that post-grokking models learn more modular representations with reduced gradient alignment between forget and retain subsets, which facilitates selective forgetting. Our results highlight when a model is trained (pre- vs. post-grokking) as an orthogonal lever to how unlearning is performed, providing a practical recipe to improve existing unlearning methods without altering their algorithms.


Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2512.03437 [cs.LG]
	(or arXiv:2512.03437v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2512.03437 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Yang Li [view email] [v1] Wed, 3 Dec 2025 04:35:49 UTC (93 KB)

Title:Grokked Models are Better Unlearners

Title:Grokked Models are Better Unlearners

Submission history

Similar Posts