SORTeD Rashomon Sets of Sparse Decision Trees: Anytime Enumeration

View PDF HTML (experimental)

Abstract:Sparse decision tree learning provides accurate and interpretable predictive models that are ideal for high-stakes applications by finding the single most accurate tree within a (soft) size limit. Rather than relying on a single “best” tree, Rashomon sets-trees with similar performance but varying structures-can be used to enhance variable importance analysis, enrich explanations, and enable users to choose simpler trees or those that satisfy stakeholder preferences (e.g., fairness) without hard-coding such criteria into the objective function. However, because finding the optimal tree is NP-hard, enumerating the Rashomon set is inherently challenging. Therefore, w…

View PDF HTML (experimental)

Abstract:Sparse decision tree learning provides accurate and interpretable predictive models that are ideal for high-stakes applications by finding the single most accurate tree within a (soft) size limit. Rather than relying on a single “best” tree, Rashomon sets-trees with similar performance but varying structures-can be used to enhance variable importance analysis, enrich explanations, and enable users to choose simpler trees or those that satisfy stakeholder preferences (e.g., fairness) without hard-coding such criteria into the objective function. However, because finding the optimal tree is NP-hard, enumerating the Rashomon set is inherently challenging. Therefore, we introduce SORTD, a novel framework that improves scalability and enumerates trees in the Rashomon set in order of the objective value, thus offering anytime behavior. Our experiments show that SORTD reduces runtime by up to two orders of magnitude compared with the state of the art. Moreover, SORTD can compute Rashomon sets for any separable and totally ordered objective and supports post-evaluating the set using other separable (and partially ordered) objectives. Together, these advances make exploring Rashomon sets more practical in real-world applications.


Comments:	32 pages, 10 figures, to be published in the proceedings of The Thirty-Ninth Annual Conference on Neural Information Processing Systems
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2511.03344 [cs.LG]
	(or arXiv:2511.03344v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2511.03344 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Elif Arslan [view email] [v1] Wed, 5 Nov 2025 10:25:08 UTC (1,682 KB)

Submission history

Similar Posts