Title:Unsupervised Feature Selection Through Group Discovery
Abstract:Unsupervised feature selection (FS) is essential for high-dimensional learning tasks where labels are not available. It helps reduce noise, improve generalization, and enhance interpretability. However, most existing unsupervised FS methods evaluate features in isolation, even though informative signals often emerge from groups of related features. For example, adjacent pixels, functionally connected brain regions, or correlated financial indicators tend to act together, making independent evaluation suboptimal. Although some methods attempt to capture group structure, they typically rely on predefined p…
Title:Unsupervised Feature Selection Through Group Discovery
Abstract:Unsupervised feature selection (FS) is essential for high-dimensional learning tasks where labels are not available. It helps reduce noise, improve generalization, and enhance interpretability. However, most existing unsupervised FS methods evaluate features in isolation, even though informative signals often emerge from groups of related features. For example, adjacent pixels, functionally connected brain regions, or correlated financial indicators tend to act together, making independent evaluation suboptimal. Although some methods attempt to capture group structure, they typically rely on predefined partitions or label supervision, limiting their applicability. We propose GroupFS, an end-to-end, fully differentiable framework that jointly discovers latent feature groups and selects the most informative groups among them, without relying on fixed a priori groups or label supervision. GroupFS enforces Laplacian smoothness on both feature and sample graphs and applies a group sparsity regularizer to learn a compact, structured representation. Across nine benchmarks spanning images, tabular data, and biological datasets, GroupFS consistently outperforms state-of-the-art unsupervised FS in clustering and selects groups of features that align with meaningful patterns.
| Comments: | Accepted to AAAI 2026 |
| Subjects: | Machine Learning (cs.LG) |
| Cite as: | arXiv:2511.09166 [cs.LG] |
| (or arXiv:2511.09166v1 [cs.LG] for this version) | |
| https://doi.org/10.48550/arXiv.2511.09166 arXiv-issued DOI via DataCite (pending registration) |
Submission history
From: Shira Lifshitz [view email] [v1] Wed, 12 Nov 2025 10:05:03 UTC (5,612 KB)