Robust prioritization of genomic features with stability selection (opens in new tab)
AbstractMotivationThe heterogeneity of complex diseases including cancer leads to heavy-tailed distributions in the disease traits. In such settings, non-robust variable selection methods are inherently susceptible to data contamination and can yield unstable or misleading results. This vulnerability becomes more severe for recently proposed approaches that introduce pseudo-features as negative controls, as these methods further amplify the curse of dimensionality by expanding the genotype ma...
Read the original article