The Unseen Hand: Manipulating Model Fairness and SHAP with Targeted Identity Re-Association Attacks (opens in new tab)

As machine learning models grow more influential and opaque, algorithmic fairness and explainability are critical for ensuring accountability. However, we demonstrate that these auditing mechanisms are themselves vulnerable to subtle manipulation, camouflaging the influence of protected features. While prior work on data-agnostic attacks has exposed this vulnerability, they leave behind detectable artifacts that compromise their stealth. We in...

Read the original article