Representation Invariance and Allocation: When Subgroup Balance Matters
arxiv.org·4d

Abstract: Unequal representation of demographic groups in training data poses challenges to model generalisation across populations. Standard practice assumes that balancing subgroup representation optimises performance. However, recent empirical results contradict this assumption: in some cases, imbalanced data distributions actually improve subgroup performance, while in others, subgroup performance remains unaffected by the absence of an entire subgroup during training. We conduct a systematic study of subgroup allocation across four vision and language models, varying training data composition to…
