Complexity Analysis, Algorithm Verification, Formal Bounds, Optimization Theory
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization
arxiv.org·23h
Loading...Loading more...
Complexity Analysis, Algorithm Verification, Formal Bounds, Optimization Theory