Exploring Protein Language Model Architecture-Induced Biases for Antibody Comprehension

View PDF HTML (experimental)

Abstract:Recent advances in protein language models (PLMs) have demonstrated remarkable capabilities in understanding protein sequences. However, the extent to which different model architectures capture antibody-specific biological properties remains unexplored. In this work, we systematically investigate how architectural choices in PLMs influence their ability to comprehend antibody sequence characteristics and functions. We evaluate three state-of-the-art PLMs-AntiBERTa, BioBERT, and ESM2–against a general-purpose language model (GPT-2) baseline on antibody target specificity prediction tasks. Our results demonstrate that while all PLMs achieve high classification accur…

View PDF HTML (experimental)

Abstract:Recent advances in protein language models (PLMs) have demonstrated remarkable capabilities in understanding protein sequences. However, the extent to which different model architectures capture antibody-specific biological properties remains unexplored. In this work, we systematically investigate how architectural choices in PLMs influence their ability to comprehend antibody sequence characteristics and functions. We evaluate three state-of-the-art PLMs-AntiBERTa, BioBERT, and ESM2–against a general-purpose language model (GPT-2) baseline on antibody target specificity prediction tasks. Our results demonstrate that while all PLMs achieve high classification accuracy, they exhibit distinct biases in capturing biological features such as V gene usage, somatic hypermutation patterns, and isotype information. Through attention attribution analysis, we show that antibody-specific models like AntiBERTa naturally learn to focus on complementarity-determining regions (CDRs), while general protein models benefit significantly from explicit CDR-focused training strategies. These findings provide insights into the relationship between model architecture and biological feature extraction, offering valuable guidance for future PLM development in computational antibody design.


Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2512.09894 [cs.LG]
	(or arXiv:2512.09894v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2512.09894 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Mengren Liu [view email] [v1] Wed, 10 Dec 2025 18:22:51 UTC (6,387 KB)

Submission history

Similar Posts