Large Language Models for Software Engineering: A Reproducibility Crisis
arxiv.org·1w

Abstract: Reproducibility is a cornerstone of scientific progress, yet its state in large language model (LLM)-based software engineering (SE) research remains poorly understood. This paper presents the first large-scale, empirical study of reproducibility practices in LLM-for-SE research. We systematically mined and analyzed 640 papers published between 2017 and 2025 across premier software engineering, machine learning, and natural language processing venues, extracting structured metadata from publications, repositories, and documentation. Guided by four research questions, we examine (i) the prevalence of reproducibility smells, (ii) how reproducibility has evolved ove…