Synthesizing File-Level Data for Unit Test Generation with Chain-of-Thoughts via Self-Debugging

arXiv:2602.03181v1 Announce Type: new Abstract: Automatic unit test (UT) generation is essential for software quality assurance, but existing approaches–including symbolic execution, search-based approaches, and recent LLM-based generators–struggle to produce human-quality tests with correct, meaningful assertions and reliable chain-of-thought (CoT) explanations. We identify a gap in UT training data: repository-mined tests lack developer CoTs, while LLM-distilled CoTs are often incorrect or incomplete. To address this issue, we propose a novel data-distillation approach that uses self-debugging to produce high-quality UT training examples paired with faithful CoTs. Our approach combines (1) guided test repair, a heuristic loop (error-, failure-, and coverage-focused steps) that asks the…

Similar Posts