From Memorization to Creativity: LLM as a Designer of Novel Neural-Architectures

View PDF HTML (experimental)

Abstract:Large language models (LLMs) excel in program synthesis, yet their ability to autonomously navigate neural architecture design–balancing syntactic reliability, performance, and structural novelty–remains underexplored. We address this by placing a code-oriented LLM within a closed-loop synthesis framework, analyzing its evolution over 22 supervised fine-tuning cycles. The model synthesizes PyTorch convolutional networks which are validated, evaluated via low-fidelity performance signals (single-epoch accuracy), and filtered using a MinHash-Jaccard criterion to prevent structural redundancy. High-performing, novel architectures are converted into prompt-code pairs f…

View PDF HTML (experimental)

Abstract:Large language models (LLMs) excel in program synthesis, yet their ability to autonomously navigate neural architecture design–balancing syntactic reliability, performance, and structural novelty–remains underexplored. We address this by placing a code-oriented LLM within a closed-loop synthesis framework, analyzing its evolution over 22 supervised fine-tuning cycles. The model synthesizes PyTorch convolutional networks which are validated, evaluated via low-fidelity performance signals (single-epoch accuracy), and filtered using a MinHash-Jaccard criterion to prevent structural redundancy. High-performing, novel architectures are converted into prompt-code pairs for iterative fine-tuning via parameter-efficient LoRA adaptation, initialized from the LEMUR dataset. Across cycles, the LLM internalizes empirical architectural priors, becoming a robust generator. The valid generation rate stabilizes at 50.6 percent (peaking at 74.5 percent), while mean first-epoch accuracy rises from 28.06 percent to 50.99 percent, and the fraction of candidates exceeding 40 percent accuracy grows from 2.04 percent to 96.81 percent. Analyses confirm the model moves beyond replicating existing motifs, synthesizing 455 high-performing architectures absent from the original corpus. By grounding code synthesis in execution feedback, this work provides a scalable blueprint for transforming stochastic generators into autonomous, performance-driven neural designers, establishing that LLMs can internalize empirical, non-textual rewards to transcend their training data.


Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2601.02997 [cs.LG]
	(or arXiv:2601.02997v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2601.02997 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Waleed Khalid [view email] [v1] Tue, 6 Jan 2026 13:20:28 UTC (2,073 KB)

Submission history

Similar Posts