MALBO: Optimizing LLM-Based Multi-Agent Teams via Multi-Objective Bayesian Optimization

View PDF HTML (experimental)

Abstract:The optimal assignment of Large Language Models (LLMs) to specialized roles in multi-agent systems is a significant challenge, defined by a vast combinatorial search space, expensive black-box evaluations, and an inherent trade-off between performance and cost. Current optimization methods focus on single-agent settings and lack a principled framework for this multi-agent, multi-objective problem. This thesis introduces MALBO (Multi-Agent LLM Bayesian Optimization), a systematic framework designed to automate the efficient composition of LLM-based agent teams. We formalize the assignment challenge as a multi-objective optimization problem, aiming to identify the Pa…

View PDF HTML (experimental)

Abstract:The optimal assignment of Large Language Models (LLMs) to specialized roles in multi-agent systems is a significant challenge, defined by a vast combinatorial search space, expensive black-box evaluations, and an inherent trade-off between performance and cost. Current optimization methods focus on single-agent settings and lack a principled framework for this multi-agent, multi-objective problem. This thesis introduces MALBO (Multi-Agent LLM Bayesian Optimization), a systematic framework designed to automate the efficient composition of LLM-based agent teams. We formalize the assignment challenge as a multi-objective optimization problem, aiming to identify the Pareto front of configurations between task accuracy and inference cost. The methodology employs multi-objective Bayesian Optimization (MOBO) with independent Gaussian Process surrogate models. By searching over a continuous feature-space representation of the LLMs, this approach performs a sample-efficient exploration guided by the expected hypervolume improvement. The primary contribution is a principled and automated methodology that yields a Pareto front of optimal team configurations. Our results demonstrate that the Bayesian optimization phase, compared to an initial random search, maintained a comparable average performance while reducing the average configuration cost by over 45%. Furthermore, MALBO identified specialized, heterogeneous teams that achieve cost reductions of up to 65.8% compared to homogeneous baselines, all while maintaining maximum performance. The framework thus provides a data-driven tool for deploying cost-effective and highly specialized multi-agent AI systems.


Comments:	Master’s Thesis, University of Milano-Bicocca, 2025
Subjects:	Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2511.11788 [cs.MA]
	(or arXiv:2511.11788v1 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.2511.11788 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Antonio Sabbatella Mr [view email] [v1] Fri, 14 Nov 2025 18:01:08 UTC (12,808 KB)

Submission history

Similar Posts