Post-Training Recipe, More Than Model Family, Shapes Multi-Agent LLM Conversational Behavior (opens in new tab)
Multi-LLM systems use multiple language models to deliberate, judge each other's outputs, or coordinate as agents. Their value depends on the models producing measurably different conversational behaviors when given the same input. Prior offline studies recommend drawing one model per family for behavioral diversity, because LLMs prefer outputs from their own family when rating one another in isolation. Whether the same family label predicts b...
Read the original article