The LLM Judge Controversy (opens in new tab)
Should or shouldn't we use LLMs for quality evaluation of production ML systems?
Read the original articleShould or shouldn't we use LLMs for quality evaluation of production ML systems?
Read the original article