mlfrontiers.substack.com

The LLM Judge Controversy (opens in new tab)

Discussed on Substack

Should or shouldn't we use LLMs for quality evaluation of production ML systems?

Read the original article

Sign in to keep reading the full article.