How good a detective is an AI? A Sherlock Holmes board game as an LLM-agent eval (opens in new tab)
A Sherlock Holmes board game as an LLM-agent eval
Read the original articleA Sherlock Holmes board game as an LLM-agent eval
Read the original article