A bitter lesson for medicine, or a benchmark problem? (opens in new tab)
a nature medicine paper says general llms beat specialized clinical tools: a close read of what its main benchmark actually measured, and what it left out.
Read the original article