BioMysteryBench, our new bioinformatics eval, tests whether Claude can devise creative solutions to open-ended research problems. (opens in new tab)
BioMysteryBench, our new bioinformatics eval, tests whether Claude can devise creative solutions to open-ended research problems. Read more: anthropic.com/research/Evalu…
Read the original article