Our evaluation of Claude Mythos Preview’s cyber capabilities
We conducted cyber evaluations of Anthropic’s Claude Mythos Preview and found continued improvement on capture-the-flag (CTF) challenges and significant improvement on multi-step cyber-attack simulations.