DRACO: a Cross-Domain Benchmark for Deep Research Accuracy, Completeness, and Objectivity (opens in new tab)
arXiv:2602.11685v1 Announce Type: cross Abstract: We present DRACO (Deep Research Accuracy, Completeness, and Objectivity), a benchmark of complex deep research tasks. These tasks, which span 10 domains and draw on information sources from 40 countries, originate from anonymized real-world usage patterns within a large-scale deep research system. Tasks are sampled from a de-identified dataset of Perplexity Deep Research requests, then filtered and augmented to ensure that the tasks are anony...
Read the original article