Integrity Verification, Hash Trees, Fixity Checking, Data Validation
Say One Thing, Do Another? Diagnosing Reasoning-Execution Gaps in VLM-Powered Mobile-Use Agents
arxiv.org·1d
BloomAPR: A Bloom's Taxonomy-based Framework for Assessing the Capabilities of LLM-Powered APR Solutions
arxiv.org·3d