Shortest Path, Network Analysis, Graph Traversal, Social Networks
Research Areas in Benchmark Design and Evaluation (The Alignment Project by UK AISI)
lesswrong.com·2d
Research Areas in Evaluation and Guarantees in Reinforcement Learning (The Alignment Project by UK AISI)
lesswrong.com·2d
Exploring molecular assembly as a biosignature using mass spectrometry and machine learning
arxiv.org·6d
Uncertain Updates: July 2025
lesswrong.com·4d
RePaCA: Leveraging Reasoning Large Language Models for Static Automated Patch Correctness Assessment
arxiv.org·3d
LLM-Crowdsourced: A Benchmark-Free Paradigm for Mutual Evaluation of Large Language Models
arxiv.org·3d
Chain-of-Cooking:Cooking Process Visualization via Bidirectional Chain-of-Thought Guidance
arxiv.org·4d
Loading...Loading more...