This Puzzle Shows Just How Far LLMs Have Progressed in a Little Over a Year
towardsdatascience.com·1d
Read Between the Lines: A Benchmark for Uncovering Political Bias in Bangla News Articles
arxiv.org·1d
Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling
arxiv.org·17h
Aligning Language Models with Clinical Expertise: DPO for Heart Failure Nursing Documentation in Critical Care
arxiv.org·17h
Pathology-CoT: Learning Visual Chain-of-Thought Agent from Expert Whole Slide Image Diagnosis Behavior
arxiv.org·1d
LLMs as Policy-Agnostic Teammates: A Case Study in Human Proxy Design for Heterogeneous Agent Teams
arxiv.org·17h
Large Language Models Achieve Gold Medal Performance at International Astronomy & Astrophysics Olympiad
arxiv.org·1d
Learning to Route: A Rule-Driven Agent Framework for Hybrid-Source Retrieval-Augmented Generation
arxiv.org·2d