Read Between the Lines: A Benchmark for Uncovering Political Bias in Bangla News Articles
arxiv.orgยท1d
Pathology-CoT: Learning Visual Chain-of-Thought Agent from Expert Whole Slide Image Diagnosis Behavior
arxiv.orgยท1d
Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents
arxiv.orgยท2d
Small Language Models for Agentic Systems: A Survey of Architectures, Capabilities, and Deployment Trade offs
arxiv.orgยท1d
Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling
arxiv.orgยท7h
From Neural Activity to Computation: Biological Reservoirs for Pattern Recognition in Digit Classification
arxiv.orgยท7h
Training Dynamics of Parametric and In-Context Knowledge Utilization in Language Models
arxiv.orgยท2d
Loading...Loading more...