Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐ SLAM Datasets
Specific
Benchmarking, Ground Truth, TUM RGB-D, KITTI, Evaluation Metrics
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
185684
posts in
16.5
ms
One Single Hub Text Breaks CLIP: Identifying Vulnerabilities in Cross-Modal
Encoders
via
Hubness
ย
๐๏ธ
Computer vision
arxiv.org
ยท
16h
Useless
but Safe? Benchmarking Utility Recovery with User Intent
Clarification
in Multi-Turn Conversations
ย
๐ค
llm
arxiv.org
ยท
16h
When Your LLM Reaches End-of-Life: A Framework for
Confident
Model
Migration
in Production Systems
ย
๐ค
llm
arxiv.org
ยท
16h
Beyond
Accuracy
: Benchmarking Cross-Task
Consistency
in Unified Multimodal Models
ย
๐๏ธ
Computer vision
arxiv.org
ยท
2d
From
Coarse
to Fine: Benchmarking and Reward Modeling for
Writing-Centric
Generation Tasks
ย
๐ค
llm
arxiv.org
ยท
16h
Epistemic reflections on AI answering our questions: overwatch,
erudite
,
logician
, interlocutor
ย
๐ค
llm
arxiv.org
ยท
16h
Benchmarking
Layout-Guided
Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settings
ย
๐๏ธ
Computer vision
arxiv.org
ยท
2d
CrossBench
: Generalized
Crosstalk
Benchmark Generation for Quantum Computers
ย
๐
Waveguides
arxiv.org
ยท
16h
ShapeY
: A
Principled
Framework for Measuring Shape Recognition Capacity via Nearest-Neighbor Matching
ย
๐๏ธ
Computer vision
arxiv.org
ยท
2d
HuM-Eval
: A
Coarse-to-Fine
Framework for Human-Centric Video Evaluation
ย
๐๏ธ
Computer vision
arxiv.org
ยท
2d
DV-World
: Benchmarking Data
Visualization
Agents in Real-World Scenarios
ย
๐ฑ
Triton
arxiv.org
ยท
2d
Benchmarking
Complex Multimodal Document Processing
Pipelines
: A Unified Evaluation Framework for Enterprise AI
ย
๐ฑ
Edge AI
arxiv.org
ยท
1d
Human-in-the-Loop Benchmarking of Heterogeneous LLMs for Automated
Competency
Assessment in Secondary Level
Mathematics
ย
๐ค
llm
arxiv.org
ยท
1d
Benchmarking the Safety of Large Language Models for
Robotic
Health
Attendant
Control
ย
๐ก๏ธ
Robotics Safety
arxiv.org
ยท
1d
FCMBench-Video
:
Benchmarking
Document Video Intelligence
ย
๐๏ธ
Computer vision
arxiv.org
ยท
2d
Bug-Report-Driven Fault
Localization
: Industrial Benchmarking and Lesson Learned at
ABB
Robotics
ย
๐ญ
Robotic Manufacturing
arxiv.org
ยท
2d
Benchmarking
OCR
Pipelines with Adaptive Enhancement for Multi-Domain Retail Bill
Digitization
ย
๐๏ธ
Computer vision
arxiv.org
ยท
2d
Benchmarking
and Improving
GUI
Agents in High-Dynamic Environments
ย
๐๏ธ
Isaac Gym
arxiv.org
ยท
2d
TrialCalibre
: A Fully Automated Causal Engine for
RCT
Benchmarking and Observational Trial Calibration
ย
๐ฑ
Edge AI
arxiv.org
ยท
2d
SpaMEM
: Benchmarking Dynamic Spatial Reasoning via Perception-Memory Integration in
Embodied
Environments
ย
๐๏ธ
Computer vision
arxiv.org
ยท
4d
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help