Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📊 Model Evaluation
Benchmarking, Performance Metrics, A/B Testing, Quality Assessment
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
81480
posts in
740.6
ms
Testing 80 LLMs on
spatial
reasoning on
grids
mihai.page
·
18h
·
Discuss:
Hacker News
🤖
AI Agent
Custom AI Tool Development in
Regulated
Industries: Why
Off-The-Shelf
LLM Solutions Fall Short
analyticsvidhya.com
·
5h
🤖
AI Agent
Testing software in the era of coding agents
garymm.org
·
3h
·
Discuss:
Hacker News
🤖
AI Agent
Evaluating and Enhancing the
Vulnerability
Reasoning
Capabilities
of Large Language Models
arxiv.org
·
13h
🤖
AI Agent
Measuring
Model
Overconfidence
: When AI Thinks It Knows
dev.to
·
1d
·
Discuss:
DEV
🤖
AI Agent
Performance
Tip
of the Week #94: Decision making in a
data-imperfect
world
abseil.io
·
1d
🤖
AI Agent
Show HN:
C-CMCP
–
Validated
AI development workflow with quality gates
news.ycombinator.com
·
2h
·
Discuss:
Hacker News
🤖
AI Agent
System
tests
keygen.sh
·
12h
🤖
AI Agent
Data Modeling for the Agentic Era:
Semantics
, Speed, and
Stewardship
rilldata.com
·
1h
·
Discuss:
Hacker News
🤖
AI Agent
Manufacturing
QMS
Software
samrian.com
·
2h
·
Discuss:
Hacker News
🤖
AI Agent
Study: Platforms that
rank
the latest LLMs can be
unreliable
news.mit.edu
·
13h
🤖
LLM
Reducing
Technical
Debt: Top Five Coding Resources
loufranco.com
·
3h
🔧
Functional Programming
Import AI 444: LLM
societies
; Huawei makes kernels with AI;
ChipBench
importai.substack.com
·
4h
·
Discuss:
Substack
🤖
AI Agent
How to Design LLM
Applications
for Production: A System Design Guide
dev.to
·
13h
·
Discuss:
DEV
🤖
LLM
How AI coding makes
developers
56% faster and 19%
slower
thenewstack.io
·
6h
🤖
AI Agent
What Is
Exploratory
Data Analysis (
EDA
)?
slubowisko.pl
·
10h
🔧
Functional Programming
Building LLMs in
Resource-Constrained
Environments
: A Hands-On Perspective
infoq.com
·
6h
🤖
AI Agent
**Abstract:** This paper introduces a novel framework for Automated Calibration Uncertainty
Propagation
Modeling (
ACUPM
), addressing a critical bottleneck in...
freederia.com
·
3d
🤖
LLM
Compound
Engineering: The
Definitive
Guide
kill-the-newsletter.com
·
2h
🤖
AI Agent
Main
Content ||
Math
∩ Programming
jeremykun.com
·
19h
🔧
Functional Programming
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help