Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📊 Model Evaluation
Benchmarking, Performance Metrics, A/B Testing, Quality Assessment
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
81907
posts in
1.08
s
Testing 80 LLMs on
spatial
reasoning on
grids
mihai.page
·
23h
·
Discuss:
Hacker News
🤖
AI Agent
Statistical-Based
Metric
Threshold
Setting Method for Software Fault Prediction in Firmware Projects: An Industrial Experience
arxiv.org
·
18h
🔧
Functional Programming
AI is a High Pass
Filter
for Software
Delivery
bryanfinster.substack.com
·
7m
·
Discuss:
Substack
🤖
AI Agent
Guide: Getting started with
choosing
a Machine Learning CLIP Model for Smart Search ·
immich-app/immich
github.com
·
2m
🤖
AI Agent
Testing software in the era of coding agents
garymm.org
·
8h
·
Discuss:
Hacker News
🤖
AI Agent
Measuring
Model
Overconfidence
: When AI Thinks It Knows
dev.to
·
1d
·
Discuss:
DEV
🤖
AI Agent
Custom AI Tool Development in
Regulated
Industries: Why
Off-The-Shelf
LLM Solutions Fall Short
analyticsvidhya.com
·
11h
🤖
AI Agent
Show HN:
C-CMCP
–
Validated
AI development workflow with quality gates
news.ycombinator.com
·
7h
·
Discuss:
Hacker News
🤖
AI Agent
System
tests
keygen.sh
·
17h
🤖
AI Agent
Study: Platforms that
rank
the latest LLMs can be
unreliable
news.mit.edu
·
18h
🤖
LLM
Data Modeling for the Agentic Era:
Semantics
, Speed, and
Stewardship
rilldata.com
·
7h
·
Discuss:
Hacker News
🤖
AI Agent
A
practical
systems engineering guide:
Architecting
AI-ready infrastructure for the agentic era
thenewstack.io
·
1h
🤖
AI Agent
Manufacturing
QMS
Software
samrian.com
·
8h
·
Discuss:
Hacker News
🤖
AI Agent
Performance
Tip
of the Week #94: Decision making in a
data-imperfect
world
abseil.io
·
2d
🤖
AI Agent
The Potential of
RLMs
dbreunig.com
·
6h
🤖
AI Agent
Reducing
Technical
Debt: Top Five Coding Resources
loufranco.com
·
8h
🔧
Functional Programming
Import AI 444: LLM
societies
; Huawei makes kernels with AI;
ChipBench
importai.substack.com
·
9h
·
Discuss:
Substack
🤖
AI Agent
How to Design LLM
Applications
for Production: A System Design Guide
dev.to
·
18h
·
Discuss:
DEV
🤖
LLM
Evaluating and Enhancing the
Vulnerability
Reasoning
Capabilities
of Large Language Models
arxiv.org
·
18h
🤖
AI Agent
Why
Spec-Driven
Development
Breaks
at Scale (and How to Fix It)
arcturus-labs.com
·
1h
·
Discuss:
Hacker News
🤖
AI Agent
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help