Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📊 Model Evaluation
Benchmarking, Performance Metrics, A/B Testing, Quality Assessment
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112322
posts in
403.7
ms
From 97% Model Accuracy to 74% Clinical Reliability: Building
RSN-NNSL-GATE-001
dev.to
·
16h
·
Discuss:
DEV
🤖
AI Agent
Feedback
Control for Computer Systems
janert.org
·
1d
🤖
AI Agent
Software AI
Platforms
trendhunter.com
·
10h
🤖
AI Agent
Show HN: We
achieved
72.2% issue resolution on
SWE-bench
Verified using AI teams
agyn.io
·
17h
·
Discuss:
Hacker News
🤖
AI Agent
Ahold
Delhaize: Defensive Compounder Approaching Fair Value (OTCMKTS:
ADRNY
)
seekingalpha.com
·
9h
🤖
LLM
Supercharge
Your Testing with Our
Automation
Testing Services
primeqasolutions.com
·
1d
·
Discuss:
DEV
🤖
AI Agent
I
benchmarked
4 CLI coding agents on an
NP-hard
optimization problem I solved by hand 8 years ago. One of them beat me.
charlesazam.com
·
17h
·
Discuss:
Hacker News
🤖
AI Agent
Benchmarking 8 remote browser
providers
with 250
concurrent
AI agents
research.aimultiple.com
·
1d
·
Discuss:
Hacker News
🤖
AI Agent
LLM Performance in
Astro
, React,
Tailwind
and Cloudflare
10xbench.ai
·
2d
·
Discuss:
Hacker News
🤖
LLM
AI-native software
factory
with the
Phoenix
Architecture
gist.github.com
·
5h
·
Discuss:
Hacker News
🤖
AI Agent
Breaking the
Tractability
Barrier: A Generic Low-Level Solver for
NP-Hard
Instances (N=63) on Commodity 64-Bit Silicon
zenodo.org
·
1h
·
Discuss:
r/programming
🔧
Functional Programming
Issue 638
datascienceweekly.substack.com
·
12h
·
Discuss:
Substack
🔧
Functional Programming
Omnibenchmark
: transparent, reproducible, extensible and
standardized
orchestration of solo and collaborative benchmarks
arxiv.org
·
1d
🔧
Functional Programming
SotA
ARC-AGI-2 Results with
REPL
Agents
symbolica.ai
·
23h
·
Discuss:
Hacker News
🤖
AI Agent
CodeSpeak
: Software Engineering with AI
codespeak.dev
·
12h
·
Discuss:
Lobsters
,
Hacker News
🤖
AI Agent
Task 2: Refactor
SimulationConfig
for
DSGE-HA
· Issue #15
github.com
·
20h
🤖
AI Agent
How To
Utilize
LMS
Data: Use Cases For Enhancing L&D Insights
elearningindustry.com
·
14h
🤖
LLM
anthropic
news.smol.ai
·
1d
🤖
AI Agent
Building a Production ML Inference Stack with
KServe
, vLLM, and
Karmada
dev.to
·
5h
·
Discuss:
DEV
🤖
AI Agent
Property-based
testing is about to
rule
the (software) world
tybug.dev
·
1d
·
Discuss:
Hacker News
🔧
Functional Programming
Loading...
Loading more...
« Page 1
•
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help