Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Back to article
cameronrwolfe.substack.com
4w
4 weeks ago
Agent Evaluation: A Detailed Guide
(opens in new tab)
Covers
7 stories
See all stories this covers
including
MCP is an open protocol that standardizes how apps provide context to LLMs
Covered by
tldr.tech
Discussed on
Substack
Love
Like
Not for me
Save
|
|
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Feeds
Deep (Learning) Focus
cameronrwolfe.substack.com
I contextualize and explain important topics in AI research.
Agent Evaluation: A Detailed Guide
4w
4 weeks ago
RL Scaling Laws for LLMs
8w
8 weeks ago
The Anatomy of an LLM Benchmark
11w
11 weeks ago
TLDR FEED Feed
bullrich.dev
World Cup Crypto Fraud Wave: Why Betting Markets Need Better Fan-Safety UX (9 minute read)
2d
2 days ago
Agents breaking on real websites? See how Browserbase runs 35M+ sessions a month (Sponsor)
2d
2 days ago
What's an AI runtime? (Sponsor)
2d
2 days ago
+356 more in the past week
Hacker News: Newest
hnrss.org
The deskilling of web dev is damaging our health
1h
1 hour ago
There are only two file formats, txt and zip (explainer)
1h
1 hour ago
A tiny (18KB for rpi zero)easy to read file listing tool. rust no_std and Libc
1h
1 hour ago
+245 more in the past day
Hacker News: Newest
hnrss.org
"Boing " in 64 Bytes
21m
21 minutes ago
Show HN: An AI video prompt cookbook for image-to-video workflows
27m
27 minutes ago
Show HN: AdvertBench, ranking the ability of LLMs to create image ads
30m
30 minutes ago
+368 more in the past day
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report