Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
⚠️ AI Safety
Alignment, AI Risk, Existential Risk, AI Governance
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
58
posts in
14.9
ms
Apply now to
Human-Aligned
AI
Summer School 2026
✍️
Prompt Engineering
lesswrong.com
·
19h
BiomniBench: Process-level Evaluation of LLM Agents for Real-world Biomedical
Research
📊
LLM Evaluation
biorxiv.org
·
5d
Enhancing the Code Reasoning Capabilities of LLMs via Consistency-based Reinforcement Learning
📊
LLM Evaluation
arxiv.org
·
2d
InferenceBench: A Benchmark for
Open-Ended
Inference
Optimization
by
AI
Agents
📊
LLM Evaluation
inferencebench.ai
·
4h
·
Hacker News
Training a small model to write better OCaml with RLVR and GRPO
📊
LLM Evaluation
blog.nilenso.com
·
9h
·
Hacker News
Cursor’s Composer 2.5 Brings Smarter, More Reliable
AI
Coding Agents
🤖
Agentic AI
devops.com
·
1d
Risk
reports need to address deployment-time spread of misalignment
🛡️
AI Security
alignmentforum.org
·
5d
How much should we worry about secretly loyal AIs?
🛡️
AI Security
the-substrate.net
·
12h
·
Hacker News
Frontier
Risk
Report (February to March 2026)
📊
LLM Evaluation
metr.org
·
1d
Cursor bets on cheaper coding with Composer 2.5 and Kimi K2.5
🛠️
Developer Tools
thenewstack.io
·
17h
Will It Come to This?
🤖
Agentic AI
connectedworld.com
·
3d
[AINews] How to land a job at a frontier lab (on Pretraining)
🤝
AI Agents
latent.space
·
1d
Cursor launches Composer 2.5 model for long-running
AI
coding tasks at cheaper token cost
🛠️
Developer Tools
indianexpress.com
·
1d
What can
AI
teach us about ‘emotions’?
🤖
Agentic AI
thetransmitter.org
·
3d
Introducing Composer 2.5
🛠️
Developer Tools
cursor.com
·
2d
·
Hacker News
Fixing LLM Writing with Distribution Fine Tuning
📊
LLM Evaluation
rosmine.ai
·
2d
·
Hacker News
Human Observations on Mythos Runs
🦠
Malware Analysis
exploitbench.ai
·
5d
Truly an all-star cast, on one of the most important questions in
AI
.
📱
Edge AI
twitter.macworks.dev
·
4d
The Case for Evaluating Model Behaviors
📊
LLM Evaluation
lesswrong.com
·
9h
Agentic Workflows for Alpha
Research
[Jonathan Kinlay]
🤖
Agentic AI
jonathankinlay.com
·
3d
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help