Scour
LLMs
Specific
large language models, GPT, Claude, AI models
Scoured 27 posts in 13.4 ms
Principled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in Large Language Models
AI · Content type: Academic · arxiv.org · 1 day ago
Less-relevant results
Neglected Basics of AI Alignment
AI · lesswrong.com · 3 days ago
EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms
C++ · Content type: Academic · arxiv.org · 6 days ago
Sequential Data Poisoning in LLM Post-Training
AI · Content type: Academic · arxiv.org · 6 days ago
Do We Want a Superintelligent People-Pleaser?
Career Growth · lesswrong.com · 5 days ago
BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization
Distributed Systems · Content type: Academic · arxiv.org · 6 days ago
Sparse Mixture-of-Experts Reward Models Learn Interpretable and Specialized Experts for Personalized Preference Modeling
AI · Content type: Academic · arxiv.org · 6 days ago