Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
🤖 LLMs
Specific
large language models, GPT, Claude, AI models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
23
posts in
6.7
ms
The Neutral Mask: How
RLHF
Provides Shallow Alignment while Leaving Partisan Structure Intact in a
Large
Language
Model
🏗️
Compilers
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
Nvidia Nemotron 3 Ultra
⚡
Performance
research.nvidia.com
·
6d
6 days ago
·
Hacker News
Actions for Nvidia Nemotron 3 Ultra
Why
Claude
Produces High-Quality Output: A Developer’s Guide to Token Efficiency and
Hallucination
…
🏗️
Compilers
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…
SLUUG Talk: Demystifying
Large
Language
Models
on Linux
🤖
AI
Content type:
Code
github.com
·
3d
3 days ago
·
DEV
Actions for SLUUG Talk: Demystifying Large Language Models on Linux
AI
Paper Review: Training
Language
Models
to Follow Instructions with Human Feedback (InstructGPT)
🤖
AI
freecodecamp.org
·
6d
6 days ago
Actions for AI Paper Review: Training Language Models to Follow Instructions with Human Feedback (InstructGPT)
My research agenda and work
🤖
AI
lesswrong.com
·
4d
4 days ago
Actions for My research agenda and work
Stack Overflow didn't just help
AI
learn to code
🤖
AI
zozo123.github.io
·
3d
3 days ago
·
Hacker News
Actions for Stack Overflow didn't just help AI learn to code
Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
🏗️
Compilers
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
umair-tareen/philosopher-council: An eleven-philosopher
LLM
council - ask it questions or point it at
AI-research
trends.
Claude-powered
deliberation through the four classical branches of philosophy. Methodology, not metaphysics.
🤖
AI
Content type:
Code
github.com
·
4d
4 days ago
·
r/SideProject
Actions for umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.
Less-relevant results
Reasoning
RL
in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
🧮
Algorithms
turingpost.com
·
3d
3 days ago
Actions for Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
A Regret Minimization Framework on Preference Learning in
Large
Language
Models
🏗️
Compilers
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for A Regret Minimization Framework on Preference Learning in Large Language Models
🔬Scaling Past Informal
AI
- Carina Hong, Axiom Math
🤖
AI
latent.space
·
6d
6 days ago
·
Hacker News
Actions for 🔬Scaling Past Informal AI - Carina Hong, Axiom Math
What Do People Actually Want From
AI
? Mapping Preference Plurality
🤖
AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for What Do People Actually Want From AI? Mapping Preference Plurality
A Unifying Lens on Reward Uncertainty in
RLHF
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for A Unifying Lens on Reward Uncertainty in RLHF
(VERY PARTIAL) CROSSPOST: ALEX HEATH: SubStack Is Opening Up to
AI
: Interviewing CEO Chris Best
🌐
Open Source
Content type:
News
Content type:
Blog
braddelong.substack.com
·
5d
5 days ago
·
Substack
Actions for (VERY PARTIAL) CROSSPOST: ALEX HEATH: SubStack Is Opening Up to AI: Interviewing CEO Chris Best
Hidden Consensus:Preference-Validity Compression in Human Feedback
🧮
Algorithms
Content type:
Academic
arxiv.org
·
9h
9 hours ago
Actions for Hidden Consensus:Preference-Validity Compression in Human Feedback
Principled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in
Large
Language
Models
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Principled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in Large Language Models
Neglected Basics of
AI
Alignment
🤖
AI
lesswrong.com
·
3d
3 days ago
Actions for Neglected Basics of AI Alignment
EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant
RLHF
Platforms
⚙️
C++
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms
Do We Want a Superintelligent People-Pleaser?
🎯
Career Growth
lesswrong.com
·
4d
4 days ago
Actions for Do We Want a Superintelligent People-Pleaser?
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help