Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI
馃 AI
Broad
Claude, OpenAI, Codex, Cursor, Anthropic, Copilot.
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80
posts in
5.9
ms
Representation-Aware Advantage Estimation: Your Reward
Model
Provides More Than A Scalar Output
聽
馃
AI Models
聽
Content type:
Academic
arxiv.org
路
22h
22 hours ago
Actions for Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output
Introducing the Third Generation of Apple鈥檚 Foundation
Models
聽
馃
AI
machinelearning.apple.com
路
3d
3 days ago
路
Hacker News
,
r/apple
Actions for Introducing the Third Generation of Apple鈥檚 Foundation Models
Reasoning
RL
in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
聽
馃
AI Models
turingpost.com
路
3d
3 days ago
Actions for Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
Stack Overflow didn't just help
AI
learn to
code
聽
馃
AI
zozo123.github.io
路
3d
3 days ago
路
Hacker News
Actions for Stack Overflow didn't just help AI learn to code
Phantom transitions in
language
model
fine-tuning
聽
馃
AI
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for Phantom transitions in language model fine-tuning
STAT+:
AI
titans push Congress for DNA safeguards
聽
馃
AI
statnews.com
路
6d
6 days ago
Actions for STAT+: AI titans push Congress for DNA safeguards
What Do People Actually Want From
AI
? Mapping Preference Plurality
聽
馃
AI Models
聽
Content type:
Academic
arxiv.org
路
2d
2 days ago
Actions for What Do People Actually Want From AI? Mapping Preference Plurality
(VERY PARTIAL) CROSSPOST: ALEX HEATH: SubStack Is Opening Up to
AI
: Interviewing CEO Chris Best
聽
馃殌
startups
聽
Content type:
News
聽
Content type:
Blog
braddelong.substack.com
路
5d
5 days ago
路
Substack
Actions for (VERY PARTIAL) CROSSPOST: ALEX HEATH: SubStack Is Opening Up to AI: Interviewing CEO Chris Best
Train your own GPT-2 (124M).
聽
馃
AI
聽
Content type:
Blog
medium.com
路
5d
5 days ago
Actions for Train your own GPT-2 (124M).
Building Semantic Search with
Transformers.js
and Sentence Embeddings
聽
馃
AI
machinelearningmastery.com
路
5d
5 days ago
Actions for Building Semantic Search with Transformers.js and Sentence Embeddings
A Unifying Lens on Reward Uncertainty in
RLHF
聽
馃
AI Models
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for A Unifying Lens on Reward Uncertainty in RLHF
Neglected Basics of
AI
Alignment
聽
馃
AI Models
lesswrong.com
路
3d
3 days ago
Actions for Neglected Basics of AI Alignment
Hidden Consensus:Preference-Validity Compression in Human Feedback
聽
馃
AI Models
聽
Content type:
Academic
arxiv.org
路
22h
22 hours ago
Actions for Hidden Consensus:Preference-Validity Compression in Human Feedback
Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented
CNN-transformer
model
聽
馃
AI
聽
Content type:
Academic
nature.com
路
6d
6 days ago
Actions for Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model
A Regret Minimization Framework on Preference Learning in
Large
Language
Models
聽
馃
LLMs
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for A Regret Minimization Framework on Preference Learning in Large Language Models
NVIDIA/cosmos: NVIDIA Cosmos is an
open
platform of world
models
, datasets, and tools that enables developers to build Physical
AI
for robots, autonomous vehicles, smart infrastructure, and more.
聽
馃
AI
聽
Content type:
Code
github.com
路
6d
6 days ago
Actions for NVIDIA/cosmos: NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.
Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
聽
馃
LLMs
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
Do We Want a Superintelligent People-Pleaser?
聽
馃
AI Models
lesswrong.com
路
5d
5 days ago
Actions for Do We Want a Superintelligent People-Pleaser?
Towards Robust Arabic Speech Emotion Recognition with Deep Learning
聽
馃
AI
聽
Content type:
Academic
arxiv.org
路
22h
22 hours ago
Actions for Towards Robust Arabic Speech Emotion Recognition with Deep Learning
Principled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in
Large
Language
Models
聽
馃
AI Models
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for Principled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in Large Language Models
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help