AI

Feeds to Scour
SubscribedAll
Scoured 80 posts in 5.9 ms

Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output

馃AI ModelsContent type: Academic
arxiv.org

Introducing the Third Generation of Apple鈥檚 Foundation Models

馃AI

Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond

馃AI Models
turingpost.com

Stack Overflow didn't just help AI learn to code

馃AI

Phantom transitions in language model fine-tuning

馃AI Content type: Academic
arxiv.org

STAT+: AI titans push Congress for DNA safeguards

馃AI
statnews.com

What Do People Actually Want From AI? Mapping Preference Plurality

馃AI ModelsContent type: Academic
arxiv.org

(VERY PARTIAL) CROSSPOST: ALEX HEATH: SubStack Is Opening Up to AI: Interviewing CEO Chris Best

馃殌startupsContent type: NewsContent type: Blog

Train your own GPT-2 (124M).

馃AI Content type: Blog
medium.com

Building Semantic Search with Transformers.js and Sentence Embeddings

馃AI

A Unifying Lens on Reward Uncertainty in RLHF

馃AI ModelsContent type: Academic
arxiv.org

Neglected Basics of AI Alignment

馃AI Models
lesswrong.com

Hidden Consensus:Preference-Validity Compression in Human Feedback

馃AI ModelsContent type: Academic
arxiv.org

Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model

馃AI Content type: Academic
nature.com

A Regret Minimization Framework on Preference Learning in Large Language Models

馃LLMsContent type: Academic
arxiv.org

NVIDIA/cosmos: NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

馃AI Content type: Code
github.com

Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance

馃LLMsContent type: Academic
arxiv.org

Do We Want a Superintelligent People-Pleaser?

馃AI Models
lesswrong.com

Towards Robust Arabic Speech Emotion Recognition with Deep Learning

馃AI Content type: Academic
arxiv.org

Principled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in Large Language Models

馃AI ModelsContent type: Academic
arxiv.org
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help