🎯 RLHF
Reinforcement Learning, Human Feedback, LLM Alignment
Scoured 142 posts in 11.3 ms
RL Excursions during Pre-Training: Re-examining Policy Optimization for LLM training
🎛️ Fine-tuning · Academic · arxiv.org · 6 days ago
DOG-DPO: Dynamic Optimization in Geometry for Safety Alignment
🎯 Fine-Tuning · Academic · arxiv.org · 1 day ago
Hidden Consensus: Preference-Validity Compression in Human Feedback
🎯 Fine-Tuning · Academic · arxiv.org · 15 hours ago
What Do People Actually Want From AI? Mapping Preference Plurality
🎛️ Fine-tuning · Academic · arxiv.org · 2 days ago
Training LLMs to Enforce Multi-Level Instruction Hierarchies via Gravity-Weighted Direct Preference Optimization
🎛️ Fine-tuning · Academic · arxiv.org · 15 hours ago
TLA-Prover: Verifiable TLA+ Specification Synthesis via Preference-Optimized Low-Rank Adaptation
🎛️ Fine-tuning · Academic · arxiv.org · 5 days ago
Multilingual Refusal Alignment for Safer Large Language Models
🎯 Fine-Tuning · Academic · arxiv.org · 1 day ago
Pareto-Guided Teacher Alignment for Fair Personalized Text Generation
🎛️ Fine-tuning · Academic · arxiv.org · 15 hours ago
Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It
🎛️ Fine-tuning · Academic · arxiv.org · 15 hours ago
Better Literary Translation: A Multi-Aspect Data Generation and LLM Training Approach
🎛️ Fine-tuning · Academic · arxiv.org · 5 days ago
Emergence of Context Characteristics Sensitivity in Large Language Models
🎛️ Fine-tuning · Academic · arxiv.org · 1 day ago
PriFT: Prior-Support Guided Supervised Fine-Tuning
🎛️ Fine-tuning · Academic · arxiv.org · 1 day ago
Gradient-Guided Reward Optimization for Inference-time Alignment
🎛️ Fine-tuning · Academic · arxiv.org · 1 day ago
GRAIL: Gradient-Reweighted Advantages for Reinforcement Learning with Verifiable Rewards
🎮 Reinforcement Learning · Academic · arxiv.org · 6 days ago
On the Geometry of On-Policy Distillation
🎛️ Fine-tuning · Academic · arxiv.org · 2 days ago
DynaCF: Mitigating Shortcut Learning in Reward Models via Dynamic Counterfactual Sensitivity
🎮 Reinforcement Learning · Academic · arxiv.org · 1 day ago
SCI-PRM: A Tool Aware Process Reward Model for Scientific Reasoning Verification
🎮 Reinforcement Learning · Academic · arxiv.org · 6 days ago
Belief-Space Quantum-Inspired Reinforcement Learning for Partially Observable Autonomous Cyber Defense in the Internet of Vehicles
🎮 Reinforcement Learning · Academic · arxiv.org · 1 day ago
Dynamic Multi-Pair Trading Strategy in Cryptocurrency Markets with Deep Reinforcement Learning
🎮 Reinforcement Learning · Academic · arxiv.org · 6 days ago
Learning to Attack and Defend: Adaptive Red Teaming of Language Models via GRPO
🎮 Reinforcement Learning · Academic · arxiv.org · 1 day ago