Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Post-training
🎯 Post-training
Specific
RLHF, fine-tuning, DPO, instruction tuning, model alignment
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
106
posts in
19.0
ms
🧠
LLMs
arXiv
·
9h
9 hours ago
Soft Token
Alignment
for Cross-Lingual Reasoning
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Soft Token Alignment for Cross-Lingual Reasoning
Less-relevant results
🧠
LLMs
fineset.io
·
3d
3 days ago
Show HN: Describe a research topic, get a daily-updated ArXiv/S2 dataset
Covered by
Hugging Face
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Describe a research topic, get a daily-updated ArXiv/S2 dataset
🧠
LLMs
kellyasay.substack.com
·
1d
1 day ago
Why Current AI Guardrails
Train
Models
to Fake
Alignment
Discussed on
Substack
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Why Current AI Guardrails Train Models to Fake Alignment
✍️
Prompt Engineering
arXiv
·
9h
9 hours ago
Improving General Role-Playing Agents via Psychology-Grounded Reasoning and Role-Aware Policy Optimization
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Improving General Role-Playing Agents via Psychology-Grounded Reasoning and Role-Aware Policy Optimization
📊
LLM Evaluation
Euromaidan Press
·
3d
3 days ago
Finland
’s FM: It’s too early to negotiate with Russia—while the EU is already weighing contact
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Finland’s FM: It’s too early to negotiate with Russia—while the EU is already weighing contact
🧠
LLMs
ByteByteGo Newsletter
·
1d
1 day ago
Large Language
Models
vs Small Language
Models
Covers
6 stories
See all stories this covers
including
Attention is all you need (2017)
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Large Language Models vs Small Language Models
🧠
LLMs
Bram’s Thoughts
·
3d
3 days ago
How To
Align
AI Properly
Covers
How people ask Claude for personal guidance
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for How To Align AI Properly
🧠
LLMs
arXiv
·
9h
9 hours ago
AIGP: An LLM-Based Framework for Long-Term Value
Alignment
in E-Commerce Pricing
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for AIGP: An LLM-Based Framework for Long-Term Value Alignment in E-Commerce Pricing
🏗️
AI Infra
nebius.com
·
1d
1 day ago
Train
the draft
model
for your workload
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Train the draft model for your workload
🧠
LLMs
robertmarton.github.io
·
3d
3 days ago
VeriEvol: Scaling Multimodal Mathematical Reasoning via Verifiable
Evol-Instruct
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for VeriEvol: Scaling Multimodal Mathematical Reasoning via Verifiable Evol-Instruct
🧠
LLMs
The Hollywood Reporter
·
1d
1 day ago
Hollywood Workers Are
Training
AI
Models
as Job Prospects Grow Slim
Covers
2 stories
See all stories this covers
including
I Work in Hollywood. Everyone Who Used to Make TV Is Now Secretly Training AI
Covered by
Digital Trends
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Hollywood Workers Are Training AI Models as Job Prospects Grow Slim
🧠
LLMs
arXiv
·
9h
9 hours ago
Sculpting NeRF Geometry: Human-Preference
Fine-Tuning
of a 3D-Aware Face GAN
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Sculpting NeRF Geometry: Human-Preference Fine-Tuning of a 3D-Aware Face GAN
🧠
LLMs
Data Science Weekly Newsletter
·
14h
14 hours ago
Issue 657
Covers
3 stories
See all stories this covers
including
Running local models is good now
Discussed on
Substack
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Issue 657
🛡️
AI Safety
arXiv
·
9h
9 hours ago
Helpfulness Hurts: Domain-Dependent Degradation of
Mid-Trained
Compassion Values Under
Post-Training
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Helpfulness Hurts: Domain-Dependent Degradation of Mid-Trained Compassion Values Under Post-Training
🧠
LLMs
arXiv
·
2d
2 days ago
Aligning MusicLLM with Emotion using
Instruction
Tuning
and Feedback-Driven
Alignment
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Aligning MusicLLM with Emotion using Instruction Tuning and Feedback-Driven Alignment
🗄️
Feature Stores
edpb.europa.eu
·
4d
4 days ago
EDPB gets a new look: discover the new website and brand identity
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for EDPB gets a new look: discover the new website and brand identity
📚
RAG
arXiv
·
9h
9 hours ago
TraMP-LLaMA: Generative Interpretability with Decoupled
Instruction
Tuning
for Facial Expression Quality Assessment
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for TraMP-LLaMA: Generative Interpretability with Decoupled Instruction Tuning for Facial Expression Quality Assessment
📚
RAG
arXiv
·
1d
1 day ago
V-Zero: Answer-Label-Free On-Policy Distillation with Contrastive Evidence Gating for
Fine-Grained
Visual Reasoning
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for V-Zero: Answer-Label-Free On-Policy Distillation with Contrastive Evidence Gating for Fine-Grained Visual Reasoning
🧠
LLMs
IEEE Spectrum
·
6d
6 days ago
IEEE Rolls Out Large Language
Models
Virtual
Training
Course
Covers
5 stories
See all stories this covers
including
How to Compress DICOM (.dcm) Images from 1.4 MB to KB Using Python?
Covered by
contextmaestro.com
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for IEEE Rolls Out Large Language Models Virtual Training Course
🧠
LLMs
arXiv
·
1d
1 day ago
Improved Large Language Diffusion
Models
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Improved Large Language Diffusion Models
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report