Scour
🎯 Post-training
RLHF, fine-tuning, DPO, instruction tuning, model alignment
Scoured 107 posts in 25.6 ms
🧠 LLMs · IEEE Spectrum · 6 days ago
IEEE Rolls Out Large Language Models Virtual Training Course
Covers 5 stories, including: How to Compress DICOM (.dcm) Images from 1.4 MB to KB Using Python?
Covered by contextmaestro.com
🤖 AI Agents · arXiv · 1 day ago
The Hitchhiker's Guide to Agentic AI: From Foundations to Systems
✍️ Prompt Engineering · Kilo Blog · 3 days ago
Announcing Next-Edit in Kilo, Powered by Inception
🔌 MCP · Euractiv · 2 days ago
What the Taliban wants from Europe
🛡️ AI Safety · arXiv · 2 days ago
E-MRL: Cross-view Aligned Evidence-driven Multimodal Reinforcement Learning for Reliable 3D Tumor Analysis
🏗️ AI Infra · arXiv · 10 hours ago
EvoOptiGraph: Weakness-Driven Coevolution via Graph-Based Structural Generation for Optimization Modeling
🗄️ Vector Databases · arXiv · 10 hours ago
Scaling Multi-Reference Image Generation with Dynamic Reward Optimization
🏗️ AI Infra · arXiv · 1 day ago
WinDOM: Self-Family Distillation for Small-Model GUI Grounding
🛡️ AI Safety · BLiTZ · 6 days ago
Why did Finland just lift a ban on nuclear weapons?
Covers: Finland tears up nuclear weapons ban in NATO shift
📚 RAG · arXiv · 1 day ago
Retrieval-Augmented Personalization with Foundation Models for Wearable Stress Detection
🔭 Observability · GitHub · 4 days ago
Hellotravisss/cloakpii: PII desensitization + AES-256-GCM encryption + compliance reporting for cross-border data transfers (PIPL / PDPA / GDPR). Pytho...
🧠 LLMs · arXiv · 10 hours ago
DiARC: Distinguishing Positive and Negative Samples Helps Improving ARC-like Reasoning Ability of Large Language Models
📊 LLM Evaluation · arXiv · 1 day ago
Riazi-8B: An Urdu Large Language Model for Mathematical Reasoning
🧠 LLMs · Forbes · 5 days ago
Solution To The Curious Mystery Of Why AI Keeps Inventing The Same Fake Names Over And Over Again
🔄 MLOps · arXiv · 10 hours ago
NebulaExp-8B: An Empirical Post-Training Pipeline via Full-Scale Ablation Research
📊 LLM Evaluation · arXiv · 1 day ago
The Geometry of Sequential Learning: Lie-Bracket Prediction of Transfer Order
✍️ Prompt Engineering · arXiv · 3 days ago
Provable Benefits of RLVR over SFT for Reasoning Models: Learning to Backtrack Efficiently
🧠 LLMs · arXiv · 10 hours ago
Towards Explainable Adjudicative Variance: Quantifying Judicial Discretion via Gated Multi-Task Learning
🛡️ AI Safety · arXiv · 1 day ago
ASSCG: Just-Right Gating over Chattering for Fast-Slow LLM Planning in Autonomous Driving
🛡️ AI Safety · arXiv · 3 days ago
MAGNIFIED: RL Fine-tuning of Multimodal Large Language Models for Motion Planning