Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Post-training
🎯 Post-training
Specific
RLHF, fine-tuning, DPO, instruction tuning, model alignment
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
107
posts in
22.9
ms
🧠
LLMs
fareedkhan-dev.github.io
·
5d
5 days ago
Train
LLM from Scratch
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Train LLM from Scratch
📊
LLM Evaluation
arXiv
·
2d
2 days ago
Weight-Space Geometry of Offline Reasoning
Training
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Weight-Space Geometry of Offline Reasoning Training
Less-relevant results
🏗️
AI Infra
Liquid AI
·
8h
8 hours ago
LFM2.5-230M: Built to Run Anywhere
Covered by
VentureBeat
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LFM2.5-230M: Built to Run Anywhere
🛡️
AI Safety
Pangeanic Blog
·
1d
1 day ago
From
Fine-Tuning
to Red Teaming: The Data Operations Behind Reliable AI
Models
Covers
AI Risk Management Framework
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for From Fine-Tuning to Red Teaming: The Data Operations Behind Reliable AI Models
🧠
LLMs
Bloomberg
·
4d
4 days ago
Tech Disruptors: Invisible Technologies on
RLHF
and LLM
Training
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tech Disruptors: Invisible Technologies on RLHF and LLM Training
🧠
LLMs
zentara.co
·
1d
1 day ago
LLM Refusal Behavior on Open-Weight
Model
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLM Refusal Behavior on Open-Weight Model
📚
RAG
Hacker News
·
4d
4 days ago
Good results
fine
tuning
a local LLM like Qwen 3:0.6B to categorize questions
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions
🧠
LLMs
GitHub
·
6d
6 days ago
Show HN: NanoEuler – GPT-2 scale
model
in pure C/CUDA from scratch
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch
🧠
LLMs
Digital Trends
·
1d
1 day ago
As Hollywood jobs dry up, workers are quietly
training
AI
models
to survive
Covers
I Work in Hollywood. Everyone Who Used to Make TV Is Now Secretly Training AI
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for As Hollywood jobs dry up, workers are quietly training AI models to survive
🛡️
AI Safety
arXiv
·
10h
10 hours ago
Paved with True Intents: Intent-Aware
Training
Improves LLM Safety Classification Across
Training
Regimes
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Paved with True Intents: Intent-Aware Training Improves LLM Safety Classification Across Training Regimes
📊
LLM Evaluation
Helsinki Times
·
4d
4 days ago
Orpo
intervenes in NGO funding dispute as Soste faces major job cuts
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Orpo intervenes in NGO funding dispute as Soste faces major job cuts
🏗️
AI Infra
lemmy.ml
·
3d
3 days ago
VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language
Models
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models
🗄️
Vector Databases
Nature
·
14h
14 hours ago
Patterns of Edtech use and mastery among university students: an exploratory socio-cognitive analysis
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Patterns of Edtech use and mastery among university students: an exploratory socio-cognitive analysis
🤖
AI Agents
chapterpal.com
·
2d
2 days ago
Sakana Fugu Technical Report
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Sakana Fugu Technical Report
🧠
LLMs
biorxiv.org
·
3d
3 days ago
CellTosg2Sequence: A Unified Text-Omics-Signaling-Graph Large Language
Model
for Single-Cell Analysis
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for CellTosg2Sequence: A Unified Text-Omics-Signaling-Graph Large Language Model for Single-Cell Analysis
🧠
LLMs
arXiv
·
10h
10 hours ago
Reasoning Quality Emerges Early: Data Curation for Reasoning
Models
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Reasoning Quality Emerges Early: Data Curation for Reasoning Models
✍️
Prompt Engineering
Hugging Face
·
2d
2 days ago
Qwen-AgentWorld-35B-A3B: a 3B-active MoE
trained
to simulate MCP, terminal, SWE, Android, web and OS environments
Covers
2 stories
See all stories this covers
including
vllm-project/vllm
Covered by
3 sources
See all sources covering this story
including
GitHub
,
indiehacker.news
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments
✍️
Prompt Engineering
MicroScope
·
5h
5 hours ago
Met Palantir pilot: The DPIA that raises more questions than answers
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Met Palantir pilot: The DPIA that raises more questions than answers
🛡️
AI Safety
gdpredirect.com
·
1d
1 day ago
Become EU compliant in one line of code (satire)
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Become EU compliant in one line of code (satire)
✍️
Prompt Engineering
fig.inc
·
6d
6 days ago
Breaking Browser-Use
Models
Using Domain Randomization
Covers
Kimi K2.5: Visual Agentic Intelligence
Discussed on
Hacker News
and
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Breaking Browser-Use Models Using Domain Randomization
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report