🎯 Post-training
Specific: fine-tuning, RLHF, instruction tuning, alignment
Scoured 155 posts in 6.7 ms
GDPR request
📊 ML
wiki.openfoodfacts.org · 6 days ago
How to reduce capability degradation from off-model SFT
💬 LLMs
lesswrong.com · 2 days ago
Lius: Translation Model Based Instructional Lingustic Using Continual Instruction Tuning In Kupang Malay
💬 LLMs
Content type: Academic
arxiv.org · 10 hours ago
Posting for authoring
💬 LLMs
turingpost.com · 3 days ago
Would a prepaid pass for a coding agent solve a real need or is it just my itch?
💬 LLMs
codehamr.com · 6 days ago · r/SideProject
Some Interesting Papers on RLVR
🎮 RL
lesswrong.com · 1 day ago
GraphInfer-Bench: Benchmarking LLM's Inference Capability on Graphs
💬 LLMs
Content type: Academic
arxiv.org · 10 hours ago
How to Train Your Goblin
💬 LLMs
goblins.mchen.workers.dev · 4 days ago · Hacker News
ApodexAI/AgentHarness: Evaluation harness for Apodex-1.0 on public deep-research benchmarks.
💬 LLMs
Content type: Code
github.com · 1 day ago · Hacker News
Cohere open-sources a coding agent that runs on a single H100
💻 Software Engineering
venturebeat.com · 1 day ago
Stack Overflow didn't just help AI learn to code
🧠 AI
zozo123.github.io · 4 days ago · Hacker News
The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning
🌐 World Models
Content type: Academic
arxiv.org · 10 hours ago
Introducing the Third Generation of Apple’s Foundation Models
💬 LLMs
machinelearning.apple.com · 3 days ago · Hacker News, r/apple
Mult-DPO: Multinomial Direct Preference Optimization for Recommender Systems
💬 LLMs
Content type: Academic
arxiv.org · 1 day ago
Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…
💬 LLMs
Content type: Blog
medium.com · 6 days ago
When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff
🌐 World Models
Content type: Academic
arxiv.org · 1 day ago
local AI agents for Cursor with pre-tuned marketplace/commu
💬 LLMs
locaible.com · 1 day ago · Hacker News
PSA: Convoy offers SFT-70 4000K, CRI 90 (pre-production)
🎮 RL
convoylight.com · 6 days ago · r/flashlight
The Substitution Wave in AI
🌐 World Models
tomtunguz.com · 4 days ago
Alignment Defends LLMs from Property Inference Attacks
🧠 AI
Content type: Academic
arxiv.org · 1 day ago