Post-training

 📊ML

How to reduce capability degradation from off-model SFT

 💬LLMs
lesswrong.com·

Lius: Translation Model Based Instructional Lingustic Using Continual Instruction Tuning In Kupang Malay

 💬LLMs  Content type: Academic
arxiv.org·

Posting for authoring

 💬LLMs
turingpost.com·

Would a prepaid pass for a coding agent solve a real need or is it just my itch?

 💬LLMs

Some Interesting Papers on RLVR

 🎮RL
lesswrong.com·

GraphInfer-Bench: Benchmarking LLM's Inference Capability on Graphs

 💬LLMs  Content type: Academic
arxiv.org·

How to Train Your Goblin

 💬LLMs

ApodexAI/AgentHarness: Evaluation harness for Apodex-1.0 on public deep-research benchmarks.

 💬LLMs  Content type: Code
github.com · Hacker News

Cohere open-sources a coding agent that runs on a single H100

 💻Software Engineering
venturebeat.com·

Stack Overflow didn't just help AI learn to code

 🧠AI

The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning

 🌐World Models  Content type: Academic
arxiv.org·

Introducing the Third Generation of Apple’s Foundation Models

 💬LLMs

Mult-DPO: Multinomial Direct Preference Optimization for Recommender Systems

 💬LLMs  Content type: Academic
arxiv.org·

Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…

 💬LLMs  Content type: Blog
medium.com·

When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff

 🌐World Models  Content type: Academic
arxiv.org·

local AI agents for Cursor with pre-tuned marketplace/commu

 💬LLMs

PSA: Convoy offers SFT-70 4000K, CRI 90 (pre-production)

 🎮RL

The Substitution Wave in AI

 🌐World Models
tomtunguz.com·

Alignment Defends LLMs from Property Inference Attacks

 🧠AI  Content type: Academic
arxiv.org·
