Post-Training

Feeds to Scour
SubscribedAll
Scoured 157 posts in 5.8 ms

MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better

馃LLM InferenceContent type: NewsContent type: Blog

ApodexAI/AgentHarness: Evaluation harness for Apodex-1.0 on public deep-research benchmarks.

馃LLM InferenceContent type: Code
github.comHacker News

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

馃LLM InferenceContent type: Academic
arxiv.org

AI2's Nathan Lambert says Nvidia's multi-teacher on-policy distillation for Nemotron 3 Ultra is the post-training industry standard

鈿欙笍AI Infrastructure
digg.com

We Should Take Text Optimization More Seriously

馃LLM InferenceContent type: Blog
yoonholee.comHacker News

Stack Overflow didn't just help AI learn to code

馃LLM Inference

Cohere open-sources a coding agent that runs on a single H100

馃攧Agentic Systems
venturebeat.com

How to Train Your Goblin

馃LLM Inference

Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

馃LLM InferenceContent type: Academic
arxiv.org

local AI agents for Cursor with pre-tuned marketplace/commu

馃攧Agentic Systems
locaible.comHacker News

Posting for authoring

馃攧Agentic Systems
turingpost.com

From 1 July, the AP will check the registration of scan cars in the algorithm register

馃攧Agentic Systems

Mult-DPO: Multinomial Direct Preference Optimization for Recommender Systems

馃LLM InferenceContent type: Academic
arxiv.org

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

鈿欙笍AI InfrastructureContent type: Code
github.comHacker News

Emergence of Context Characteristics Sensitivity in Large Language Models

馃攳Retrieval-Augmented GenerationContent type: Academic
arxiv.org

Would a prepaid pass for a coding agent solve a real need or is it just my itch?

馃LLM Inference
codehamr.comr/SideProject

EDPB meets with EU Commissioner McGrath and adopts common data breach notification template

馃攳Retrieval-Augmented Generation
edpb.europa.eu

The sample efficiency black hole

鈿欙笍AI InfrastructureContent type: News
dwarkesh.comHacker News

Alignment Defends LLMs from Property Inference Attacks

馃LLM InferenceContent type: Academic
arxiv.org

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help