🎯 Post-Training - touyou · Scour

MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better

🤖LLM Inference News Blog

kaitchup.substack.com · r/LocalLLaMA

ApodexAI/AgentHarness: Evaluation harness for Apodex-1.0 on public deep-research benchmarks.

🤖LLM Inference Code

github.com · Hacker News

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

🤖LLM Inference Academic

AI2's Nathan Lambert says Nvidia's multi-teacher on-policy distillation for Nemotron 3 Ultra is the post-training industry standard

⚙️AI Infrastructure

We Should Take Text Optimization More Seriously

🤖LLM Inference Blog

yoonholee.com · Hacker News

Stack Overflow didn't just help AI learn to code

🤖LLM Inference

zozo123.github.io · Hacker News

Cohere open-sources a coding agent that runs on a single H100

🔄Agentic Systems

venturebeat.com

How to Train Your Goblin

🤖LLM Inference

goblins.mchen.workers.dev · Hacker News

Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

🤖LLM Inference Academic

local AI agents for Cursor with pre-tuned marketplace/commu

🔄Agentic Systems

locaible.com · Hacker News

Posting for authoring

🔄Agentic Systems

turingpost.com

From 1 July, the AP will check the registration of scan cars in the algorithm register

🔄Agentic Systems

autoriteitpersoonsgegevens.nl

Mult-DPO: Multinomial Direct Preference Optimization for Recommender Systems

🤖LLM Inference Academic

GDPR request

🔍Retrieval-Augmented Generation

wiki.openfoodfacts.org

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

⚙️AI Infrastructure Code

github.com · Hacker News

Emergence of Context Characteristics Sensitivity in Large Language Models

🔍Retrieval-Augmented Generation Academic

Would a prepaid pass for a coding agent solve a real need or is it just my itch?

🤖LLM Inference

codehamr.com · r/SideProject

EDPB meets with EU Commissioner McGrath and adopts common data breach notification template

🔍Retrieval-Augmented Generation

edpb.europa.eu

The sample efficiency black hole

⚙️AI Infrastructure News

dwarkesh.com · Hacker News

Alignment Defends LLMs from Property Inference Attacks

🤖LLM Inference Academic
