🎯 Post-training - samveed · Scour

GDPR request

wiki.openfoodfacts.org·

How to reduce capability degradation from off-model SFT

lesswrong.com·

Lius: Translation Model Based Instructional Lingustic Using Continual Instruction Tuning In Kupang Malay

💬LLMs Academic

Posting for authoring

turingpost.com·

Would a prepaid pass for a coding agent solve a real need or is it just my itch?

codehamr.com··r/SideProject

Some Interesting Papers on RLVR

lesswrong.com·

GraphInfer-Bench: Benchmarking LLM's Inference Capability on Graphs

💬LLMs Academic

How to Train Your Goblin

goblins.mchen.workers.dev··Hacker News, Hacker News

ApodexAI/AgentHarness: Evaluation harness for Apodex-1.0 on public deep-research benchmarks.

💬LLMs Code

github.com··Hacker News

Cohere open-sources a coding agent that runs on a single H100

💻Software Engineering

venturebeat.com·

Stack Overflow didn't just help AI learn to code

zozo123.github.io··Hacker News

The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning

🌐World Models Academic

Introducing the Third Generation of Apple’s Foundation Models

machinelearning.apple.com··Hacker News, r/apple

Mult-DPO: Multinomial Direct Preference Optimization for Recommender Systems

💬LLMs Academic

Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…

💬LLMs Blog

When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff

🌐World Models Academic

local AI agents for Cursor with pre-tuned marketplace/commu

locaible.com··Hacker News

PSA: Convoy offers SFT-70 4000K, CRI 90 (pre-production)

convoylight.com··r/flashlight

The Substitution Wave in AI

🌐World Models

tomtunguz.com·

Alignment Defends LLMs from Property Inference Attacks

🧠AI Academic

Log in to enable infinite scrolling