🎯 RLHF - moyutianzun

Less-relevant results

The week AI infrastructure crossed from a technology story to a financial one

⚙post training infra News

mlwhiz.com·

Why LLMs (still) lack taste

⚙post training infra

beyondtheprior.com··Hacker News

Anthropic’s Bet: Interview with Dario Amodei

🔄Transformers

4sysops.com·

Tracing Eval-Awareness Emergence Through Training of OLMo 3

⚙post training infra

lesswrong.com·

umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.

⚙post training infra Code

github.com··r/SideProject

APOSM: Pairwise preference learning improves generative small-molecule design

🎭Mixture of Experts Academic

biorxiv.org·

Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…

🔍RAG Blog

medium.com·

Show HN: The Deterministic Core Architecture for AI-Augmented Applications

🔧MLIR

brandonbellsystems.com··Hacker News

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

⚙post training infra Academic

arxiv.org·

Claude vs GPT-4: Which AI API Is Better for Developers? (2026)

🤖agentic system

kalyna.pro··DEV

GDPR request

⚙post training infra

wiki.openfoodfacts.org·

local AI agents for Cursor with pre-tuned marketplace/commu

🤖agentic system

locaible.com··Hacker News

Would a prepaid pass for a coding agent solve a real need or is it just my itch?

🤖agentic system

codehamr.com··r/SideProject

Alignment Defends LLMs from Property Inference Attacks

⚙post training infra Academic

arxiv.org·

Neglected Basics of AI Alignment

⚙post training infra

lesswrong.com·

Posting for authoring

⚙post training infra

turingpost.com·

A Regret Minimization Framework on Preference Learning in Large Language Models

⚙post training infra Academic

arxiv.org·

Stack Overflow didn't just help AI learn to code

⚙post training infra

zozo123.github.io··Hacker News

Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond

Mult-DPO: Multinomial Direct Preference Optimization for Recommender Systems
