🤖 LLMs - machacek.vitek · Scour

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

🏗️Compilers Academic

Nvidia Nemotron 3 Ultra

research.nvidia.com··Hacker News

Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…

🏗️Compilers Blog

SLUUG Talk: Demystifying Large Language Models on Linux

🤖AI Code

github.com··DEV

AI Paper Review: Training Language Models to Follow Instructions with Human Feedback (InstructGPT)

freecodecamp.org·

My research agenda and work

lesswrong.com·

Stack Overflow didn't just help AI learn to code

zozo123.github.io··Hacker News

Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance

🏗️Compilers Academic

umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.

🤖AI Code

github.com··r/SideProject

Less-relevant results

Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond

turingpost.com·

A Regret Minimization Framework on Preference Learning in Large Language Models

🏗️Compilers Academic

🔬Scaling Past Informal AI - Carina Hong, Axiom Math

latent.space··Hacker News

What Do People Actually Want From AI? Mapping Preference Plurality

🤖AI Academic

A Unifying Lens on Reward Uncertainty in RLHF

🤖AI Academic

(VERY PARTIAL) CROSSPOST: ALEX HEATH: SubStack Is Opening Up to AI: Interviewing CEO Chris Best

🌐Open Source News Blog

braddelong.substack.com

Hidden Consensus:Preference-Validity Compression in Human Feedback

🧮Algorithms Academic

Principled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in Large Language Models

🤖AI Academic

Neglected Basics of AI Alignment

lesswrong.com·

EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms

⚙️C++ Academic

Do We Want a Superintelligent People-Pleaser?

🎯Career Growth

lesswrong.com·

Log in to enable infinite scrolling