🎯 AI Alignment - faruk · Scour

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

🧠LLMs Academic

Mechanistic Interpretability: The Key to Trusting Agentic AI

🧠LLMs Discussion

bradenkelley.com·

The Ghost of Alignment — Why AI Should Never Fully Obey Humanity

📊AI Monitoring Blog

·

[Recorded talk] "AI Alignment Versus AI Ethical Treatment: 10 Challenges"

🧩Epistemics Blog

meditationsondigitalminds.substack.com··Substack

Sequent: scale and automation for higher confidence in alignment

lesswrong.com·

How authoritarian governments twist AI safety to coerce tech companies to comply

📊AI Monitoring

fastcompany.com·

Criti-hyping is the best thing that happened to Big Tech

📝Long-form Essays

reveriesofahuman.com·

Controversial smut as an AI alignment issue

🧩Epistemics News Blog

thingofthings.substack.com··Substack

Why LLMs (still) lack taste

beyondtheprior.com··Hacker News

The crucial human component in computing and AI

🧩Epistemics Academic

Solsong Chord Updates

Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond

turingpost.com·

Existential Indifference: Self-Nonpreservation as a Necessary Architectural Condition for Aligned Superintelligence (or: The Suicidal AI)

⚙️AI Infrastructure Academic

Op Ed: Consultant Tony O’Connor On The Agentic Trojan Horse

📊AI Monitoring

thecompanydime.com·

scMTG reconstructs single-cell temporal dynamics with Markov transition generators

🧠LLMs Academic

Stack Overflow didn't just help AI learn to code

zozo123.github.io··Hacker News

Less-relevant results

Complete Drosophila Nervous System Mapped

⚙️AI Infrastructure

neurosciencenews.com·

The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably

lesswrong.com·

Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…

📡Information Retrieval Blog

Designer babies. Self-improving AI. Are we ready for either?

🧩Epistemics News

·

Log in to enable infinite scrolling