🧭 Ethics - vhpoet · Scour

Sixteen schemes for AI safety

lesswrong.com·

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

pub.towardsai.net

·

The crucial human component in computing and AI

🤖AI Academic

Sequent: scale and automation for higher confidence in alignment

lesswrong.com·

Coverage-driven alignment - What ‘Teaching Claude Why’ can borrow from AV verification

lesswrong.com·

ML4Good Summer 2026 Bootcamps - Applications Open!

lesswrong.com·

Anthropic Refused to Let the Pentagon Spy on Americans. It Got Blacklisted.

🏛️Philosophy

pub.towardsai.net

·

How valuable are weak AI safety regulations?

lesswrong.com·

I Started an AI Safety Research Org and Think These 7 Things Matter

🪞Metacognition

lesswrong.com·

The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably

lesswrong.com·

You Can Catch Sleeper Agents by Teaching Another Model to Imitate Them

lesswrong.com·

Phonies

lesswrong.com·

A Mike's-Eye View of ARC's Research

lesswrong.com·

Learnings from starting an AI safety research team

lesswrong.com·

Towards a Formal Scientific Epistemology

🏛️Philosophy

lesswrong.com·

Neglected Basics of AI Alignment

lesswrong.com·

Iliad is Hiring

lesswrong.com·

Bun's Migration from Zig to Rust as a Potential Case Study for Gradual Disempowerment

lesswrong.com··Hacker News

The Alignment Coin

🧠Philosophy of Mind

lesswrong.com·

Is it unethical to work on robotics capabilities research?

lesswrong.com·

Log in to enable infinite scrolling