🔒 Agentic Safety - buckman · Scour

[Recorded talk] "AI Alignment Versus AI Ethical Treatment: 10 Challenges"

⚖️Ethics Blog

meditationsondigitalminds.substack.com··Substack

Iliad is Hiring

🛡️AI Safety Evals

lesswrong.com·

Less-relevant results

Controversial smut as an AI alignment issue

⚖️Ethics News Blog

thingofthings.substack.com··Substack

Criti-hyping is the best thing that happened to Big Tech

reveriesofahuman.com·

OpenClaw Won: How Big Tech Adopted the AI Agent

thelettertwo.com·

The crucial human component in computing and AI

⚖️Ethics Academic

Contra Dance at LessOnline

Against Corrigibility

lesswrong.com·

Beyond Safety Through Filtering: Toward Responsible Training on Human Distress

🔍Intelligence Analysis Blog

compliancearchitecture.substack.com··r/OpenAI

Op Ed: Consultant Tony O’Connor On The Agentic Trojan Horse

⚖️AI Regulation

thecompanydime.com·

Neglected Basics of AI Alignment

🛡️LLM Security

lesswrong.com·

AI Paper Review: Training Language Models to Follow Instructions with Human Feedback (InstructGPT)

freecodecamp.org·

High Dynamic Range DIY Air Testing

Book of Cron Job

🐚Shell Scripting

lesswrong.com·

Towards a Formal Scientific Epistemology

🧠Rationality

lesswrong.com·

SecureBio Detection is Hiring Software Engineers

Linkpost for June

🌍Civilizational Risk Blog

thingofthings.substack.com

Aligning Superintelligent Humans

lesswrong.com·

Coming Around To Political Donations

🗳️Elections

Sixteen schemes for AI safety

🛡️AI Safety

lesswrong.com·

Log in to enable infinite scrolling