Evidence on language model consciousness
lesswrong.com·1d
🪞Metacognition
Flag this post
We are building AI slaves. Alignment through control will fail
utopai.substack.com·3d·
Discuss: Substack
🧭Ethics
Flag this post
Human Values ≠ Goodness
lesswrong.com·3h
🧭Ethics
Flag this post
Think GPT-5 Is Halfway to AGI? Think Again.
pub.towardsai.net·16h
🤖AI
Flag this post
Reason About Intelligence, Not AI
lesswrong.com·3h
🧭Ethics
Flag this post
Decision theory when you can't make decisions
lesswrong.com·1d
🪞Metacognition
Flag this post
Anthropic Research Shows How LLMs Perceive Text via @sejournal, @martinibuster
searchenginejournal.com·3d
🤖AI
Flag this post
Emergent Introspective Awareness in Large Language Models
transformer-circuits.pub·4d·
🪞Metacognition
Flag this post
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.com·7h
👤Psychology
Flag this post
Take Weird Ideas Seriously
notboring.co·3d·
Discuss: Hacker News
🧭Ethics
Flag this post
LLM-generated text is not testimony
lesswrong.com·1d
💬LLM
Flag this post
Economics and Transformative AI (by Tom Cunningham)
lesswrong.com·1d
🧭Ethics
Flag this post
25 Que
lesswrong.com·10h
👤Psychology
Flag this post
Brainstorming 25 Questions I Am Interested In
lesswrong.com·10h
👤Psychology
Flag this post
How I Learned to Stop Worrying and Love My Shitty Life
thedriftmag.com·2d·
Discuss: Hacker News
👤Psychology
Flag this post
The business of the culture war
feeds.feedblitz.com·18h
💬LLM
Flag this post
FTL travel and scientific realism
lesswrong.com·16h
🏛️Philosophy
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·2d
🤖AI
Flag this post
A toy model of corrigibility
lesswrong.com·4h
🤖AI
Flag this post
Model welfare and open source
lesswrong.com·20h
🤖AI
Flag this post