AI Safety
🛡️ AI Safety
Alignment, Interpretability, Adversarial Examples, Ethics
[Recorded talk] "AI Alignment Versus AI Ethical Treatment: 10 Challenges"
🧠 Philosophy · Content type: Blog · meditationsondigitalminds.substack.com · 1 day ago · Substack
Mechanistic Interpretability: The Key to Trusting Agentic AI
🤖 LLMs · Content type: Discussion · bradenkelley.com · 4 days ago
The Ghost of Alignment — Why AI Should Never Fully Obey Humanity
🔄 Transformers · Content type: Blog · medium.com · 59 minutes ago
The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
🌐 Trade Policy · lesswrong.com · 13 hours ago
Trajectory Geometry of Transformer Representations Across Layers
🔄 Transformers · Content type: Academic · arxiv.org · 1 day ago
From oversight to coercion: How authoritarian governments are twisting AI safety to get tech companies to fall in line
📜 Tech Policy · theconversation.com · 6 days ago
Criti-hyping is the best thing that happened to Big Tech
💻 Tech News · reveriesofahuman.com · 1 day ago
Controversial smut as an AI alignment issue
📚 Literature · Content type: News, Blog · thingofthings.substack.com · 5 days ago · Substack
Solsong Chord Updates
🎵 Music · jefftk.com · 10 hours ago
Anthropic’s Shocking Warning: AI Could Soon Upgrade Itself—Should the World Hit Pause?
🔄 Transformers · Content type: Video · youtube.com · 4 days ago
Mathematical proof reveals why fixed AI guardrails can never block every jailbreak
🤖 AI · techxplore.com · 7 hours ago
VFUSE: Virulent Feature Understanding with Sparse autoEncoders
👁️ Computer Vision · Content type: Academic · arxiv.org · 19 hours ago
Op Ed: Consultant Tony O’Connor On The Agentic Trojan Horse
📜 Tech Policy · thecompanydime.com · 2 days ago
The crucial human component in computing and AI
👁️ Computer Vision · Content type: Academic · news.mit.edu · 5 days ago
Designer babies. Self-improving AI. Are we ready for either?
🔬 Science · Content type: News · vox.com · 10 hours ago
OpenClaw Won: How Big Tech Adopted the AI Agent
🚀 Startups · thelettertwo.com · 2 days ago
Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…
🤖 LLMs · Content type: Blog · medium.com · 5 days ago
Less-relevant results
Is the Space Pope Reptilian?
🤖 AI · Content type: News · tearsinrain.ai · 9 hours ago · Hacker News
Neglected Basics of AI Alignment
🎮 Reinforcement Learning · lesswrong.com · 3 days ago
A free diagnostic for the Claude Certified Architect exam
🤖 AI · Content type: Discussion, Tutorial · claudecertifiedarchitects.com · 1 day ago · Hacker News