Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Alignment
🎯 Alignment
Broad
AI Safety, Constitutional AI, Value Alignment, Preference Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
233
posts in
6.6
ms
OpenAI says it will comply with Trump's order to let the government review
AI
models before release
🔬
Interpretability
qz.com
·
5d
5 days ago
Actions for OpenAI says it will comply with Trump's order to let the government review AI models before release
Anthropic releases Mythos-derived model with cyber guardrails
🦾
Embodied AI
metacurity.com
·
13h
13 hours ago
Actions for Anthropic releases Mythos-derived model with cyber guardrails
Assessing the Polyglot Chatbot: Multilingual
Safety
in
AI
Systems
🎨
Multimodal AI
cdt.org
·
1d
1 day ago
Actions for Assessing the Polyglot Chatbot: Multilingual Safety in AI Systems
More than automation: What agentic
AI
means for person-centred government
🤖
Agent
Content type:
News
themandarin.com.au
·
20h
20 hours ago
Actions for More than automation: What agentic AI means for person-centred government
AI
Safety
— Genuine or Performative?
🦾
Embodied AI
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for AI Safety — Genuine or Performative?
Anthropic’s Dario Amodei wants governments to have the power to block ‘dangerous’
AI
systems
🦾
Embodied AI
siliconangle.com
·
57m
57 minutes ago
Actions for Anthropic’s Dario Amodei wants governments to have the power to block ‘dangerous’ AI systems
The Best Politician In A Generation
🦾
Embodied AI
Content type:
News
Content type:
Blog
benthams.substack.com
·
1d
1 day ago
·
Substack
Actions for The Best Politician In A Generation
Anthropic Urges Governments to Secure Power to Halt Dangerous
AI
🦾
Embodied AI
pymnts.com
·
35m
35 minutes ago
Actions for Anthropic Urges Governments to Secure Power to Halt Dangerous AI
new mantra just dropped
🎨
Multimodal AI
aphie.xyz
·
15h
15 hours ago
Actions for new mantra just dropped
Paving the way for agents in biology
🤖
Agent
anthropic.com
·
2d
2 days ago
·
Hacker News
Actions for Paving the way for agents in biology
AI
Scientist Bengio: Building Systems We Don't Know How to Control
🤖
Agent
Content type:
News
bloomberg.com
·
6d
6 days ago
Actions for AI Scientist Bengio: Building Systems We Don't Know How to Control
I Started an
AI
Safety
Research Org and Think These 7 Things Matter
💾
Memory Systems
lesswrong.com
·
12h
12 hours ago
Actions for I Started an AI Safety Research Org and Think These 7 Things Matter
Criti-hyping is the best thing that happened to Big Tech
🧠
Neuroscience
reveriesofahuman.com
·
1d
1 day ago
Actions for Criti-hyping is the best thing that happened to Big Tech
Hidden Consensus:
Preference-Validity
Compression in Human Feedback
💾
Memory Systems
Content type:
Academic
arxiv.org
·
22h
22 hours ago
Actions for Hidden Consensus:Preference-Validity Compression in Human Feedback
What Will Canada’s
AI
Strategy Mean for Jobs and
Safety
?
🔬
Interpretability
Content type:
News
thetyee.ca
·
6d
6 days ago
Actions for What Will Canada’s AI Strategy Mean for Jobs and Safety?
Cisco
AI
Defense Policy Studio: Turning Unwritten Policy into Adaptive
AI
Guardrails
🤖
Agent
Content type:
Blog
blogs.cisco.com
·
2h
2 hours ago
Actions for Cisco AI Defense Policy Studio: Turning Unwritten Policy into Adaptive AI Guardrails
Mankirat47/Dao-Heart-v3.14: Dao Heart v3.14 : a bounded symbolic
AI
value
governance research scaffold for studying
value
drift,
oversight
, warmth preservation, and identity stability under pressure.
🔬
Interpretability
Content type:
Code
github.com
·
7h
7 hours ago
·
Hacker News
Actions for Mankirat47/Dao-Heart-v3.14: Dao Heart v3.14 : a bounded symbolic AI value governance research scaffold for studying value drift, oversight, warmth preservation, and identity stability under pressure.
White House Defangs
AI-Testing
Unit at the Worst Possible Time
🔬
Interpretability
gizmodo.com
·
5h
5 hours ago
Actions for White House Defangs AI-Testing Unit at the Worst Possible Time
Controversial smut as an
AI
alignment
issue
🦾
Embodied AI
Content type:
News
Content type:
Blog
thingofthings.substack.com
·
5d
5 days ago
·
Substack
Actions for Controversial smut as an AI alignment issue
Anthropic's Model Naming, Extrapolated
🔬
Interpretability
samwilkinson.io
·
1d
1 day ago
·
Hacker News
Actions for Anthropic's Model Naming, Extrapolated
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help