🛡️ AI Safety
AI safety, alignment, RLHF, AI ethics, superalignment
Scoured 333 posts in 6.9 ms
The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
🧠 LLMs
Content type: Academic
arxiv.org · 1 day ago
AI Governance Tools: How To Achieve Compliance and Visibility
⚙️ Workflow Automation
Content type: Blog
blog.n8n.io · 6 hours ago
Learnings from starting an AI safety research team
🔶 Claude
lesswrong.com · 5 days ago
What is AI Governance? (10 minute read)
🤖 AI Agents
Content type: Blog
docker.com · 2 days ago
Shadow AI Governance: How to Secure Employee AI Use in 2026
🤖 AI Agents
Content type: Blog
cswithsanjay.blogspot.com · 17 hours ago
Mechanistic Interpretability: The Key to Trusting Agentic AI
🤖 AI Agents
Content type: Discussion
bradenkelley.com · 4 days ago
My Oslo Freedom Forum Keynote: Authoritarians and AI
🛠️ AI Tooling
Content type: Blog
redpacket.substack.com · 1 day ago · Substack
Musk's xAI accused of illegally firing engineer who raised safety concerns
✍️ Prompt Engineering
Content type: News
ca.finance.yahoo.com · 4 hours ago
Veeam Adds Three Agentic AI Agents to the DataAI Command Platform for Privacy and AI Governance
🤖 AI Agents
storagereview.com · 2 hours ago
Ethical Considerations and AI Governance
🤖 AI Agents
Content type: Blog
blog.domb.net · 1 day ago
From oversight to coercion: How authoritarian governments are twisting AI safety to get tech companies to fall in line
🔶 Claude
theconversation.com · 6 days ago
Advanced AI Safety Addendum
🛠️ AI Tooling
cloud.google.com · 1 day ago · Hacker News
AI giant says its own models could soon improve themselves — and now it wants a global pause
🔶 Claude
thecooldown.com · 9 hours ago
new mantra just dropped
✨ Vibe Coding
aphie.xyz · 10 hours ago
Germany to create AI safety agency
💡 AI
techxplore.com · 1 day ago
Claude Fable 5 and new AI safety fables
🔶 Claude
Content type: News
interconnects.ai · 23 hours ago · Hacker News
The technical community can't be the main character in AI safety anymore
✍️ Prompt Engineering
substackcdn.com · 3 days ago · Substack
Agentic AI Governance: Designing for Accountability and Control | The JetBrains AI Blog
🤖 AI Agents
Content type: Blog
blog.jetbrains.com · 22 hours ago
Mankirat47/Dao-Heart-v3.14: Dao Heart v3.14 : a bounded symbolic AI value governance research scaffold for studying value drift, oversight, warmth preservation, and identity stability under pressure.
🏺 Hermes
Content type: Code
github.com · 2 hours ago · Hacker News
OpenAI says it will comply with Trump's order to let the government review AI models before release
🛠️ AI Tooling
qz.com · 5 days ago