Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
AI safety, alignment, RLHF, AI ethics, superalignment
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
341
posts in
12.6
ms
The Neutral Mask: How
RLHF
Provides Shallow
Alignment
while Leaving Partisan Structure Intact in a Large Language Model
🧠
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
AI
Governance
Tools: How To Achieve Compliance and Visibility
⚙️
Workflow Automation
Content type:
Blog
blog.n8n.io
·
9h
9 hours ago
Actions for AI Governance Tools: How To Achieve Compliance and Visibility
Learnings from starting an
AI
safety
research team
🔶
Claude
lesswrong.com
·
5d
5 days ago
Actions for Learnings from starting an AI safety research team
xAI fired an engineer who raised alarms about Grok
safety
, new lawsuit claims
✍️
Prompt Engineering
techcrunch.com
·
2h
2 hours ago
Actions for xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims
My Oslo Freedom Forum Keynote: Authoritarians and
AI
🛠️
AI Tooling
Content type:
Blog
redpacket.substack.com
·
2d
2 days ago
·
Substack
Actions for My Oslo Freedom Forum Keynote: Authoritarians and AI
Shadow
AI
Governance
: How to Secure Employee
AI
Use in 2026
🤖
AI Agents
Content type:
Blog
cswithsanjay.blogspot.com
·
20h
20 hours ago
Actions for Shadow AI Governance: How to Secure Employee AI Use in 2026
What is
AI
Governance
? (10 minute read)
🤖
AI Agents
Content type:
Blog
docker.com
·
3d
3 days ago
Actions for What is AI Governance? (10 minute read)
Ethical
Considerations and
AI
Governance
🤖
AI Agents
Content type:
Blog
blog.domb.net
·
1d
1 day ago
Actions for Ethical Considerations and AI Governance
When
AI
Fails, What Actually Failed? The Distinction
AI
Governance
Keeps Missing
🤖
AI Agents
techpolicy.press
·
11h
11 hours ago
Actions for When AI Fails, What Actually Failed? The Distinction AI Governance Keeps Missing
Advanced
AI
Safety
Addendum
🛠️
AI Tooling
cloud.google.com
·
1d
1 day ago
·
Hacker News
Actions for Advanced AI Safety Addendum
Mechanistic
Interpretability
: The Key to Trusting Agentic
AI
🤖
AI Agents
Content type:
Discussion
bradenkelley.com
·
4d
4 days ago
Actions for Mechanistic Interpretability: The Key to Trusting Agentic AI
Musk's xAI accused of illegally firing engineer who raised
safety
concerns
✍️
Prompt Engineering
Content type:
News
ca.finance.yahoo.com
·
7h
7 hours ago
Actions for Musk's xAI accused of illegally firing engineer who raised safety concerns
Veeam Adds Three Agentic
AI
Agents to the DataAI Command Platform for Privacy and
AI
Governance
🤖
AI Agents
storagereview.com
·
4h
4 hours ago
Actions for Veeam Adds Three Agentic AI Agents to the DataAI Command Platform for Privacy and AI Governance
Germany to create
AI
safety
agency
💡
AI
techxplore.com
·
1d
1 day ago
Actions for Germany to create AI safety agency
AI
giant says its own models could soon improve themselves — and now it wants a global pause
🔶
Claude
thecooldown.com
·
12h
12 hours ago
Actions for AI giant says its own models could soon improve themselves — and now it wants a global pause
From oversight to coercion: How authoritarian governments are twisting
AI
safety
to get tech companies to fall in line
🔶
Claude
theconversation.com
·
6d
6 days ago
Actions for From oversight to coercion: How authoritarian governments are twisting AI safety to get tech companies to fall in line
new mantra just dropped
✨
Vibe Coding
aphie.xyz
·
13h
13 hours ago
Actions for new mantra just dropped
Anand Rathore The Man Who Invented AIWoW —
AI
Ways of Working™: The Only Blueprint That Can Save the Planet From the
AI
Carbon Crisis and Pollution
✨
Vibe Coding
easternherald.com
·
2d
2 days ago
Actions for Anand Rathore The Man Who Invented AIWoW — AI Ways of Working™: The Only Blueprint That Can Save the Planet From the AI Carbon Crisis and Pollution
Mankirat47/Dao-Heart-v3.14: Dao Heart v3.14 : a bounded symbolic
AI
value
governance
research scaffold for studying value drift, oversight, warmth preservation, and identity stability under pressure.
🏺
Hermes
Content type:
Code
github.com
·
5h
5 hours ago
·
Hacker News
Actions for Mankirat47/Dao-Heart-v3.14: Dao Heart v3.14 : a bounded symbolic AI value governance research scaffold for studying value drift, oversight, warmth preservation, and identity stability under pressure.
The technical community can't be the main character in
AI
safety
anymore
✍️
Prompt Engineering
substackcdn.com
·
3d
3 days ago
·
Substack
Actions for The technical community can't be the main character in AI safety anymore
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help