🛡️ AI Safety
alignment, RLHF, safety, interpretability
Clearing Up The Confusion About What Anthropic Really Said On Globally Pausing The Unrelenting Race Toward AI That Builds AI
🤖 AI Engineering · forbes.com · 2 days ago
My Oslo Freedom Forum Keynote: Authoritarians and AI
🤖 AI Engineering · Blog · redpacket.substack.com · 1 day ago · Substack
Anthropic calls for pause of global AI development
🤖 Robotics · techxplore.com · 5 days ago
DTEX adds AI Risk Management to track how agents and employees use AI
🤖 AI Engineering · siliconangle.com · 1 day ago
Hidden Consensus: Preference-Validity Compression in Human Feedback
🧠 LLM Research · Academic · arxiv.org · 15 hours ago
From oversight to coercion: How authoritarian governments are twisting AI safety to get tech companies to fall in line
🤖 AI Engineering · phys.org · 6 days ago
Phonies
🌐 Distributed Systems · lesswrong.com · 4 hours ago
Anthropic May Be Reconsidering the Pace of AI
🤖 Robotics · thinkingabout.ai · 1 day ago
Anthropic confronts the RSI clock
🎙️ Speech AI · therundown.ai · 5 days ago
Anthropic says Claude writes 80% of its own code and the world needs a plan to hit the brakes
🤖 AI Engineering · News · thenextweb.com · 5 days ago
All signs point to Trump pushing AI growth
🤖 AI Engineering · News · theguardian.com · 1 day ago
AI regulation in Africa: why copying the European model won’t work
🔮 Multimodal AI · theconversation.com · 5 hours ago
Agentic AI Risk Catches Eye of Financial Stability Board
🎯 Reinforcement Learning · pymnts.com · 3 hours ago
Germany to create AI safety agency
🔮 Multimodal AI · techxplore.com · 1 day ago
Anthropic: AI is advancing too fast to leave unchecked
🤖 AI Engineering · metacurity.com · 5 days ago
Assessing the Polyglot Chatbot: Multilingual Safety in AI Systems
🎙️ Speech AI · cdt.org · 22 hours ago
Paving the way for agents in biology
🤖 AI Engineering · anthropic.com · 2 days ago · Hacker News
Amazon and Google have billions riding on Anthropic. The IPO will finally reveal how much.
🤖 AI Engineering · fortune.com · 6 days ago
ToxicSkills Revisit: Loch Ness Levels of Mythical AI Risk
🧠 LLM Research · flyingpenguin.com · 2 days ago
Anthropic urges a way to pause AI development as risks grow with the tech advances
🤖 Robotics · the-journal.com · 5 days ago