Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
Alignment, Interpretability, Adversarial Examples, Ethics
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
231
posts in
50.0
ms
⚠️
Information Hazards
lesswrong.com
·
3d
3 days ago
AI
Safety
Ecosystem Research notes
CoversÂ
2Â stories
See all stories this covers
 includingÂ
Looking for a Developer for Gork3 Discord Trading Server (Unpaid, But Valuable Opportunity!)
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for AI Safety Ecosystem Research notes
🌍
Civilizational Risk
E-International Relations
·
8h
8 hours ago
Interview – Andrea Miotti
CoversÂ
2Â stories
See all stories this covers
 includingÂ
When AI Builds Itself
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Interview – Andrea Miotti
🎯
AI Alignment
medium.com
·
2d
2 days ago
What I Learned Studying Whether Fine-Tuning Breaks a Transformer’s “Copy
Mechanism
”
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for What I Learned Studying Whether Fine-Tuning Breaks a Transformer’s “Copy Mechanism”
đźŽ
Anthropic Claude
notes.seantaylor.work
·
13h
13 hours ago
Monsters and Mirrors
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Monsters and Mirrors
đźŽ
Anthropic Claude
tehnologijaviews.medium.com
·
2d
2 days ago
Is the US Government’s Anthropic Ban Actually Helping the Brand? A Surprising Turn in
AI
Regulation
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Is the US Government’s Anthropic Ban Actually Helping the Brand? A Surprising Turn in AI Regulation
⚠️
Information Hazards
science.org
·
3d
3 days ago
Researchers caught in the crossfire as companies and government grapple over
AI
safety
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Researchers caught in the crossfire as companies and government grapple over AI safety
🔬
Anthropic
stevekinney.com
·
3d
3 days ago
Some Thoughts on
AI
Safety
CoversÂ
10Â stories
See all stories this covers
 includingÂ
Goodhart's Law
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Some Thoughts on AI Safety
🌍
Civilizational Risk
lesswrong.com
·
15h
15 hours ago
On revolutionary love in
AI
safety
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for On revolutionary love in AI safety
⚠️
Information Hazards
medium.com
·
3d
3 days ago
Ninety Percent of Physicians Trust Their Clinical
AI
. They Catch a Third of Its Dangerous Errors.
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Ninety Percent of Physicians Trust Their Clinical AI. They Catch a Third of Its Dangerous Errors.
🤖
AI
TechRadar
·
6d
6 days ago
'
AI
will probably most likely lead to the end of the world, but in the meantime, there’ll be great companies' - quote of the day by OpenAI CEO Sam Altman
CoversÂ
Sam Altman May Control Our Future—Can He Be Trusted?
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for 'AI will probably most likely lead to the end of the world, but in the meantime, there’ll be great companies' - quote of the day by OpenAI CEO Sam Altman
🔬
Anthropic
lesswrong.com
·
1d
1 day ago
The Cookie Monster Explains
AI
Safety
CoversÂ
10Â stories
See all stories this covers
 includingÂ
Anthropic confidentially submits draft S-1 to the SEC
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Cookie Monster Explains AI Safety
⚖️
Ethics
ft.com
·
3d
3 days ago
Letter: Argentina’s
AI
fix widens the gap it is meant to close
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Letter: Argentina’s AI fix widens the gap it is meant to close
🤖
AI Coding Tools
briefing.forwardfuture.ai
·
5d
5 days ago
AI
Superintelligence
, ChatGPT Slips, & SpaceX's
AI
Deal
CoversÂ
4Â stories
See all stories this covers
 includingÂ
ChatGPT’s market share slips below 50% for first time
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for AI Superintelligence, ChatGPT Slips, & SpaceX's AI Deal
🌍
Civilizational Risk
lesswrong.com
·
2d
2 days ago
Thoughts on Likelihood of Existential Risks by Misaligned
AIs
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Thoughts on Likelihood of Existential Risks by Misaligned AIs
⚖️
AI Regulation
highcapacity.org
·
6d
6 days ago
Mythos, China, and a New Era of
AI
Regulation
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Mythos, China, and a New Era of AI Regulation
⚠️
Information Hazards
CNBC
·
4d
4 days ago
Synthesia CEO: Creating a coalition around an
AI
code of conduct will help build the
AI
future we all want
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Synthesia CEO: Creating a coalition around an AI code of conduct will help build the AI future we all want
đź’»
Tech News
The Blog of Author Tim Ferriss
·
5d
5 days ago
Sebastian Mallaby, Biographer of Demis Hassabis — Lessons from 100+
AI
Insiders on The Race to
Superintelligence
, The Religion of
AI
, and Spotting Breakthroughs...
CoversÂ
19Â stories
See all stories this covers
 includingÂ
Claude Fable 5 and Claude Mythos 5
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Sebastian Mallaby, Biographer of Demis Hassabis — Lessons from 100+ AI Insiders on The Race to Superintelligence, The Religion of AI, and Spotting Breakthroughs...
🛡️
LLM Security
tehnologijaviews.medium.com
·
4d
4 days ago
The Trump Administration’s Push to Block
AI
Jailbreaks: A
Safety
Measure or Political Theater?
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Trump Administration’s Push to Block AI Jailbreaks: A Safety Measure or Political Theater?
⚠️
Information Hazards
lesswrong.com
·
6d
6 days ago
Tips for Cracking the
AI
Safety
Technical Interview
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tips for Cracking the AI Safety Technical Interview
⚠️
Information Hazards
lesswrong.com
·
3d
3 days ago
A brief list of ways
AI
safety
efforts could be net negative
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for A brief list of ways AI safety efforts could be net negative
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report