Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
AI alignment, AI safety, AI risk, RLHF, constitutional AI
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
233
posts in
20.2
ms
From oversight to coercion: How authoritarian governments are twisting
AI
safety
to get tech companies to fall in line
🧠
AI
theconversation.com
·
5d
5 days ago
Actions for From oversight to coercion: How authoritarian governments are twisting AI safety to get tech companies to fall in line
Paving the way for agents in biology
🤖
AI Agents
anthropic.com
·
1d
1 day ago
·
Hacker News
Actions for Paving the way for agents in biology
The technical community can't be the main character in
AI
safety
anymore
🧠
AI
substackcdn.com
·
3d
3 days ago
·
Substack
Actions for The technical community can't be the main character in AI safety anymore
AI
, at a Crossroads
🧠
AI
Content type:
News
Content type:
Blog
edgyoptimist.substack.com
·
19h
19 hours ago
·
Substack
Actions for AI, at a Crossroads
SLUUG Talk: Demystifying
Large
Language
Models
on Linux
💬
LLMs
Content type:
Code
github.com
·
3d
3 days ago
·
DEV
Actions for SLUUG Talk: Demystifying Large Language Models on Linux
Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
💬
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
AI
red teaming comes of age
🟢
OpenAI
csoonline.com
·
2h
2 hours ago
Actions for AI red teaming comes of age
Complex Objects: Why
AI
Safety
Can’t Just Think in Posts
🤖
AI Agents
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for Complex Objects: Why AI Safety Can’t Just Think in Posts
Ted Lieu slams bipartisan
AI
proposal
🧠
AI
politico.com
·
19h
19 hours ago
Actions for Ted Lieu slams bipartisan AI proposal
How valuable are weak
AI
safety
regulations?
🧠
AI
lesswrong.com
·
1d
1 day ago
Actions for How valuable are weak AI safety regulations?
AI
Scientist Bengio: Building Systems We Don't Know How to Control
🤖
AI Agents
Content type:
News
bloomberg.com
·
5d
5 days ago
Actions for AI Scientist Bengio: Building Systems We Don't Know How to Control
'World of
AI
is very different': Ashwini Vaishnaw sees need for new
AI
law in India | Today News
💻
Tech
Content type:
News
livemint.com
·
8h
8 hours ago
Actions for 'World of AI is very different': Ashwini Vaishnaw sees need for new AI law in India | Today News
Lawmakers Are Aiming To Regulate
AI-Builds-AI
Before
AI
Gets Entirely Beyond Human Control
🧠
AI
forbes.com
·
1d
1 day ago
Actions for Lawmakers Are Aiming To Regulate AI-Builds-AI Before AI Gets Entirely Beyond Human Control
China may move toward U.S. path on
AI
as firms poach employees
🔵
Google AI
Content type:
News
cnbc.com
·
5d
5 days ago
Actions for China may move toward U.S. path on AI as firms poach employees
AI
Safety
— Genuine or Performative?
🧠
AI
Content type:
Blog
medium.com
·
4d
4 days ago
Actions for AI Safety — Genuine or Performative?
I used ChatGPT and Gemini side-by-side for a month on Android, and only one behaved like a senior
AI
tool
🔷
Anthropic
androidpolice.com
·
2d
2 days ago
Actions for I used ChatGPT and Gemini side-by-side for a month on Android, and only one behaved like a senior AI tool
Claude Fable 5: Anthropic releases a '
safe
' version of Claude Mythos
🔷
Anthropic
Content type:
News
mashable.com
·
17h
17 hours ago
Actions for Claude Fable 5: Anthropic releases a 'safe' version of Claude Mythos
Anthropic releases a version of its vaunted Mythos
model
to developers
🔷
Anthropic
fastcompany.com
·
18h
18 hours ago
Actions for Anthropic releases a version of its vaunted Mythos model to developers
What Will Canada’s
AI
Strategy Mean for Jobs and
Safety
?
🧠
AI
Content type:
News
thetyee.ca
·
5d
5 days ago
Actions for What Will Canada’s AI Strategy Mean for Jobs and Safety?
A Unifying Lens on Reward Uncertainty in
RLHF
💬
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for A Unifying Lens on Reward Uncertainty in RLHF
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help