Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
model alignment, guardrails, responsible AI, AI red teaming
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
389
posts in
56.0
ms
🤖
AI Agents
tehnologijaviews.medium.com
·
6d
6 days ago
Is the US Government’s Anthropic Ban Actually Helping the Brand? A Surprising Turn in
AI
Regulation
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Is the US Government’s Anthropic Ban Actually Helping the Brand? A Surprising Turn in AI Regulation
✍️
Prompt Engineering
GitHub
·
1d
1 day ago
Show HN: SentryGuard – detect Agentjacking
prompt
injection
in Sentry events
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: SentryGuard – detect Agentjacking prompt injection in Sentry events
đź’ł
Fintech
Music Business Worldwide
·
22h
22 hours ago
UAE’s MusicNation joins the Human Artistry Campaign as its first Middle East signatory
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for UAE’s MusicNation joins the Human Artistry Campaign as its first Middle East signatory
🤖
AI Agents
latent.space
·
3d
3 days ago
Red-Teaming
after Mythos — Zico Kolter & Matt Fredrikson, Gray Swan
CoversÂ
The lethal trifecta for AI agents: private data, untrusted content, and external communication
Covered byÂ
tldr.tech
,
contextmaestro.com
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Red-Teaming after Mythos — Zico Kolter & Matt Fredrikson, Gray Swan
✍️
Prompt Engineering
medium.com
·
2d
2 days ago
Why
prompt
injection
works: a Transformer-level view
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Why prompt injection works: a Transformer-level view
🤖
AI Agents
TechRadar
·
1h
1 hour ago
Know your agent: building the foundation of autonomous commerce
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Know your agent: building the foundation of autonomous commerce
🗄️
Vector Databases
arXiv
·
1d
1 day ago
Yuvion VL: A Multimodal Foundation
Model
for
Adversarial
Content and
AI
Safety
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Yuvion VL: A Multimodal Foundation Model for Adversarial Content and AI Safety
🤖
AI Agents
SiliconANGLE
·
3d
3 days ago
Nvidia introduces Halos for Robotics to bridge the physical
AI
safety
gap
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Nvidia introduces Halos for Robotics to bridge the physical AI safety gap
🏗️
AI Infra
Phys.org
·
11h
11 hours ago
New research outlines human-centered
AI
framework for online student success
CoversÂ
2Â stories
See all stories this covers
 includingÂ
Andrew Zinin - Science X
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for New research outlines human-centered AI framework for online student success
✍️
Prompt Engineering
joshs.bearblog.dev
·
1d
1 day ago
How not to think about risk
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for How not to think about risk
✍️
Prompt Engineering
medium.com
·
5d
5 days ago
Fictional Framing Part 3: Does the Fix Generalize, or Did I Just Patch One Sentence?
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Fictional Framing Part 3: Does the Fix Generalize, or Did I Just Patch One Sentence?
🏗️
AI Infra
Business Insider
·
1d
1 day ago
A New York primary winner has a defiant message for OpenAI and Anthropic
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for A New York primary winner has a defiant message for OpenAI and Anthropic
🤖
AI Agents
Financial Times
·
14h
14 hours ago
Ethical
AI
rows open way to wave of litigation
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Ethical AI rows open way to wave of litigation
🎯
Post-training
fareedkhan-dev.github.io
·
5d
5 days ago
Train
LLM from Scratch
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Train LLM from Scratch
🤖
AI Agents
Science
·
16h
16 hours ago
Researchers caught in the crossfire as firms and U.S. government grapple over
AI
safety
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Researchers caught in the crossfire as firms and U.S. government grapple over AI safety
✍️
Prompt Engineering
Anthropic
·
2d
2 days ago
Claude Tag
CoversÂ
2Â stories
See all stories this covers
 includingÂ
Agent identity in Claude Tag: a new access model for autonomous, team-wide AI
Covered byÂ
32Â sources
See all sources covering this story
 includingÂ
The Rundown AI
,
9to5Mac
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Claude Tag
🤖
AI Agents
MinnPost
·
18h
18 hours ago
AI
can’t be trustworthy without data center transparency
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for AI can’t be trustworthy without data center transparency
đź”—
APIs
AWS
·
2d
2 days ago
Securing
AI-driven
APIs on AWS with Wallarm
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Securing AI-driven APIs on AWS with Wallarm
🤖
AI Agents
BGR
·
1d
1 day ago
Amateur Hacker Used Claude And OpenAI Agents To Hack 14 Companies
CoversÂ
2Â stories
See all stories this covers
 includingÂ
Claude Fable 5 and Claude Mythos 5
Covered byÂ
sh.itjust.works
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Amateur Hacker Used Claude And OpenAI Agents To Hack 14 Companies
đź’ł
Fintech
SecurityWeek
·
23h
23 hours ago
Runlayer Raises $30 Million in Series A Funding
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Runlayer Raises $30 Million in Series A Funding
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report