Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
model alignment, guardrails, responsible AI, AI red teaming
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
390
posts in
17.8
ms
🤖
AI Agents
Turing Post
·
6d
6 days ago
How
Responsible
AI
Changes In The Agent Era
Covers
EU Artificial Intelligence Act
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for How Responsible AI Changes In The Agent Era
🤖
AI Agents
Check Point Blog
·
2d
2 days ago
From
Prompt
Testing to
AI
Red
Teaming at Enterprise Scale
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for From Prompt Testing to AI Red Teaming at Enterprise Scale
🧠
LLMs
arXiv
·
16h
16 hours ago
Prompt
Injection
in Automated R\'esum\'e Screening with Large Language
Models
: Single and
Multi-Injection
Settings
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Prompt Injection in Automated R\'esum\'e Screening with Large Language Models: Single and Multi-Injection Settings
🧠
LLMs
giskard.ai
·
2d
2 days ago
Giskard: LLM esting platform for preventing hallucinations and security issues
Covers
3 stories
See all stories this covers
including
Garak, LLM Vulnerability Scanner
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Giskard: LLM esting platform for preventing hallucinations and security issues
🤖
AI Agents
CleanTechnica
·
17h
17 hours ago
The Real
AI
Safety
Discussion That Just Isn’t Happening
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Real AI Safety Discussion That Just Isn’t Happening
🤖
AI Agents
medium.com
·
3d
3 days ago
The Role of HR in
Responsible
AI
Adoption
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Role of HR in Responsible AI Adoption
🤖
AI Agents
beSpacific
·
1d
1 day ago
Prompt
Injection
: What Lawyers Considering Agentic
AI
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Prompt Injection: What Lawyers Considering Agentic AI
✍️
Prompt Engineering
codeberg.org
·
21h
21 hours ago
Powercode
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Powercode
✍️
Prompt Engineering
role-confusion.github.io
·
4d
4 days ago
A Theory of Why
Prompt
Injection
Works
Covers
3 stories
See all stories this covers
including
Playwright MCP Server – Snapshot based – faster and more reliable than images
Covered by
8 sources
See all sources covering this story
including
Schneier on Security
,
Simon Willison’s Weblog
Discussed on
Hacker News
and
Lobsters
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for A Theory of Why Prompt Injection Works
🔭
Observability
SentinelOne
·
3d
3 days ago
macOS.Gaslight | Rust Backdoor Turns
Prompt
Injection
on the Analyst, Not the Sandbox
Covers
2 stories
See all stories this covers
including
Mini Shai-Hulud, Miasma, and Hades Worms Target Bioinformatics and MCP Developers via Malicious PyPI Wheels
Covered by
14 sources
See all sources covering this story
including
BleepingComputer
,
SecurityWeek
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for macOS.Gaslight | Rust Backdoor Turns Prompt Injection on the Analyst, Not the Sandbox
🤖
AI Agents
medium.com
·
1d
1 day ago
Healthcare
AI
Governance:
AI
Doesn’t Fail. Poor Governance Does.
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Healthcare AI Governance: AI Doesn’t Fail. Poor Governance Does.
✍️
Prompt Engineering
meetcyber.net
·
12h
12 hours ago
Prompt
Injection
vs
Jailbreaking
Explained in 4 Minutes
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Prompt Injection vs Jailbreaking Explained in 4 Minutes
✍️
Prompt Engineering
fernandoi.cl
·
17h
17 hours ago
What happened after 2k people tried to hack my
AI
assistant
Covered by
Simon Willison’s Weblog
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for What happened after 2k people tried to hack my AI assistant
🧠
LLMs
medium.com
·
1d
1 day ago
ChatGPT Generates Gruesome, Explicit Images of Women When
Guardrails
Fail, My Research Shows
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for ChatGPT Generates Gruesome, Explicit Images of Women When Guardrails Fail, My Research Shows
🏗️
AI Infra
medium.com
·
5h
5 hours ago
The Next Challenge in
AI
Safety
: Image Veracity
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Next Challenge in AI Safety: Image Veracity
✍️
Prompt Engineering
WIRED
·
19h
19 hours ago
Anthropic Thinks Its Own Success Is Key to Making
AI
Safe
Covers
2 stories
See all stories this covers
including
Claude's Constitution
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Anthropic Thinks Its Own Success Is Key to Making AI Safe
🤖
AI Agents
GitHub
·
2d
2 days ago
Show HN: Lelu – gate OpenAI agent actions on confidence and
prompt
injection
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Lelu – gate OpenAI agent actions on confidence and prompt injection
✍️
Prompt Engineering
4sysops
·
4d
4 days ago
Malicious npm and PyPI packages use
prompt
injection
to bypass
AI
security scanners
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Malicious npm and PyPI packages use prompt injection to bypass AI security scanners
🤖
AI Agents
EDB
·
2d
2 days ago
Inside EDB’s New Principles for
Responsible
AI
: Sovereign, Governed, Trusted and Beneficial
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Inside EDB’s New Principles for Responsible AI: Sovereign, Governed, Trusted and Beneficial
🧠
LLMs
Above the Law
·
20h
20 hours ago
No Points For Held Tongues — See Also
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for No Points For Held Tongues — See Also
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report