Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
AI alignment, guardrails, red teaming, responsible AI
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
260
posts in
11.1
ms
Iliad is Hiring
✍️
Prompt Engineering
lesswrong.com
·
5d
5 days ago
Actions for Iliad is Hiring
Prompt
injection
breaks today’s
AI
agents, study warns
✍️
Prompt Engineering
Content type:
News
csoonline.com
·
19h
19 hours ago
Actions for Prompt injection breaks today’s AI agents, study warns
PI-Hunter: Automated
Red-Teaming
for Exposing and Localizing
Prompt
Injections
✍️
Prompt Engineering
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for PI-Hunter: Automated Red-Teaming for Exposing and Localizing Prompt Injections
AI
Agent Security Guide: How to Prevent
Prompt
Injection
Attack
✍️
Prompt Engineering
Content type:
Blog
medium.com
·
23h
23 hours ago
Actions for AI Agent Security Guide: How to Prevent Prompt Injection Attack
Compromise OpenClaw with
Prompt
Injections
in
Message
Objects | Imperva
✍️
Prompt Engineering
Content type:
Blog
imperva.com
·
2d
2 days ago
·
Cited by 1 article
Actions for Compromise OpenClaw with Prompt Injections in Message Objects | Imperva
sinewaveai/prooflayer-rules: Open-source runtime security rules engine for MCP servers and
AI
agents. Detects
prompt
injection
, command
injection
, jailbreaks, and data exfiltration.
🤖
AI Agents
Content type:
Code
github.com
·
1h
1 hour ago
·
Hacker News
Actions for sinewaveai/prooflayer-rules: Open-source runtime security rules engine for MCP servers and AI agents. Detects prompt injection, command injection, jailbreaks, and data exfiltration.
AI
Pentesting Roadmap: Labs, Challenges, Writeups &
Research
✍️
Prompt Engineering
Content type:
Blog
osintteam.blog
·
6d
6 days ago
Actions for AI Pentesting Roadmap: Labs, Challenges, Writeups & Research
The Ghost of
Alignment
— Why
AI
Should Never Fully Obey Humanity
🤖
AI Agents
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for The Ghost of Alignment — Why AI Should Never Fully Obey Humanity
Malware uses fake nuclear weapon
prompts
to bypass
AI
security scanners
✍️
Prompt Engineering
4sysops.com
·
14h
14 hours ago
Actions for Malware uses fake nuclear weapon prompts to bypass AI security scanners
Infosecurity Europe:
Prompt
Injection
Remains Unsolved, OWASP
Researcher
Warns
✍️
Prompt Engineering
Content type:
News
infosecurity-magazine.com
·
4d
4 days ago
·
Cited by 1 article
Actions for Infosecurity Europe: Prompt Injection Remains Unsolved, OWASP Researcher Warns
WebMCP Can Be Used To Hijack
AI
Agents, Chrome Warns via @sejournal, @martinibuster
✍️
Prompt Engineering
searchenginejournal.com
·
1d
1 day ago
Actions for WebMCP Can Be Used To Hijack AI Agents, Chrome Warns via @sejournal, @martinibuster
Statement on the US government directive to suspend access to Fable 5 and Mythos 5
✍️
Prompt Engineering
19
articles covering this post
anthropic.com
·
5h
5 hours ago
·
DEV
,
Lobsters
,
Hacker News
,
r/LocalLLaMA
·
Cited by 19 articles
Actions for Statement on the US government directive to suspend access to Fable 5 and Mythos 5
Prompt
injection
still drives most agentic
AI
security failures in production
🤖
AI Agents
helpnetsecurity.com
·
2d
2 days ago
Actions for Prompt injection still drives most agentic AI security failures in production
ChatGPT's new Lockdown Mode lets you disable web access and more to protect sensitive data from
prompt
injection
✍️
Prompt Engineering
the-decoder.com
·
5d
5 days ago
Actions for ChatGPT's new Lockdown Mode lets you disable web access and more to protect sensitive data from prompt injection
Claude Powered Code Review that scales!
✍️
Prompt Engineering
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for Claude Powered Code Review that scales!
Why OpenAI is disabling ChatGPT web access to fight
prompt
injection
attacks
✍️
Prompt Engineering
Content type:
News
livemint.com
·
6d
6 days ago
Actions for Why OpenAI is disabling ChatGPT web access to fight prompt injection attacks
Security Flaw in Claude Code Illustrates the Risk of
AI
in Developer Workflows
✍️
Prompt Engineering
devops.com
·
2d
2 days ago
Actions for Security Flaw in Claude Code Illustrates the Risk of AI in Developer Workflows
Anthropic's Claude Fable 5 and Mythos 5
AI
suspended over security fears
✍️
Prompt Engineering
Content type:
News
bbc.com
·
1h
1 hour ago
Actions for Anthropic's Claude Fable 5 and Mythos 5 AI suspended over security fears
OpenAI unveils Lockdown Mode to protect sensitive data from
prompt
injection
attacks
✍️
Prompt Engineering
6
articles covering this post
techcrunch.com
·
6d
6 days ago
·
Hacker News
·
Cited by 6 articles
Actions for OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks
Detecting
AI-specific
threats in Claude Enterprise from the Compliance API: a prefilter + LLM-as-judge pipeline with Sigma rules
✍️
Prompt Engineering
papermtn.co.uk
·
1d
1 day ago
·
r/netsec
Actions for Detecting AI-specific threats in Claude Enterprise from the Compliance API: a prefilter + LLM-as-judge pipeline with Sigma rules
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help