AI Security

Feeds to Scour
SubscribedAll
Scoured 348 posts in 8.4 ms

PI-Hunter: Automated Red-Teaming for Exposing and Localizing Prompt Injections

 🪄Prompt Engineering  Content type: Academic
arxiv.org·

AI red teaming comes of age

 🪄Prompt Engineering

The Quest To Find The Next Big Communicators In AI Safety

 🪄Prompt Engineering
lesswrong.com·

The Fable 5 Jailbreak Shows Why AI Guardrails Alone Are Not Enough

 💉Prompt Injection  Content type: Blog
agilehunt.com··Hacker News

Compromise OpenClaw with Prompt Injections in Message Objects | Imperva

 🪄Prompt Engineering  Content type: Blog

Trump’s AI security order acknowledges risks but stops short of regulating industry

 🤖AI
theconversation.com·

AI Pentesting Roadmap: Labs, Challenges, Writeups & Research

 🪄Prompt Engineering  Content type: Blog
osintteam.blog
·

WARNING: An AI Safety Blind Spot That Could Cost Lives

 👨‍💻AI Coding  Content type: Blog
medium.com
·

AI Agent Security Guide: How to Prevent Prompt Injection Attack

 💉Prompt Injection  Content type: Blog
medium.com
·

US government forces Anthropic to disable Claude Fable 5 and Mythos 5 for all customers worldwide

 💉Prompt Injection
the-decoder.com
·

sinewaveai/prooflayer-rules: Open-source runtime security rules engine for MCP servers and AI agents. Detects prompt injection, command injection, jailbreaks, and data exfiltration.

 💉Prompt Injection  Content type: Code
github.com··Hacker News

Infosecurity Europe: Prompt Injection Remains Unsolved, OWASP Researcher Warns

 🪄Prompt Engineering  Content type: News

Detecting AI-specific threats in Claude Enterprise from the Compliance API: a prefilter + LLM-as-judge pipeline with Sigma rules

 💉Prompt Injection
papermtn.co.uk··r/netsec

RoboHack AI CTF (Robotic Hacking Community at DEFCON 34)

 🪄Prompt Engineering
ctftime.org·

My last observation re: Anthropic's sabotage

 🕷️Web Crawling
xcancel.com··Hacker News

Indirect Prompt Injection remains a fundamental security challenge for AI

 💉Prompt Injection  Content type: Blog
brave.com·

Human psychology tricks can bypass AI safety guardrails

 💉Prompt Injection  Content type: News
psypost.org·

AI Security: explanation to Exploitation || Part 1

 💉Prompt Injection
infosecwriteups.com
·

WebMCP Can Be Used To Hijack AI Agents, Chrome Warns via @sejournal, @martinibuster

 🪄Prompt Engineering

Why OpenAI is disabling ChatGPT web access to fight prompt injection attacks

 🪄Prompt Engineering  Content type: News
livemint.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help