Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Prompt Injection
💉 Prompt Injection
Specific
prompt injection attack, LLM security, jailbreak, AI vulnerability
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
283
posts in
122.5
ms
🛡️
AI Security
arXiv
·
2d
2 days ago
When AUC 0.998 Is Not Enough: A Candidate Evaluation Protocol for Hidden-State Probes of
Indirect
Prompt
Injection
in Multimodal Computer-Use Agents
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for When AUC 0.998 Is Not Enough: A Candidate Evaluation Protocol for Hidden-State Probes of Indirect Prompt Injection in Multimodal Computer-Use Agents
🛡️
AI Security
arXiv
·
2d
2 days ago
DE-FIVE: Detecting Malicious Image
Prompts
via Fourier Features and Image Vector Embeddings
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for DE-FIVE: Detecting Malicious Image Prompts via Fourier Features and Image Vector Embeddings
🛡️
AI Security
arXiv
·
2d
2 days ago
GIF: Locally Sound Geometric Information Flow Control for LLMs
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for GIF: Locally Sound Geometric Information Flow Control for LLMs
🕳
LLM Vulnerabilities
arXiv
·
1d
1 day ago
LLMs
Prompted
for Legal Context Object More: Overrefusal from Small On-Premises LLMs in Criminal Legal Context
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLMs Prompted for Legal Context Object More: Overrefusal from Small On-Premises LLMs in Criminal Legal Context
🕳
LLM Vulnerabilities
arXiv
·
2d
2 days ago
TROPT: An Open Framework for Unifying and Advancing Discrete Text Optimization
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for TROPT: An Open Framework for Unifying and Advancing Discrete Text Optimization
🕳
LLM Vulnerabilities
arXiv
·
2d
2 days ago
Scalable Hierarchical Attention Transformers for Multi-Turn
Jailbreak
Detection in Long Conversations
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Scalable Hierarchical Attention Transformers for Multi-Turn Jailbreak Detection in Long Conversations
🧠
LLM
arXiv
·
6d
6 days ago
A Layered
Security
Framework Against
Prompt
Injection
in RAG-Based Chatbots
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for A Layered Security Framework Against Prompt Injection in RAG-Based Chatbots
🕳
LLM Vulnerabilities
arXiv
·
6d
6 days ago
SafeSpec: Fast and Safe
LLM
via Dynamic Reflective Sampling
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for SafeSpec: Fast and Safe LLM via Dynamic Reflective Sampling
🕳
LLM Vulnerabilities
arXiv
·
6d
6 days ago
Analyzing Defensive Misdirection Against
Model-Guided
Automated
Attacks
on Agentic
AI
Systems
Covered by
DEV Community
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Analyzing Defensive Misdirection Against Model-Guided Automated Attacks on Agentic AI Systems
💬
LLM Prompting
role-confusion.github.io
·
3d
3 days ago
A Theory of Why
Prompt
Injection
Works
Covers
3 stories
See all stories this covers
including
Playwright MCP Server – Snapshot based – faster and more reliable than images
Covered by
8 sources
See all sources covering this story
including
Simon Willison’s Weblog
,
Schneier on Security
Discussed on
Hacker News
and
Lobsters
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for A Theory of Why Prompt Injection Works
🛡️
AI Security
Schneier on Security
·
13h
13 hours ago
Interesting Paper Exploring
Prompt
Injection
Covers
3 stories
See all stories this covers
including
A Theory of Why Prompt Injection Works
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Interesting Paper Exploring Prompt Injection
🕳
LLM Vulnerabilities
arXiv
·
6d
6 days ago
LLM
agent safety, multi-turn red-teaming,
jailbreak
benchmarks,
adversarial
robustness, safety-critical systems
Covered by
DEV Community
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLM agent safety, multi-turn red-teaming, jailbreak benchmarks, adversarial robustness, safety-critical systems
« Page 1
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report