🕳 LLM Vulnerabilities
Specific
Hacking LLMs, Prompt Injection
Scoured 283 posts in 89.7 ms

🧠 LLM · arXiv · 1 day ago
AdversaBench: Automated LLM Red-Teaming with Multi-Judge Confirmation and Cross-Model Transferability

📊 LLM Evaluation · arXiv · 1 day ago
REALM: A Unified Red-Teaming Benchmark for Physical-World VLMs

💉 Prompt Injection · arXiv · 1 day ago
PixJail: Self-Evolving Paper-to-Pipeline Reproduction for Text-to-Image Jailbreak Evaluation

📊 LLM Evaluation · arXiv · 2 days ago
OTTER: A Red-Teaming System for Toxicity-Evading Jailbreak Prompt Optimization

🤖 LLM, Agent · arXiv · 6 days ago
LLM agent safety, multi-turn red-teaming, jailbreak benchmarks, adversarial robustness, safety-critical systems
Covered by DEV Community

💉 Prompt Injection · arXiv · 2 days ago
TROPT: An Open Framework for Unifying and Advancing Discrete Text Optimization

🧠 LLM · arXiv · 6 days ago
A Layered Security Framework Against Prompt Injection in RAG-Based Chatbots

💉 Prompt Injection · arXiv · 2 days ago
BELLS-O: Evaluating the Operational Trade-offs of LLM Supervision Systems

💉 Prompt Injection · arXiv · 2 days ago
Scalable Hierarchical Attention Transformers for Multi-Turn Jailbreak Detection in Long Conversations

💉 Prompt Injection · arXiv · 6 days ago
Analyzing Defensive Misdirection Against Model-Guided Automated Attacks on Agentic AI Systems
Covered by DEV Community

💬 LLM Prompting · role-confusion.github.io · 3 days ago
A Theory of Why Prompt Injection Works
Covers 3 stories, including Playwright MCP Server – Snapshot based – faster and more reliable than images
Covered by 8 sources, including Simon Willison’s Weblog, Schneier on Security
Discussed on Hacker News and Lobsters

🛡️ AI Security · Schneier on Security · 13 hours ago
Interesting Paper Exploring Prompt Injection
Covers 3 stories, including A Theory of Why Prompt Injection Works