LLM Vulnerabilities
Specific
Hacking LLMs, Prompt Injection
Scoured 282 posts in 17.7 ms
💉 Prompt Injection · arXiv · 21 hours ago
A Red Teaming Framework for Large Language Models: A Case Study on Faithfulness Evaluation
💉 Prompt Injection · latent.space · 3 days ago
Red-Teaming after Mythos — Zico Kolter & Matt Fredrikson, Gray Swan
Covers: The lethal trifecta for AI agents: private data, untrusted content, and external communication
Covered by: tldr.tech, contextmaestro.com
Discussed on: Hacker News
📊 LLM Evaluation · giskard.ai · 1 day ago
Giskard: LLM testing platform for preventing hallucinations and security issues
Covers: 3 stories, including Garak, LLM Vulnerability Scanner
Discussed on: Hacker News
💬 LLM Prompting · role-confusion.github.io · 3 days ago
A Theory of Why Prompt Injection Works
Covers: 3 stories, including Playwright MCP Server – Snapshot based – faster and more reliable than images
Covered by: 8 sources, including Simon Willison’s Weblog, Schneier on Security
Discussed on: Hacker News and Lobsters
🔐 Cybersecurity · OffSec · 2 days ago
Cybersecurity Training in the Age of AI
🛡️ AI Security · beSpacific · 23 hours ago
Prompt Injection: What Lawyers Considering Agentic AI
🎯 Alignment Research · Pangeanic Blog · 17 hours ago
From Fine-Tuning to Red Teaming: The Data Operations Behind Reliable AI Models
Covers: AI Risk Management Framework
🔐 Cybersecurity · Orca Security · 2 days ago
Best AI Cybersecurity Providers 2026: A Buyer’s Guide to AI-Powered Security Platforms
Covers: RAG Security: Prevent Data Leaks with Access Control
🛡️ AI Security · medium.com · 2 days ago
Intent Doesn’t Lie. How TIKOS® Stopped Every Prompt Injection
💉 Prompt Injection · easternherald.com · 3 days ago
OrcaRouter Releases AI Threat Report 2026 and Makes Its Security Controls Free Amid Rise in Prompt-Injection Attacks
🛡️ AI Security · Infosecurity Magazine · 1 day ago
macOS Backdoor Uses Prompt Injection to Evade AI Triage
Covers: macOS.Gaslight | Rust Backdoor Turns Prompt Injection on the Analyst, Not the Sandbox
💉 Prompt Injection · medium.com · 4 days ago
AI Red Teaming: The Key to Testing Real-World LLM Risks and Vulnerabilities
🔐 Cybersecurity · dualuse.dev · 2 days ago
Export controls for Fable are too late to slow proliferation
Covers: 2 stories, including Project Glasswing: Securing critical software for the AI era
Discussed on: Hacker News
📊 LLM Evaluation · Check Point Blog · 1 day ago
From Prompt Testing to AI Red Teaming at Enterprise Scale
💉 Prompt Injection · paddo.dev · 5 days ago
It Was Never the Jailbreak. It Was the Guest List.
Covers: The Korean Telecom Giant at the Center of Anthropic’s Mythos Controversy
✍️ Prompt Engineering · ryandens.github.io · 4 days ago
Promptblock – detect prompt injections in GitHub issues
Discussed on: Hacker News
✍️ Prompt Engineering · medium.com · 6 days ago
Fictional Framing as a Prompt Injection Vector: A Reproducibility Study on GPT-4o and Claude
💉 Prompt Injection · arXiv · 21 hours ago
What Intermediate Layers Know: Detecting Jailbreaks from Entropy Dynamics
💉 Prompt Injection · arXiv · 21 hours ago
How Reliable Is Your Jailbreak Judge? Calibration and Adversarial Robustness of Automated ASR Scoring
🧠 LLM · arXiv · 21 hours ago
RAS: Measuring LLM Safety Through Refusal Alignment