LLM Vulnerabilities

Feeds to Scour
SubscribedAll
Scoured 217 posts in 62.8 ms

Indirect Prompt Injection remains a fundamental security challenge for AI

 💉Prompt Injection  Content type: Blog
brave.com·

Prompt Injection Defense Pipeline

 💉Prompt Injection
emergentmind.com·

ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.

 🤝Multi-Agent Orchestration  Content type: Code
github.com··Hacker News

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

 🔗Hybrid Search
pub.towardsai.net
·

Defending Jailbreak Attacks on Large Language Models via Manifold Trajectory Kinetics

 💉Prompt Injection  Content type: Academic
arxiv.org·

iOS 27 system prompts

 🔧Developer tools
gist.github.com··Lobsters

ChatGPT Introduces Lockdown Mode to Everyone, Preventing Prompt Injection Attacks

 🛡️AI Security
researchsnipers.com·

Prompt Injection in RAG Agentic Systems

 💉Prompt Injection
ulad.net··Hacker News

Comparing Claude Fable 5's system prompt to Opus 4.8

 💻Claude Code  Content type: Blog
twelvetables.blog··Hacker News

Tiberius: A Security Testing Framework for LLM Applications in Java

 💉Prompt Injection
foojay.io·
Less-relevant results

Phishing for Lobsters: How We Tricked OpenClaw into Spilling Secrets

 💉Prompt Injection  Content type: Blog
varonis.com··Hacker News

LLM Observability: What To Instrument and How To Act on It

 🪄Prompt Engineering  Content type: Blog
blog.n8n.io·

From prompt to pwned: chaining LLM and web bugs to Admin

 🛡️AI Security  Content type: Blog
blog.quarkslab.com·

Polymarket Annotation Injection

 🛡️AI Security

agentsploit/agentsploit: Offensive security framework for AI agents and MCP servers.

 📋MCP  Content type: Code
github.com··Hacker News

Context-Fractured Decomposition Attacks on Tool-Using LLM Agents: Exploiting Artifact Provenance Gaps

 💉Prompt Injection  Content type: Academic
arxiv.org·

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

 💉Prompt Injection

Anthropic says internal and external red team tests of Fable 5 found no universal jailbreaks; it will keep user traffic for 30 days, aligning with Trump's AI EO...

 🎭Claude
techmeme.com·

Meet Hades: The malware that lies to AI security agents

 💉Prompt Injection  Content type: News

OpenAI Help: Lockdown Mode

 💉Prompt Injection
simonwillison.net·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help