LLM Vulnerabilities

Defending Jailbreak Attacks on Large Language Models via Manifold Trajectory Kinetics

 💉Prompt Injection  Content type: Academic
arxiv.org

New ChatGPT Lockdown Mode Limits Tools That Could Enable Data Exfiltration

 🛡️AI Security
thehackernews.com

Less-relevant results

ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.

 👨‍💻AI Coding  Content type: Code
github.com · Hacker News

Prompt Injection in RAG Agentic Systems

 🪄Prompt Engineering
ulad.net · Hacker News

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

 🛡️AI Security

Assessing Automated Prompt Injection Attacks in Agentic Environments

 💉Prompt Injection  Content type: Academic
arxiv.org

Phishing for Lobsters: How We Tricked OpenClaw into Spilling Secrets

 🕹️Agentic AI  Content type: Blog
varonis.com · Hacker News

OpenAI Help: Lockdown Mode

 🪄Prompt Engineering
simonwillison.net

Siri AI is a Malware Vector

 🛡️AI Security  Content type: Blog

ChatGPT Introduces Lockdown Mode to Everyone, Preventing Prompt Injection Attacks

 🛡️AI Security
researchsnipers.com

GitInject: Real-World Prompt Injection Attacks in AI-Powered CI/CD Pipelines

 🛡️AI Security  Content type: Academic
arxiv.org

agentsploit/agentsploit: Offensive security framework for AI agents and MCP servers.

 🔧Agent Tooling  Content type: Code
github.com · Hacker News

OpenAI Rolls Out Lockdown Mode to Fight Prompt Injection Attacks

 🛡️AI Security  Content type: News
pcmag.com

Context-Fractured Decomposition Attacks on Tool-Using LLM Agents: Exploiting Artifact Provenance Gaps

 🕵️AI Agents  Content type: Academic
arxiv.org

The Meta hack shows there’s more to AI security than Mythos

 🔓Hacking  Content type: News

Lockdown Mode is rolling out to all ChatGPT accounts

 🪄Prompt Engineering
betanews.com

MLingualFC: Evaluating Jailbreak Vulnerabilities in Multilingual Vision-Language Models

 🖼️Multimodal AI  Content type: Academic
arxiv.org

Ramifications of Using an Agent-in-the-Loop to Approve Commands

 🪄Prompt Engineering

QORIS-AI/knox: Security enforcement plugin for Claude Code. Blocks dangerous commands, audits every tool call, detects prompt injection.

 🔌Claude Plugins  Content type: Code
github.com · Hacker News

The Injection Paradox: Brand-Level Suppression in Safety-Trained LLM Recommendations via RAG Context Injection

 🛡️AI Security  Content type: Academic
arxiv.org
