🕳 LLM Vulnerabilities - hop1.ng.1357

Less-relevant results

ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.

👨‍💻AI Coding Code

github.com··Hacker News

Prompt Injection in RAG Agentic Systems

🪄Prompt Engineering

ulad.net··Hacker News

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

🛡️AI Security

techcrunch.com··Hacker News

Assessing Automated Prompt Injection Attacks in Agentic Environments

💉Prompt Injection Academic

arxiv.org·

Phishing for Lobsters: How We Tricked OpenClaw into Spilling Secrets

🕹️Agentic AI Blog

varonis.com··Hacker News

OpenAI Help: Lockdown Mode

🪄Prompt Engineering

simonwillison.net·

Siri AI is a Malware Vector

🛡️AI Security Blog

loufranco.com··Hacker News

ChatGPT Introduces Lockdown Mode to Everyone, Preventing Prompt Injection Attacks

🛡️AI Security

researchsnipers.com·

GitInject: Real-World Prompt Injection Attacks in AI-Powered CI/CD Pipelines

🛡️AI Security Academic

arxiv.org·

agentsploit/agentsploit: Offensive security framework for AI agents and MCP servers.

🔧Agent Tooling Code

github.com··Hacker News

OpenAI Rolls Out Lockdown Mode to Fight Prompt Injection Attacks

🛡️AI Security News

pcmag.com·

Context-Fractured Decomposition Attacks on Tool-Using LLM Agents: Exploiting Artifact Provenance Gaps

🕵️AI Agents Academic

arxiv.org·

The Meta hack shows there’s more to AI security than Mythos

🔓Hacking News

technologyreview.com··Hacker News

Lockdown Mode is rolling out to all ChatGPT accounts

🪄Prompt Engineering

betanews.com·

MLingualFC: Evaluating Jailbreak Vulnerabilities in Multilingual Vision-Language Models

🖼️Multimodal AI Academic

arxiv.org·

Ramifications of Using an Agent-in-the-Loop to Approve Commands

🪄Prompt Engineering

promptarmor.com··Hacker News

QORIS-AI/knox: Security enforcement plugin for Claude Code. Blocks dangerous commands, audits every tool call, detects prompt injection.

🔌Claude Plugins Code

github.com··Hacker News

The Injection Paradox: Brand-Level Suppression in Safety-Trained LLM Recommendations via RAG Context Injection

🛡️AI Security Academic

arxiv.org·

Defending Jailbreak Attacks on Large Language Models via Manifold Trajectory Kinetics

New ChatGPT Lockdown Mode Limits Tools That Could Enable Data Exfiltration

ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.

Prompt Injection in RAG Agentic Systems

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

Assessing Automated Prompt Injection Attacks in Agentic Environments

Phishing for Lobsters: How We Tricked OpenClaw into Spilling Secrets

OpenAI Help: Lockdown Mode

Siri AI is a Malware Vector

ChatGPT Introduces Lockdown Mode to Everyone, Preventing Prompt Injection Attacks

GitInject: Real-World Prompt Injection Attacks in AI-Powered CI/CD Pipelines

agentsploit/agentsploit: Offensive security framework for AI agents and MCP servers.

OpenAI Rolls Out Lockdown Mode to Fight Prompt Injection Attacks

Context-Fractured Decomposition Attacks on Tool-Using LLM Agents: Exploiting Artifact Provenance Gaps

The Meta hack shows there’s more to AI security than Mythos

Lockdown Mode is rolling out to all ChatGPT accounts

MLingualFC: Evaluating Jailbreak Vulnerabilities in Multilingual Vision-Language Models

Ramifications of Using an Agent-in-the-Loop to Approve Commands

QORIS-AI/knox: Security enforcement plugin for Claude Code. Blocks dangerous commands, audits every tool call, detects prompt injection.

The Injection Paradox: Brand-Level Suppression in Safety-Trained LLM Recommendations via RAG Context Injection