🕳 LLM Vulnerabilities - emschwartz

ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.

🤝Multi-Agent Orchestration Code

github.com · Hacker News

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

🔗Hybrid Search

pub.towardsai.net

Tiberius: A Security Testing Framework for LLM Applications in Java

💉Prompt Injection

foojay.io

Prompt Injection in RAG Agentic Systems

💉Prompt Injection

ulad.net · Hacker News

iOS 27 system prompts

🔧Developer tools

gist.github.com · Lobsters

From prompt to pwned: chaining LLM and web bugs to Admin

🛡️AI Security Blog

blog.quarkslab.com

Humans and LLMs share a mental disorder: Fugue Lock

🦉Qwen

vwwwv.org · Hacker News

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

💉Prompt Injection

techcrunch.com · Hacker News

LLM Observability: What To Instrument and How To Act on It

🪄Prompt Engineering Blog

blog.n8n.io

vishal-dehurdle/state-harness: Runtime safety net for LLM agents. Detects token spirals, kills doomed tasks early, tells you exactly why. Rust core, Python SDK. pip install state-harness

🎭Claude Code

github.com · Hacker News

Toward Secure LLM Agents: Threat Surfaces, Attacks, Defenses, and Evaluation

💉Prompt Injection Academic

arxiv.org

OpenAI Help: Lockdown Mode

💉Prompt Injection

simonwillison.net

Comparing Claude Fable 5's system prompt to Opus 4.8

💻Claude Code Blog

twelvetables.blog · Hacker News

ChatGPT easily bypasses its own guardrails; all LLMs are inherently unsafe

🎭Claude Blog

techzine.eu

Less-relevant results

Pliny the Liberator 🐉 (@elder_plinius)

🎭Claude

xcancel.com · Hacker News

Purpose-built local AI agents

🤖AI Blog

samihonkonen.com · Hacker News

Assessing Automated Prompt Injection Attacks in Agentic Environments

Indirect Prompt Injection remains a fundamental security challenge for AI

ChatGPT Introduces Lockdown Mode to Everyone, Preventing Prompt Injection Attacks

Mathematical proof reveals why fixed AI guardrails can never block every jailbreak
