🕳 LLM Vulnerabilities - emschwartz · Scour

Claude Opus 4.8 system prompt leaked

gist.github.com··Hacker News

Less-relevant results

Build a Basic AI Agent from Scratch: Long Task Planning

💻Coding Agents Blog

ruxu.dev··Hacker News

Anthropic says internal and external red team tests of Fable 5 found no universal jailbreaks; it will keep user traffic for 30 days, aligning with Trump's AI EO...

Neglected Basics of AI Alignment

🛡️AI Safety

lesswrong.com·

Siri AI is a Malware Vector

🛡️AI Security Blog

loufranco.com··Hacker News

LLM-Guided Neural Architecture Search for Robust Co-Design of Physical Neural Networks

🧠LLM Inference Academic

Claude Fable 5 and new AI safety fables

🎭Claude News

interconnects.ai··Hacker News

SaqlainXoas/llm-system-patterns: A docs-first guide to LLM system design — hybrid search, embedding pipelines, reranking, and LLM-as-judge patterns.

💉Prompt Injection Code

github.com··r/LocalLLaMA, r/SideProject

Production AI Playbook: Complex Agent Patterns

📡RSS Blog

System Prompts & Custom Instructions: Your Permanent

🪄Prompt Engineering

pub.towardsai.net

·

Casual experiment hint that models seem to search for different stuff

spock.is··Hacker News

Context-Fractured Decomposition Attacks on Tool-Using LLM Agents: Exploiting Artifact Provenance Gaps

💉Prompt Injection Academic

Evaluating using Mock Tool Calls to Quarantine Untrusted Prompt Inputs

🪄Prompt Engineering

lesswrong.com·

The Bill Arrives: How to Manage Agentic AI Costs at Scale

🤖AI Blog

cockroachlabs.com·

Tokenminning: Because Tokenmaxxing Is a Bad Idea

🪄Prompt Engineering

tokenminning.com··Hacker News

Lockdown Mode is rolling out to all ChatGPT accounts

🛡️AI Security

agentsploit/agentsploit: Offensive security framework for AI agents and MCP servers.

📋MCP Code

github.com··Hacker News

The best new ChatGPT feature is one most people will never use

🛡️AI Security

digitaltrends.com·

When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models

🪄Prompt Engineering Academic

The Meta hack shows there’s more to AI security than Mythos

🔓Hacking News

technologyreview.com··Hacker News

No more posts from emschwartz's subscribed feeds.

Scour all 25255 feeds Learn more about Feeds

Sign up or log in to see more results

Log in to enable infinite scrolling