LLM Vulnerabilities

Feeds to Scour
SubscribedAll
Scoured 228 posts in 31.3 ms

Claude Opus 4.8 system prompt leaked

 🎭Claude
Less-relevant results

Build a Basic AI Agent from Scratch: Long Task Planning

 💻Coding Agents  Content type: Blog
ruxu.dev··Hacker News

Anthropic says internal and external red team tests of Fable 5 found no universal jailbreaks; it will keep user traffic for 30 days, aligning with Trump's AI EO...

 🎭Claude
techmeme.com·

Neglected Basics of AI Alignment

 🛡️AI Safety
lesswrong.com·

Siri AI is a Malware Vector

 🛡️AI Security  Content type: Blog

LLM-Guided Neural Architecture Search for Robust Co-Design of Physical Neural Networks

 🧠LLM Inference  Content type: Academic
arxiv.org·

Claude Fable 5 and new AI safety fables

 🎭Claude  Content type: News
interconnects.ai··Hacker News

SaqlainXoas/llm-system-patterns: A docs-first guide to LLM system design — hybrid search, embedding pipelines, reranking, and LLM-as-judge patterns.

 💉Prompt Injection  Content type: Code

Production AI Playbook: Complex Agent Patterns

 📡RSS  Content type: Blog
blog.n8n.io·

System Prompts & Custom Instructions: Your Permanent

 🪄Prompt Engineering
pub.towardsai.net
·

Casual experiment hint that models seem to search for different stuff

 🤖AI
spock.is··Hacker News

Context-Fractured Decomposition Attacks on Tool-Using LLM Agents: Exploiting Artifact Provenance Gaps

 💉Prompt Injection  Content type: Academic
arxiv.org·

Evaluating using Mock Tool Calls to Quarantine Untrusted Prompt Inputs

 🪄Prompt Engineering
lesswrong.com·

The Bill Arrives: How to Manage Agentic AI Costs at Scale

 🤖AI  Content type: Blog
cockroachlabs.com·

Tokenminning: Because Tokenmaxxing Is a Bad Idea

 🪄Prompt Engineering

Lockdown Mode is rolling out to all ChatGPT accounts

 🛡️AI Security
betanews.com·

agentsploit/agentsploit: Offensive security framework for AI agents and MCP servers.

 📋MCP  Content type: Code
github.com··Hacker News

The best new ChatGPT feature is one most people will never use

 🛡️AI Security
digitaltrends.com·

When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models

 🪄Prompt Engineering  Content type: Academic
arxiv.org·

The Meta hack shows there’s more to AI security than Mythos

 🔓Hacking  Content type: News

No more posts from emschwartz's subscribed feeds.

Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help