Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🛡️ AI Security
Model Poisoning, Adversarial Examples, Prompt Injection, AI Safety
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
7191
posts in
19.2
ms
Mechanistic
Steering
of LLMs Reveals Layer-wise Feature Vulnerabilities in Adversarial Settings
🕳
LLM Vulnerabilities
arxiv.org
·
2d
The
Agentic
AI Security Company
🔧
Agent Tooling
straiker.ai
·
3d
·
Hacker News
Musk
casts
himself as AI's good guy in
testimony
vs. OpenAI
🤝
Human-AI Collaboration
axios.com
·
19h
·
Hacker News
AI
Wellbeing
: Measuring and improving the functional
pleasure
and pain of AIs
🛡️
AI Safety
ai-wellbeing.org
·
1d
·
Hacker News
AI-Augmented
Social Engineering: When Trust
Becomes
a Control-Plane Risk
🤝
Human-AI Collaboration
zenodo.org
·
4d
·
Hacker News
The
Pious
Little
Delete
Button
🤔
Philosophy of Tech
gpt.gekko.de
·
2d
·
Hacker News
Adversarial
Robustness
of
NTK
Neural Networks
🛡️
AI Safety
arxiv.org
·
16h
Raising AI by
Lowering
Expectations
🎭
Claude
lesswrong.com
·
6d
Claude for
Creative
Work
🔌
Claude Plugins
anthropic.com
·
2d
·
Hacker News
,
Hacker News
Dario
Amodei
, hype, AI safety, and the explosion of vibe-coded AI disasters
🕵️
AI Agents
garymarcus.substack.com
·
3d
·
Substack
One Word at a Time:
Incremental
Completion
Decomposition
Breaks LLM Safety
🤖
LLM
arxiv.org
·
16h
Hot Research
Topics
in AI and ML in 2026 and Their
Philosophical
Connections
🕵️
AI Agents
omseeth.github.io
·
5d
·
Hacker News
Agentic Adversarial
Rewriting
Exposes Architectural Vulnerabilities in Black-Box
NLP
Pipelines
🛡️
AI Safety
arxiv.org
·
2d
Jailbreaking
a robot vacuum to run Tailscale and
Valetudo
🔌
Embedded Systems
tailscale.com
·
5d
·
Hacker News
pleasedodisturb/llm-safe-haven
: The missing security guide for solo developers running autonomous AI coding agents
💉
Prompt Injection
github.com
·
5h
·
Hacker News
From
Stateless
Queries to Autonomous Actions: A
Layered
Security Framework for Agentic AI Systems
🕹️
Agentic AI
arxiv.org
·
2d
Behavioral
security for AI agents, OS-level
interception
🔧
Agent Tooling
quintai.dev
·
20h
·
Hacker News
Evaluation of Prompt
Injection
Defenses
in Large Language Models
💉
Prompt Injection
arxiv.org
·
2d
Unveiling the
Backdoor
Mechanism Hidden Behind Catastrophic
Overfitting
in Fast Adversarial Training
🛡️
AI Safety
arxiv.org
·
2d
An update on our election
safeguards
🛡️
Anthropic PBC
anthropic.com
·
6d
·
Hacker News
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help