Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
model alignment, guardrails, responsible AI, AI red teaming
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
373
posts in
34.0
ms
🤖
AI Agents
Turing Post
·
4d
4 days ago
How
Responsible
AI
Changes In The Agent Era
Covers
EU Artificial Intelligence Act
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for How Responsible AI Changes In The Agent Era
🤖
AI Agents
Check Point Blog
·
7h
7 hours ago
From
Prompt
Testing to
AI
Red
Teaming at Enterprise Scale
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for From Prompt Testing to AI Red Teaming at Enterprise Scale
📊
LLM Evaluation
arXiv
·
1d
1 day ago
Affective
AI
Safety
: The Missing Piece in LLM
Safety
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Affective AI Safety: The Missing Piece in LLM Safety
🧠
LLMs
giskard.ai
·
12h
12 hours ago
Giskard: LLM esting platform for preventing hallucinations and security issues
Covers
3 stories
See all stories this covers
including
Garak, LLM Vulnerability Scanner
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Giskard: LLM esting platform for preventing hallucinations and security issues
🤖
AI Agents
medium.com
·
1d
1 day ago
The Role of HR in
Responsible
AI
Adoption
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Role of HR in Responsible AI Adoption
🤖
AI Agents
medium.com
·
5d
5 days ago
The 6 Principles of
Responsible
AI
: Why
Responsible
AI
Matters More Than Powerful
AI
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The 6 Principles of Responsible AI: Why Responsible AI Matters More Than Powerful AI
🔭
Observability
SentinelOne
·
2d
2 days ago
macOS.Gaslight | Rust Backdoor Turns
Prompt
Injection
on the Analyst, Not the Sandbox
Covers
2 stories
See all stories this covers
including
Mini Shai-Hulud, Miasma, and Hades Worms Target Bioinformatics and MCP Developers via Malicious PyPI Wheels
Covered by
3 sources
See all sources covering this story
including
Malware Analysis, News and Indicators
,
Infosecurity Magazine
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for macOS.Gaslight | Rust Backdoor Turns Prompt Injection on the Analyst, Not the Sandbox
🤖
AI Agents
GitHub
·
8h
8 hours ago
Show HN: Lelu – gate OpenAI agent actions on confidence and
prompt
injection
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Lelu – gate OpenAI agent actions on confidence and prompt injection
🤖
AI Agents
EDB
·
10h
10 hours ago
Inside EDB’s New Principles for
Responsible
AI
: Sovereign, Governed, Trusted and Beneficial
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Inside EDB’s New Principles for Responsible AI: Sovereign, Governed, Trusted and Beneficial
✍️
Prompt Engineering
4sysops
·
2d
2 days ago
Malicious npm and PyPI packages use
prompt
injection
to bypass
AI
security scanners
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Malicious npm and PyPI packages use prompt injection to bypass AI security scanners
🏗️
AI Infra
Science
·
5d
5 days ago
Researchers caught in the crossfire as companies and government grapple over
AI
safety
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Researchers caught in the crossfire as companies and government grapple over AI safety
✍️
Prompt Engineering
medium.com
·
1d
1 day ago
Intent Doesn’t Lie. How TIKOS® Stopped Every
Prompt
Injection
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Intent Doesn’t Lie. How TIKOS® Stopped Every Prompt Injection
✍️
Prompt Engineering
Google
·
10h
10 hours ago
Computer use in Gemini 3.5 Flash
Covers
Computer Use | Gemini API | Google AI for Developers
Covered by
3 sources
See all sources covering this story
including
Richard Seroter's Architecture Musings
,
TNW | Artificial-Intelligence
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Computer use in Gemini 3.5 Flash
🔗
APIs
ryandens.github.io
·
3d
3 days ago
Promptblock
– detect prompt
injections
in GitHub issues
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Promptblock – detect prompt injections in GitHub issues
⚙️
Backend Engineering
easternherald.com
·
2d
2 days ago
OrcaRouter Releases
AI
Threat Report 2026 and Makes Its Security Controls Free Amid Rise in
Prompt-Injection
Attacks
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for OrcaRouter Releases AI Threat Report 2026 and Makes Its Security Controls Free Amid Rise in Prompt-Injection Attacks
🏗️
AI Infra
Business Insider
·
2h
2 hours ago
A New York primary winner has a defiant message for OpenAI and Anthropic
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for A New York primary winner has a defiant message for OpenAI and Anthropic
✍️
Prompt Engineering
role-confusion.github.io
·
2d
2 days ago
A Theory of Why
Prompt
Injection
Works
Covers
2 stories
See all stories this covers
including
Playwright MCP Server – Snapshot based – faster and more reliable than images
Covered by
6 sources
See all sources covering this story
including
Simon Willison’s Weblog
,
tldr.tech
Discussed on
Hacker News
and
Lobsters
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for A Theory of Why Prompt Injection Works
🤖
AI Agents
stevekinney.com
·
6d
6 days ago
Some Thoughts on
AI
Safety
Covers
11 stories
See all stories this covers
including
Goodhart's Law
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Some Thoughts on AI Safety
🧠
LLMs
Bloomberg
·
2d
2 days ago
Tech Disruptors: Invisible Technologies on
RLHF
and LLM
Training
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tech Disruptors: Invisible Technologies on RLHF and LLM Training
🧠
LLMs
Turing Post
·
3h
3 hours ago
AI
Agents in 2026: Local, Physical,
Responsible
AI
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for AI Agents in 2026: Local, Physical, Responsible AI
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report