Content Moderation


Mod-Guide: An LLM-based Content Moderation Feedback System to Address Insensitive Speech toward Indigenous Ethnic and Religious Minority Communities

 🏛️Platform Liability  Content type: Academic
arxiv.org

ChatGPT's new Lockdown Mode lets you disable web access and more to protect sensitive data from prompt injection

 💉Prompt Injection
the-decoder.com

Ctrl-Alt-Speech: Cupertino d’État

 🧬Mythos
techdirt.com

sinewaveai/prooflayer-rules: Open-source runtime security rules engine for MCP servers and AI agents. Detects prompt injection, command injection, jailbreaks, and data exfiltration.

 🕳LLM Vulnerabilities  Content type: Code
github.com · Hacker News

The Prompt Injection Defense Framework I Wish Every AI Engineer Followed

 🛡️AI Security
pub.towardsai.net

New ChatGPT Lockdown Mode Limits Tools That Could Enable Data Exfiltration

 💉Prompt Injection  4 articles covering this post

Racist comments targeting politicians tripled since Meta relaxed its rules

 🏛️Platform Liability  Content type: News
arstechnica.com

Ask HN: I Need Help for a Product

 🏛️Platform Liability  Content type: Discussion

ChatGPT Introduces Lockdown Mode to Everyone, Preventing Prompt Injection Attacks

 🛡️AI Security

Show HN: Bosun – a small model that keeps an agent's memory graph clean

 🤖AI

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

 💉Prompt Injection  6 articles covering this post

Matador-og/huntbot: AI offensive security harness for bug bounty, pentesting, red teaming.

 🔓Hacking  Content type: Code
github.com · Hacker News

PI-Hunter: Automated Red-Teaming for Exposing and Localizing Prompt Injections

 💉Prompt Injection  Content type: Academic
arxiv.org

Indirect Prompt Injection remains a fundamental security challenge for AI

 💉Prompt Injection  Content type: Blog
brave.com

I'm a 3rd year CS student who built a Chrome extension in a week — here's what I learned trying to get my first real users

 💻Claude Code  Content type: Blog
indiehackers.com

OpenAI rolls out ChatGPT Lockdown Mode for prompt-injection risks

 🛡️AI Security
kite.kagi.com

Valve will stop selling Steam gift cards at retailers over scam concerns

 🏛️Platform Liability  Content type: News
pcgamer.com · Hacker News

Prompt Injection in RAG Agentic Systems

 💉Prompt Injection
ulad.net · Hacker News

Love Teaching? ByteByteGo Is Hiring Part-Time AI & Engineering Instructors

 🔎AI Auditing  Content type: News  Content type: Blog
blog.bytebytego.com

Guardian Runtime – Local firewall for AI coding agents and runaway costs

 💻Coding Agents
