AI Auditing

model auditing, AI accountability, red teaming, compliance evaluation

Feeds to Scour
SubscribedAll
Scoured 29 posts in 31.6 ms

PhysDox: Benchmarking LLMs on Physical Feasibility Auditing of Physiological Sensing Protocols

 🏆LLM Benchmarking  Content type: Academic
arxiv.org·

Matador-og/huntbot: AI offensive security harness for bug bounty, pentesting, red teaming.

 🔓Hacking  Content type: Code
github.com··Hacker News

Speed over Caution: What NSPM-11 Means

 🔓Hacking

Red Teaming MCP Servers: 24 Attack Payloads and the Blueprint for Agentic Defense-in-Depth

 📋MCP
pub.towardsai.net
·

The Meta hack shows there’s more to AI security than Mythos

 🔓Hacking  Content type: News

Anthropic releases ‘safe’ version of Claude Mythos AI model to public

 🧬Mythos  Content type: News
theguardian.com·

Zscaler optimizes Zero Trust for agentic AI security

 📋MCP  Content type: Blog
techzine.eu·

On Slop

 🛡️Content Moderation
lesswrong.com·

Red Team Notes

 🛡️Content Moderation
ired.team·

Learning to Attack and Defend: Adaptive Red Teaming of Language Models via GRPO

 🛡️Content Moderation  Content type: Academic
arxiv.org·

Anthropic Offers Mythos Upgrade for Cyber Partners and a ‘Safe’ Version for the Rest of You

 🎭Claude  Content type: News
wired.com·

teia-igo-vs-claude-opus-4.8/README.en.md at main · joseteiadirector/teia-igo-vs-claude-opus-4.8

 🎭Claude  Content type: Code
github.com··Hacker News

Evaluating AI Investment Strategies

 Fast AI Inference  Content type: Academic
arxiv.org·

EP217: Latency vs Throughput vs Bandwidth

 🆕New AI  Content type: News  Content type: Blog
blog.bytebytego.com·

OpenAI's agent chained decade-old DoS attacks to crash web servers in seconds

 🌐HTTP/2

Culturally-Adapted Red-Teaming Across East and Southeast Asian Contexts: A Methodological and Comparative Analysis

 🛡️Content Moderation  Content type: Academic
arxiv.org·

OpenAI fixed a visibility problem; the governance problem remains.

 🛡️Content Moderation
infoworld.com·

Your AI Agent Is Not a Security Boundary

 💻Coding Agents
pub.towardsai.net
·

Can Data Work be Reparative?

 🛡️Content Moderation  Content type: Academic
arxiv.org·

FoeGlass: Simple In-Context Learning Is Enough for Red Teaming Audio Deepfake Detectors

 🛡️Content Moderation  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help