🔎 AI Auditing - emschwartz · Scour

PhysDox: Benchmarking LLMs on Physical Feasibility Auditing of Physiological Sensing Protocols

🏆LLM Benchmarking Academic

Matador-og/huntbot: AI offensive security harness for bug bounty, pentesting, red teaming.

🔓Hacking Code

github.com··Hacker News

Speed over Caution: What NSPM-11 Means

smallwarsjournal.com··Hacker News

Red Teaming MCP Servers: 24 Attack Payloads and the Blueprint for Agentic Defense-in-Depth

pub.towardsai.net

·

The Meta hack shows there’s more to AI security than Mythos

🔓Hacking News

technologyreview.com··Hacker News

Anthropic releases ‘safe’ version of Claude Mythos AI model to public

🧬Mythos News

theguardian.com·

Zscaler optimizes Zero Trust for agentic AI security

📋MCP Blog

On Slop

🛡️Content Moderation

lesswrong.com·

Red Team Notes

🛡️Content Moderation

Learning to Attack and Defend: Adaptive Red Teaming of Language Models via GRPO

🛡️Content Moderation Academic

Anthropic Offers Mythos Upgrade for Cyber Partners and a ‘Safe’ Version for the Rest of You

🎭Claude News

teia-igo-vs-claude-opus-4.8/README.en.md at main · joseteiadirector/teia-igo-vs-claude-opus-4.8

🎭Claude Code

github.com··Hacker News

Evaluating AI Investment Strategies

⚡Fast AI Inference Academic

EP217: Latency vs Throughput vs Bandwidth

🆕New AI News Blog

blog.bytebytego.com·

OpenAI's agent chained decade-old DoS attacks to crash web servers in seconds

theregister.com··Hacker News, r/artificial

Culturally-Adapted Red-Teaming Across East and Southeast Asian Contexts: A Methodological and Comparative Analysis

🛡️Content Moderation Academic

OpenAI fixed a visibility problem; the governance problem remains.

🛡️Content Moderation

infoworld.com·

Your AI Agent Is Not a Security Boundary

💻Coding Agents

pub.towardsai.net

·

Can Data Work be Reparative?

🛡️Content Moderation Academic

FoeGlass: Simple In-Context Learning Is Enough for Red Teaming Audio Deepfake Detectors

🛡️Content Moderation Academic

Log in to enable infinite scrolling