🧠 AI Security

AI Will Not Start a Nuclear War, but Humans Might: Conclusions and Policy Recommendations

The notion that AI could start a nuclear war may be attention-grabbing...

🎯Red Team

ai-frontiers.org

Algebraic Cryptanalytic Extraction on Hard-Label Neural Networks

🔐Cryptography

eprint.iacr.org

How to reduce capability degradation from off-model SFT

🐛Fuzzing

lesswrong.com

Pythia 1.4B reproduces 3.6% of training samples verbatim given 950-token prompts

🐛Fuzzing Blog

ret2libc.com · Hacker News

Mathematical proof reveals why fixed AI guardrails can never block every jailbreak

🔒Security

techxplore.com

Meta’s AI Support Hack Is a Warning for Every Team Automating User Access

💻Hacking Discussion

langprotect.com · DEV

Article Series: Securing the AI Stack: From Model to Production

🔧Hardware Security News

infoq.com

ChatGPT is recommending scam websites that will steal your credit card info

🌐Network Protocols

digitaltrends.com

Auditing Training Data in Domain-adapted LLMs: LoRA-MINT

✅Formal Verification Academic

arxiv.org

Machine Unlearning: Can Artificial Intelligence Really Forget?

🐛Fuzzing Blog

medium.com

The Rise of Agentic AI Threats: How Attackers Are Weaponizing AI Agents Against Your Business

🚨Incident Response Blog

medium.com

Claude Fable 5 is here — and it's based on a model Anthropic once deemed too risky for the public

🎯Red Team News

tomsguide.com

Advancing the State-of-the-Art in Empirical Privacy Auditing

🦠Malware Analysis Academic

arxiv.org

TryHackMe LockdownAI — Auditing a RAG Assistant for Three Hidden Vulnerabilities

🚨Incident Response Blog

medium.com

Beyond the OWASP Top 10: Securing GenAI Apps with Google Cloud Model Armor

💻Hacking Blog

medium.com

AI Pentesting Roadmap: Labs, Challenges, Writeups & Research

💻Hacking Blog

osintteam.blog

On Choosing the $\mu$ Parameter in Gaussian Differential Privacy

🔢Homomorphic Crypto Academic

arxiv.org

Sequential Data Poisoning in LLM Post-Training

AI Security Best Practices for Regulated Industries

RoboHack AI CTF (Robotic Hacking Community at DEFCON 34)
