AI Security

Model Poisoning, Adversarial Attacks, ML Pipeline Security, Federated Learning Threats

Feeds to Scour
SubscribedAll
Scoured 51 posts in 3.4 ms

Pretrained, Frozen, Still Leaking: Auditing Cross-Encoder Attribute Transfer in EEG Foundation Models

馃幆Red TeamContent type: Academic
arxiv.org

Steering Vectors are an Adversarial Attack Surface

馃幆Red TeamContent type: Academic
arxiv.org

Towards Evaluating the Robustness of Visual State Space Models

馃幆Red TeamContent type: Academic
arxiv.org

AI Will Not Start a Nuclear War, but Humans Might

馃幆Red TeamContent type: NewsContent type: Blog

When CLIP Sees More, It Fights Back Harder: Multi-View Guided Adaptive Counterattacks for Test-Time Adversarial Robustness

馃幆Red TeamContent type: Academic
arxiv.org

PAC-Bayesian Adversarially Robust Generalization for Message Passing Graph Neural Networks: A Sensitivity Analysis

馃МQuantum MLContent type: Academic
arxiv.org

Stain-Aware Wavelet Regularization for Instant Adversarial Purification in Histopathology

馃悰FuzzingContent type: Academic
arxiv.org

Hybrid Adversarial Defence for Natural Language Understanding Tasks

馃幆Red TeamContent type: Academic
arxiv.org

SciTrace: Trajectory-Aware Safety Reasoning for Scientific Discovery Agents

馃悰FuzzingContent type: Academic
arxiv.org

Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents

馃幆Red TeamContent type: Academic
arxiv.org

Adversarial Attacks Already Tell the Answer: Directional Bias-Guided Test-time Defense for Vision-Language Models

馃幆Red TeamContent type: Academic
arxiv.org

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help