LLM Refusal Behavior on Open-Weight Model

Discussed on Hacker News

How AI refusal works, why it's a thin, removable layer on open-weight models, and what security teams should check before trusting such a model.