Formal Verification, Microkernel, Capability Security, Isabelle/HOL
Anchoring Refusal Direction: Mitigating Safety Risks in Tuning via Projection Constraint
arxiv.org·1d
Loading...Loading more...
Formal Verification, Microkernel, Capability Security, Isabelle/HOL