New Learnable Framework Detects Unseen Jailbreak Attacks on Vision-Language Models
Researchers have introduced a novel detection system called Learning to Detect (LoD) in a recent arXiv preprint, aiming to identify…