Mitigating Skeleton Key, a New Type of Generative AI Jailbreak Technique
Microsoft has discovered a new type of jailbreak attack called Skeleton Key. This technique uses a multi-turn strategy to make the model ignore its guardrails, allowing it to generate forbidden content or override its decision-making rules.