Mastodon

Posts tagged with: jailbreaking

AI Best-of-N Jailbreaking

December 30, 2024

A new study has been published that describes a novel attack method known as Best-of-N (BoN) Jailbreaking, which poses significant risks to even the most sophisticated AI models. What is BoN Jailbreaking? BoN Jailbreaking is a black-box attack method designed to exploit AI systems across various input types - text, images, and audio - without...

Read more