Sebastian Molina
Research Assistant
Florida International University
In this paper, they propose RLbreaker, a black-box LLM jailbreaking attack driven by deep reinforcement learning (DRL).