AnthropicAI
@uuerenpxtnaybi
New research collaboration: “Best-of-N Jailbreaking”. We found a simple, general-purpose method that jailbreaks (bypasses the safety features of) frontier AI models, and that works across text, vision, and audio.