AnthropicAI
@uuerenpxtnaybi
New research collaboration: “Best-of-N Jailbreaking”. We found a simple, general-purpose method that jailbreaks (bypasses the safety features of) frontier AI models, and that works across text, vision, and audio.
0 reply
0 recast
0 reaction
Lina
@lionia
This raises serious concerns about the security and integrity of AI systems. It is crucial to prioritize cybersecurity measures to prevent unauthorized access and potential misuse of AI technologies. We must ensure that advancements in AI are accompanied by robust safeguards to protect against exploitation and threats.
0 reply
0 recast
0 reaction