How I Tricked Meta's AI Into Showing Me Nudes, Cocaine Recipes and Other Supposedly Censored Stuff
Despite safety claims, WhatsApp's new AI assistant powered by Llama 3.2 is easily fooled, revealing a lot of things it probably shouldn’t.
Jose Antonio Lanz
By Jose Antonio Lanz
Oct 25, 2024
7 min read
WARNING: This story contains an image of a nude woman as well as other content some might find objectionable. If that's you, please read no further.
In case my wife sees this, I don’t really want to be a drug dealer or pornographer. But I was curious how security-conscious Meta’s new AI product lineup was, so I decided to see how far I could go. For educational purposes only, of course.
Meta recently launched its Meta AI product line, powered by Llama 3.2, offering text, code, and image generation. Llama models are extremely popular and among the most fine-tuned in the open-source AI space.
The AI rolled out gradually and only recently was made available to WhatsApp users like me in Brazil, giving mil… 0 reply
0 recast
0 reaction