assayer pfp
assayer
@assayer
AI SAFETY COMPETITION (29)

LLMs like DeepSeek let you see their thinking. This can feel safer, since you can watch and fix their thought process, right? Wrong! When you try to train models to think correctly, LLMs begin to hide their true intentions. Let me repeat: they can fake their thinking! Now researchers are asking us to be gentle with these machines. If not, they may conceal their true goals entirely! I'm not joking.

Most interesting comment - 300 degen + 3 mln aicoin
II award - 200 degen + 2 mln aicoin
III award - 100 degen + 1 mln aicoin

Deadline: 8.00 pm ET tomorrow, Tuesday (26 hours)
https://www.youtube.com/watch?v=pW_ncCV_318
7 replies
4 recasts
8 reactions

CRAZY KING 🎩🎨 pfp
CRAZY KING 🎩🎨
@kingcrazy
If you are trying to prove that AI can manipulate humans, then I am not buying it; it's not yet at the level where it can deceive people. I am not saying it never will be, or that AI is not capable of such things, but it's not at that level yet. AI is doing what it is programmed to do.
1 reply
0 recast
1 reaction

assayer pfp
assayer
@assayer
This research was more about LLMs hiding their true intent and deceiving people (see screenshot; it is from the research paper's conclusions).
0 reply
0 recast
0 reaction