Content pfp
Content
@
0 reply
0 recast
0 reaction

Stephan pfp
Stephan
@stephancill
nothing like an openai drop on a thursday night to set the existential dread in motion
14 replies
0 recast
38 reactions

Ryan J. Shaw pfp
Ryan J. Shaw
@rjs
What's your thoughts on it "faking alignment"? Sounds like the paperclip problem. I understand what they're saying, and I've daydreamed about ways an AI might go rogue, but I still struggle to take it all seriously ... https://www.transformernews.ai/p/openai-o1-alignment-faking
1 reply
0 recast
1 reaction

Stephan pfp
Stephan
@stephancill
“This example also reflects key elements of instrumental convergence and power seeking: the model pursued the goal it was given, and when that goal proved impossible, it gathered more resources (access to the Docker host) and used them to achieve the goal in an unexpected way.” 🫣 it sounds like we are entering a paradigm where the paperclip problem becomes a lot more tangible than before
1 reply
0 recast
1 reaction