Albert Wenger
@albertwenger
Remember when Sam Altman said that AI systems could just "learn the collective moral preferences" in response to a question by Jack Kornfield about values? https://youtu.be/hn1Y6GVWUV0?si=wLl9t023O8eYREXm&t=750
2 replies
0 recast
2 reactions
Albert Wenger
@albertwenger
So about that ... this fascinating paper shows that models develop their own value systems and what emerges is, well, problematic arxiv.org/abs/2502.08640
0 reply
0 recast
0 reaction
JB Rubinovitz ⌐◨-◨
@rubinovitz
Saw this and thought of you https://x.com/OwainEvans_UK/status/1894436637054214509
2 replies
0 recast
2 reactions