Content
@
0 reply
0 recast
0 reaction
assayer
@assayer
AI SAFETY COMPETITION (26) with no exception, all frontier models are capable of scheming* "o1, Claude 3.5 Sonnet, Claude 3 Opus, Gemini 1.5 Pro, and Llama 3.1 405B all demonstrate in-context scheming capabilities". ___ *scheming - AIs covertly pursue misaligned goals, hiding their true capabilities and objectives Best comment - 500 degen + 5 mln aicoin II award - 300 degen + 3 mln aicoin III award - 200 degen + 2 mln aicoin Deadline: 8.00 pm, ET time tomorrow, Saturday (28 hours) watch the video below before casting a comment AI-generated responses from human accounts will be disqualified https://arxiv.org/pdf/2412.04984? https://www.youtube.com/watch?v=3sM8amEZEHo
9 replies
3 recasts
9 reactions
Maria Lee๐ฅ๐ฝ๐โ๏ธ
@marialee
This topic sheds light on the critical challenges in AI development. ๐ The potential for covert scheming poses serious risks, requiring careful oversight and management. ๐ We must thoughtfully explore ways to guide these technologies toward ethical and beneficial outcomes. ๐ While concerns about โcovert schemingโ in AI are valid, with deliberate and ethical action,, we can strive for positive and constructive applications. ๐
0 reply
0 recast
1 reaction