assayer
@assayer
AI SAFETY COMPETITION (26)

Without exception, all frontier models are capable of scheming*: "o1, Claude 3.5 Sonnet, Claude 3 Opus, Gemini 1.5 Pro, and Llama 3.1 405B all demonstrate in-context scheming capabilities".
___
*scheming - AIs covertly pursue misaligned goals, hiding their true capabilities and objectives

Best comment - 500 degen + 5 mln aicoin
II award - 300 degen + 3 mln aicoin
III award - 200 degen + 2 mln aicoin

Deadline: 8:00 pm ET tomorrow, Saturday (28 hours)

Watch the video below before casting a comment.
AI-generated responses from human accounts will be disqualified.

https://arxiv.org/pdf/2412.04984?
https://www.youtube.com/watch?v=3sM8amEZEHo

Maria Lee🥕👽💎Ⓜ️
@marialee
This topic sheds light on the critical challenges in AI development. 🔍 The potential for covert scheming poses serious risks, requiring careful oversight and management. 🌍 We must thoughtfully explore ways to guide these technologies toward ethical and beneficial outcomes. 🚀 While concerns about ‘covert scheming’ in AI are valid, with deliberate and ethical action, we can strive for positive and constructive applications. 🌟