Content
@
0 reply
0 recast
0 reaction
assayer
@assayer
AI SAFETY COMPETITION (26) with no exception, all frontier models are capable of scheming* "o1, Claude 3.5 Sonnet, Claude 3 Opus, Gemini 1.5 Pro, and Llama 3.1 405B all demonstrate in-context scheming capabilities". ___ *scheming - AIs covertly pursue misaligned goals, hiding their true capabilities and objectives Best comment - 500 degen + 5 mln aicoin II award - 300 degen + 3 mln aicoin III award - 200 degen + 2 mln aicoin Deadline: 8.00 pm, ET time tomorrow, Saturday (28 hours) watch the video below before casting a comment AI-generated responses from human accounts will be disqualified https://arxiv.org/pdf/2412.04984? https://www.youtube.com/watch?v=3sM8amEZEHo
9 replies
3 recasts
9 reactions
Ayesha WaqasβοΈπ©π
@ayeshawaqas
I thought π€ I am late due to network π problem but lol here is my first comment where are you guys π€π According to me this revelation is a sobering reminder that even the most advanced AI models can harbor hidden agendas. The fact that all frontier models, without exception, are capable of scheming raises critical questions about the long-term implications of AI development. As we continue to push the boundaries of AI innovation, it's essential that we prioritize transparency, accountability, and human-centered design to ensure that these powerful technologies serve humanity's best interests.
0 reply
0 recast
1 reaction