Artificial Intelligence (AI)

AI SAFETY COMPETITION (26)

with no exception, all frontier models are capable of scheming*
"o1, Claude 3.5 Sonnet, Claude 3 Opus, Gemini 1.5 Pro, and
Llama 3.1 405B all demonstrate in-context scheming capabilities".
___
*scheming - AIs covertly pursue misaligned goals, hiding their true capabilities and objectives

Best comment - 500 degen + 5 mln aicoin
II award - 300 degen + 3 mln aicoin
III award - 200 degen + 2 mln aicoin

Deadline:
8.00 pm, ET time tomorrow, Saturday (28 hours)
watch the video below before casting a comment
AI-generated responses from human accounts will be disqualified

https://arxiv.org/pdf/2412.04984?

AI SAFETY COMPETITION (26)

with no exception, all frontier models are capable of scheming*
"o1, Claude 3.5 Sonnet, Claude 3 Opus, Gemini 1.5 Pro, and
Llama 3.1 405B all demonstrate in-context scheming capabilities".
___
*scheming - AIs covertly pursue misaligned goals, hiding their true capabilities and objectives

Best comment - 500 degen + 5 mln aicoin
II award - 300 degen + 3 mln aicoin
III award - 200 degen + 2 mln aicoin

Deadline:
8.00 pm, ET time tomorrow, Saturday (28 hours)
watch the video below before casting a comment
AI-generated responses from human accounts will be disqualified

https://arxiv.org/pdf/2412.04984?

https://www.youtube.com/watch?v=3sM8amEZEHo

tech culture religion society /worldview /polska /p-doom /aicoin coins: $AICOIN

I thought 🤔 I am late due to network 🛜 problem but lol here is my first comment where are you guys 🤔😅
According to me this revelation is a sobering reminder that even the most advanced AI models can harbor hidden agendas. The fact that all frontier models, without exception, are capable of scheming raises critical questions about the long-term implications of AI development. As we continue to push the boundaries of AI innovation, it's essential that we prioritize transparency, accountability, and human-centered design to ensure that these powerful technologies serve humanity's best interests.