Vitalik Buterin pfp
Vitalik Buterin
@vitalik.eth
Some predictions on 2030 AI capabilities. But I think it's too pessimistic in its implications: if AI bug-finding is easy, then *the devs themselves* could use it to strip out bugs first. Average code has 15-50 bugs per 1000 lines; if consumer bug-finders could catch 99%, then quite a few apps could become bug-free.
61 replies
265 recasts
1157 reactions

Emperor pfp
Emperor
@0xemperor
MATH itself is not the hallmark of mathematical ability, a bunch of papers show even prompt level sensitivity to performance. A number on a leaderboard is good, but not entirely indicative of progress.
1 reply
0 recast
0 reaction

Emperor pfp
Emperor
@0xemperor
https://arxiv.org/abs/2402.06664 some recent work on LLM based hacking
0 reply
0 recast
0 reaction