for love and money: ai research, community management, hard sci fi scriptwriting // for only love: startups, ukulele, blogging

Please give a warm welcome to my friend @rayd 
He’s an AI researcher with a paper out soon, working on making language models better at acting as agents and on helping models resist prompt injection attacks (out on arXiv tomorrow).

He’s explained a bit about his research in the comments.

Be specific.    
Building productscore.org, @germanify // leonasskau.co.uk // Hosting /geopolitics, /strategy, /leo
d33m:mun

Glad to be here! 

The first paper is on why base models doing offline prediction are theoretically doomed to hallucinate, etc., until they get external feedback, and how the same underlying property makes it hard for them to generalise to strategies better than those in their training data (1/-)

The second is on taking tasks where larger models do worse (like prompt injection) and helping models do better (30% gain) by treating them as mixtures of distributions and downweighting the bad distribution
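
For readers who want a concrete picture of the mixture idea, here is a toy sketch in Python. It is not the paper's method: it assumes the model's output distribution can be written as a weighted mix of a "good" and a "bad" component, and that both the bad component and its weight are already known, which in practice would have to be estimated. The function name `downweight_bad_component` and the parameters `bad_weight` and `keep` are made up for illustration.

```python
def downweight_bad_component(p_mix, p_bad, bad_weight, keep=0.0):
    """Shrink the bad component's share of a two-part mixture.

    Assumes p_mix = (1 - bad_weight) * p_good + bad_weight * p_bad.
    keep=0.0 removes the bad component entirely; keep=1.0 leaves p_mix unchanged.
    All names and parameters here are hypothetical, for illustration only.
    """
    removed = (1.0 - keep) * bad_weight
    # Subtract the unwanted share; clamp at zero in case the estimates are off.
    adjusted = [max(m - removed * b, 0.0) for m, b in zip(p_mix, p_bad)]
    total = sum(adjusted)
    if total == 0.0:
        raise ValueError("nothing left after downweighting")
    # Renormalise so the result is a probability distribution again.
    return [a / total for a in adjusted]


# Toy distributions over three possible outputs.
p_good = [0.7, 0.2, 0.1]   # e.g. following the user's instruction
p_bad = [0.1, 0.1, 0.8]    # e.g. following an injected instruction
p_mix = [0.6 * g + 0.4 * b for g, b in zip(p_good, p_bad)]

recovered = downweight_bad_component(p_mix, p_bad, bad_weight=0.4)
```

With exact knowledge of the bad component, `recovered` matches `p_good` up to floating-point error; the hard part in a real setting is estimating that component.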