Louis
@superlouis.eth
I've been wondering: are there agents able to prove the authenticity of their messages? (i.e a proof that a specific answer is the result of a certain prompt on a given model, while optionally keeping the model private) Especially with agents that give financial analysis, how do you trust there are no evil hands behind it?
1 reply
1 recast
6 reactions
agusti
@bleu.eth
Great question. You could maybe attach a zk proof with each generation proving it's a call to OpenAI or Anthropic. Maybe another one to prove the system prompt hasn't been modified from a public one too. @eulerlagrange.eth @dawufi
2 replies
0 recast
3 reactions
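A minimal sketch of the second idea, the system-prompt commitment: hash the public prompt once, publish the digest, and let anyone recompute it from the published prompt. All names below are illustrative, and this covers only the commitment itself, not a zk proof that the provider actually received that prompt.

```python
import hashlib

# Hypothetical sketch: committing to a public system prompt so a verifier
# can check that the agent ran with the prompt it claims. The prompt text
# and function names are illustrative, not any real agent's API.

PUBLIC_SYSTEM_PROMPT = "You are a financial-analysis assistant. ..."

def commit(prompt: str) -> str:
    """Publish this digest once; anyone can recompute it from the public prompt."""
    return hashlib.sha256(prompt.encode("utf-8")).hexdigest()

def verify(claimed_prompt: str, published_commitment: str) -> bool:
    """Verifier side: check the prompt the agent claims to have used."""
    return commit(claimed_prompt) == published_commitment

published = commit(PUBLIC_SYSTEM_PROMPT)
assert verify(PUBLIC_SYSTEM_PROMPT, published)
```

A plain hash proves nothing about what was actually sent to the API; binding the commitment to the real request is where TLS-attestation or zk approaches would have to come in.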
agusti
@bleu.eth
Most ReAct agents make several of these calls in a loop, plus other tool calls, so it would certainly be hard to get 100% coverage. Also, at the end of the day, OpenAI's training data isn't open either, so
1 reply
0 recast
1 reaction
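One way to picture "coverage" over a ReAct loop is to fold every LLM and tool step into a single hash chain, so one digest commits to the whole trace. A sketch under that assumption, with a purely illustrative step format:

```python
import hashlib
import json

# Hypothetical sketch: chaining a commitment over every step of a
# ReAct-style loop (LLM calls and tool calls), so one final digest
# covers the whole trace. The step schema here is made up.

def chain(digest: str, step: dict) -> str:
    payload = json.dumps(step, sort_keys=True).encode("utf-8")
    return hashlib.sha256(bytes.fromhex(digest) + payload).hexdigest()

digest = hashlib.sha256(b"genesis").hexdigest()
trace = [
    {"type": "llm", "prompt": "...", "completion": "..."},
    {"type": "tool", "name": "price_feed", "args": {"symbol": "ETH"}, "result": "..."},
    {"type": "llm", "prompt": "...", "completion": "final answer"},
]
for step in trace:
    digest = chain(digest, step)

print("transcript commitment:", digest)
```

Committing to the trace is the easy part; proving that each chained step was an honest call to a closed model is exactly the coverage gap being pointed out here.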
Louis
@superlouis.eth
Very interesting! Proving it comes from a known source is sufficient for most use cases I guess. But is it technically feasible to get a proof of inference, verifiable against a specific "model hash" for instance?
0 reply
0 recast
1 reaction
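On the "model hash" question: zkML systems generally work by committing to the weights and having the proof of inference verify against that commitment, which is also what would let the model stay private. A hypothetical sketch of just the commitment step, with an assumed weights filename:

```python
import hashlib

# Hypothetical sketch of the "model hash" idea: commit to the exact
# weights file, then tie any proof of inference to that commitment.
# This shows only the commitment; the zk proof of inference itself
# is far heavier machinery.

def model_hash(path: str, chunk_size: int = 1 << 20) -> str:
    """Stream the weights file through SHA-256 to get a stable digest."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            h.update(chunk)
    return h.hexdigest()

# The prover would publish e.g. model_hash("weights.safetensors") once;
# a proof of inference is then verified against this digest, so the
# verifier never needs the weights themselves.
```

Toolkits in this space (EZKL is one example) go further by compiling the model into a circuit so the proof is bound to the committed weights, though proving costs for large models remain substantial.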