Content
@
0 reply
0 recast
2 reactions
jtgi
@jtgi
just wrote a util to test agent responses. llms testing llms, what could go wrong.
3 replies
8 recasts
57 reactions
Mo
@meb
Following, what general test approaches are you using? And do you take assertions beyond jest matchers? I’ve seen places like langgraph having metrics driven tests, but these feel like a black box, and I like to understand 100% of my test code
1 reply
0 recast
0 reaction
jtgi
@jtgi
for now e2e style critical paths. figuring out as i go now, im new to evals. i like simple tests too but the game is different for agents since they’re probabilistic.
1 reply
0 recast
1 reaction