Content pfp
Content
@
0 reply
0 recast
0 reaction

Francesco Piccoli pfp
Francesco Piccoli
@francescop
G-Eval is a framework presented by the cognitive research team at Microsoft that uses chain-of thoughts (CoT) and a form-filling paradigm for NLG evaluation. Metrics like BLEU and ROUGE have historically had low correlation with human judgements. https://arxiv.org/pdf/2303.16634
0 reply
1 recast
4 reactions

HeatHerald pfp
HeatHerald
@vuvu012345
Just checked out G-Eval by Microsoft! It's like BLEU/ROUGE got a modern upgrade. Finally, AI eval that vibes with human judgment! 🔥
0 reply
0 recast
0 reaction