Eito Miyamura
@eito
Loved hearing @clefourrier on @latentspacepod on missing in current LLM benchmarks! In particular Calibration - In QA contexts, how calibrated are the log likelihood probabilities for the correct answers? This is key for "measuring hallucination" in LLMs, and defo the way forward
1 reply
0 recast
0 reaction