hellno the optimist
@hellno.eth
I want to add an eval framework for @vibesengineering.eth to improve the quality of generated mini apps.
- have past user input and LLM responses, a RAG system for docs and LLM prompts
- want to run integration tests as a black box: user input → does the output roughly include what I want?

need recommendations:
- Python frameworks (deepeval seems like a good fit?!)
- best practices for how to set up and keep improving this as a black box
- best practices to improve the core RAG system
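For illustration, the black-box check described above (user input → does the output roughly include what I want?) could be sketched roughly like this. It is a minimal, framework-free sketch and not deepeval's API: `EvalCase`, `keyword_score`, `run_evals`, and `fake_generate` are hypothetical names, and plain keyword matching is an assumed stand-in for whatever "roughly includes" should mean in practice.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvalCase:
    """One recorded user input plus what the output should roughly contain."""
    user_input: str
    expected_keywords: list[str]
    min_fraction: float = 0.7  # assumed pass threshold, tune per project


def keyword_score(output: str, keywords: list[str]) -> float:
    """Fraction of expected keywords found in the output (case-insensitive)."""
    if not keywords:
        return 1.0
    text = output.lower()
    hits = sum(1 for kw in keywords if kw.lower() in text)
    return hits / len(keywords)


def run_evals(generate: Callable[[str], str], cases: list[EvalCase]) -> list[dict]:
    """Treat `generate` as a black box: feed it input, score only the output."""
    results = []
    for case in cases:
        output = generate(case.user_input)
        score = keyword_score(output, case.expected_keywords)
        results.append({
            "input": case.user_input,
            "score": score,
            "passed": score >= case.min_fraction,
        })
    return results


def fake_generate(prompt: str) -> str:
    """Hypothetical stand-in for the real pipeline (RAG + LLM call)."""
    return "export default function App() { return <button>Mint NFT</button> }"


if __name__ == "__main__":
    cases = [
        EvalCase("build a mint button", ["button", "mint"]),
        EvalCase("build a leaderboard", ["leaderboard", "table"]),
    ]
    for result in run_evals(fake_generate, cases):
        print(result)
```

Swapping `fake_generate` for the real pipeline and building the cases from past user inputs would turn this into a regression suite. The keyword scorer could later be replaced with an LLM-judged metric without changing the runner.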

Royal
@royalaid.eth
cc @pirosb3 and @linda, you guys probably have good insights here too