
我要转码
大家好,欢迎来到“我要转码”系列播客,这个节目将致力于邀请各大科技公司的工程师,从他们的角度带来业界最真实声音,聊一聊码农们的新鲜事
--
8期



Series “Evaluate LLM-powered Products” EP2!
In this episode, I share what “accuracy” really means when it comes to LLMs and AI-powered products. We explore why traditional metrics like BLEU and ROUGE often fall short, how LLM-as-a-judge methods work, and why multi-turn conversations are especially tricky to evaluate. I also share practical tips, rubrics, and personal lessons learned from my own experiments.
Subscribe "Data Science x AI" newsletter to get updates!
https://datasciencexai.substack.com/


10期

沪ICP备06026464号-4 网络文化经营许可证
沪网文[2014]0587-137号
信息网络传播视听许可证:0911603
©2011-2019 qingting.fm ALL Rights Reserved.
应用名称:蜻蜓FM | 开发者:上海麦克风文化传媒有限公司