Translations:FACTS About Building Retrieval Augmented Generation-based Chatbots/50/zh: Difference between revisions

Latest revision as of 08:52, 19 February 2025


Message definition (FACTS About Building Retrieval Augmented Generation-based Chatbots)

Testing generative AI solutions can be a lengthy process because responses need to be validated by humans. LLMs are increasingly being used as evaluators through the ‘LLM-as-a-judge’ approach. However, caution is advisable when using an LLM as a proxy for human judgment: an LLM judge can create a self-fulfilling prophecy, in which the model's inherent biases are reinforced in the very evaluations it produces.

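Because responses require human validation, testing generative AI solutions can be a lengthy process. Large language models are increasingly being applied using the "LLM-as-a-judge" approach. However, caution is advised when treating a large language model as a human proxy, because using large language models as judges can lead to self-fulfilling-prophecy scenarios that reinforce their inherent biases in the evaluations.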
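One way to act on this caution is to keep a small human-labeled sample and measure how closely the LLM judge agrees with it before trusting the judge at scale. The sketch below is illustrative only and is not taken from the source: the label lists are made-up data, the names `pass_rate` and `cohen_kappa` are hypothetical helpers, and Cohen's kappa is just one common chance-corrected agreement measure among several that could be used.

```python
from collections import Counter


def pass_rate(labels):
    """Fraction of responses labeled "pass"."""
    return sum(label == "pass" for label in labels) / len(labels)


def cohen_kappa(human, judge):
    """Cohen's kappa: agreement between two raters, corrected for chance."""
    if len(human) != len(judge) or not human:
        raise ValueError("label lists must be non-empty and of equal length")
    n = len(human)
    # Observed agreement: share of items where both raters gave the same label.
    observed = sum(h == j for h, j in zip(human, judge)) / n
    # Expected agreement if both raters labeled independently at their own rates.
    human_counts = Counter(human)
    judge_counts = Counter(judge)
    expected = sum(human_counts[c] * judge_counts[c] for c in human_counts) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1.0 - expected)


# Hypothetical spot-check sample: the same eight chatbot responses,
# labeled once by human reviewers and once by an LLM judge.
human_labels = ["pass", "pass", "fail", "pass", "fail", "fail", "pass", "pass"]
judge_labels = ["pass", "pass", "pass", "pass", "fail", "pass", "pass", "pass"]

print(f"human pass rate: {pass_rate(human_labels):.3f}")
print(f"judge pass rate: {pass_rate(judge_labels):.3f}")
print(f"Cohen's kappa:   {cohen_kappa(human_labels, judge_labels):.3f}")
```

In this made-up sample the judge passes more responses than the human reviewers do, and the kappa value is well below 1, which is the kind of signal that would argue for keeping humans in the validation loop rather than relying on the judge alone.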