LARS：一种评估LLM输出结果准确性概率的方法

对LLM输出结果的可靠性评估有较多的方法，例如

LLM准确率提升：LLM Self-Consistency多推理路径结果实现方式

当然也存在一些如Self-Detection, Self-Check等等。

这些问题都属于LLM的Uncertainty Estimation。

1. LLM Uncertainty Estimation

LLM Uncertainty Estimation也叫LLM不确定性评估，主要目的是评估LLM的输出结果好坏。表现为一个设计一个计算自信度分数模型，分数越高，则LLM的结果越准确，反之则越不可靠。

当前有多种方法来设计这样的一个自信度分数模型，例如：Self-checking方法、Output consistency方法、Internal state examination方法、 Token probability-based方法等等。

2.LLM直接对答案输出自信度分数

我们可以设计提示词来让LLM在生成结果的时候输出答案及其自信度分数，甚至我们也可以通过提示词对LLM输出的一个答案进行打分。然而这样的方式可靠么？答案是不可靠的。

LLM结果可靠性验证：直接输出结果自信分数是否可行？

一些研究也证明了这点，例如：

confidence calibration methods are observed with severe over-trust issue on LLM, assigning high confidence score in some incorrectly generated answers. In fact, LLM has a bias to blindly trust its generated answers, leading to difficulties in distinguishing the correctness of its generated answers

资料来源：Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection

3. LARS方法

LARS方法来源《Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs》，它是一种简单训练方法，且效果相比于当前常使用的不确定性评估方法来说要好。

LARS结构如下：

LARS方法结构图