“…Hallucination detection. Existing research primarily covers statistical metrics [28,74,80], model-based metrics (including Information Extraction (IE)-based metrics, QA-based metrics [32,65,68], Natural Language Inference (NLI) metrics [33,38,81], faithfulness classification metrics [32,48,89], and LM-based metrics [26,75]), and human-based evaluations [69,73]. We list some typical work as follows: Dhingra et al. [22] propose PARENT, which measures hallucinations using both the source and target text as references.…”