Why Does ChatGPT Fall Short in Providing Truthful Answers?

論文.icon

Date

2023-12-03

Abstract

Recent advancements in large language models, such as ChatGPT, have demonstrated signiﬁcant potential to impact various aspects of human life. However, ChatGPT still faces challenges in providing reliable and accurate answers to user questions. To better understand the model’s particular weaknesses in providing truthful answers, we embark an in-depth exploration of open-domain question answering. Speciﬁcally, we undertake a detailed examination of ChatGPT’s failures, categorized into: comprehension, factuality, speciﬁcity, and inference. We further pinpoint factuality as the most contributing failure and identify two critical abilities associated with factuality: knowledge memorization and knowledge recall. Through experiments focusing on factuality, we propose several potential enhancement strategies. Our ﬁndings suggest that augmenting the model with granular external knowledge and cues for knowledge recall can enhance the model’s factuality in answering questions.

どんなもの?

先行研究と比べてどこがすごい?

技術や手法のキモはどこ?

どうやって有効だと検証した?

議論はある?

次に読むべき論文は?

Authors

Shen_Zheng

Jie_Huang

Kevin Chen-Chuan_Chang