arxiv:2403.17359

Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models

Published on Mar 26, 2024

Authors:

Zhenyu Pan

Abstract

We present a Chain-of-Action (CoA) framework for multimodal and retrieval-augmented Question-Answering (QA). Compared to the literature, CoA overcomes two major challenges of current QA applications: (i) unfaithful hallucination that is inconsistent with real-time or domain facts and (ii) weak reasoning performance over compositional information. Our key contribution is a novel reasoning-retrieval mechanism that decomposes a complex question into a reasoning chain via systematic prompting and pre-designed actions. Methodologically, we propose three types of domain-adaptable "Plug-and-Play" actions for retrieving real-time information from heterogeneous sources. We also propose a multi-reference faith score (MRFS) to verify and resolve conflicts in the answers. Empirically, we exploit both public benchmarks and a Web3 case study to demonstrate the capability of CoA over other methods.
