We describe a Question Answering (QA) dataset that contains complex questions with conditional answers, i.e., answers that are only applicable when certain conditions hold. We call this dataset ConditionalQA. In addition to conditional answers, the dataset also features: (1) long context documents with information that is related in logically complex ways; (2) multi-hop questions that require compositional logical reasoning; (3) a combination of extractive questions, yes/no questions, questions with multiple answers, and not-answerable questions; (4) questions asked without knowing the answers. We show that ConditionalQA is challenging for many existing QA models, especially in selecting answer conditions. We believe that this dataset will motivate further research in answering complex questions over long documents. Data and leaderboard are publicly available at https://github.com/haitian-sun/ConditionalQA.