Natural language interfaces to tabular data must handle ambiguities inherent to queries. Instead of treating ambiguity as a deficiency, we reframe it as a feature of cooperative interaction where users are intentional about the degree to which they specify queries. We develop a principled framework based on a shared responsibility of query specification between user and system, distinguishing unambiguous and ambiguous cooperative queries, which systems can resolve through reasonable inference, from uncooperative queries that cannot be resolved. Applying the framework to evaluations for tabular question answering and analysis, we analyze the queries in 15 popular datasets, and observe an uncontrolled mixing of query types neither adequate for evaluating a system's execution accuracy nor for evaluating interpretation capabilities. This conceptualization around cooperation in resolving queries informs how to design and evaluate natural language interfaces for tabular data analysis, for which we distill concrete directions for future research and broader implications.
翻译:面向表格数据的自然语言接口必须处理查询中固有的模糊性。我们并非将模糊性视为缺陷,而是将其重构为协作交互的一种特征,即用户有意控制查询的明确程度。我们基于用户与系统之间查询规范的共同责任,建立了一个原则性框架,区分了明确查询与模糊协作查询——后者可通过合理推理由系统解析,以及无法解析的非协作查询。将该框架应用于表格问答与分析任务的评估中,我们对15个常用数据集中的查询进行了分析,发现其中混杂了不同类型的查询,这既不足以评估系统的执行准确性,也无法有效评估其解释能力。这种围绕查询解析协作性的概念化,为设计和评估面向表格数据分析的自然语言接口提供了指导,我们据此提炼出未来研究的具体方向及更广泛的影响。