听我说：关于使用语音模态进行群众采集相关性评估的研究 (Hear Me Out: A Study on the Use of the Voice Modality for Crowdsourced Relevance Assessments) - 专知论文

会员服务 ·

0

相关性 · 模态 · 呈现 · 接口 · 用户研究 ·

2023 年 4 月 21 日

Hear Me Out: A Study on the Use of the Voice Modality for Crowdsourced Relevance Assessments

翻译：听我说：关于使用语音模态进行群众采集相关性评估的研究

Nirmal Roy,Agathe Balayn,David Maxwell,Claudia Hauff

from arxiv, Accepted at SIGIR 2023

The creation of relevance assessments by human assessors (often nowadays crowdworkers) is a vital step when building IR test collections. Prior works have investigated assessor quality & behaviour, though into the impact of a document's presentation modality on assessor efficiency and effectiveness. Given the rise of voice-based interfaces, we investigate whether it is feasible for assessors to judge the relevance of text documents via a voice-based interface. We ran a user study (n = 49) on a crowdsourcing platform where participants judged the relevance of short and long documents sampled from the TREC Deep Learning corpus-presented to them either in the text or voice modality. We found that: (i) participants are equally accurate in their judgements across both the text and voice modality; (ii) with increased document length it takes participants significantly longer (for documents of length > 120 words it takes almost twice as much time) to make relevance judgements in the voice condition; and (iii) the ability of assessors to ignore stimuli that are not relevant (i.e., inhibition) impacts the assessment quality in the voice modality-assessors with higher inhibition are significantly more accurate than those with lower inhibition. Our results indicate that we can reliably leverage the voice modality as a means to effectively collect relevance labels from crowdworkers.

翻译：通过人类评估员（如今通常是众包工人）创建相关性评估是构建信息检索测试集的至关重要的步骤。以往的研究调查了评估员的质量和行为，但没有考虑到文档呈现模态对评估员效率和效果的影响。考虑到基于语音接口的兴起，本研究调查了评估员是否可以通过语音接口判断文本文档的相关性的可行性。我们在一个众包平台上进行了用户研究（n=49），参与者评估了抽样自TREC深度学习语料库的短文档和长文档，以文字或语音形式呈现给他们。我们发现：（i）参与者在文本和语音模态下的判断准确性相同；（ii）随着文档长度的增加，在语音条件下进行相关性判断需要的时间明显更长（对于长度>120个单词的文档，需要的时间几乎增长了一倍）；（iii）评估员忽略不相关刺激的能力（即抑制）对语音模态下的评估质量有影响-抑制更高的评估员比抑制较低的评估员更准确。我们的研究结果表明，我们可以可靠地利用语音模态作为从众包工人中有效收集相关标签的手段。

0

相关内容

相关性

多模态人机交互综述

多模态人机交互综述

专知会员服务

127+阅读 · 2022年7月3日

【Meta AI】多模态理解研究进展，Advances in multimodal understanding research at Meta AI

【Meta AI】多模态理解研究进展，Advances in multimodal understanding research at Meta AI

专知会员服务

59+阅读 · 2022年3月20日

Artificial Intelligence: Ready to Ride the Wave? BCG 28页PPT

Artificial Intelligence: Ready to Ride the Wave? BCG 28页PPT

专知会员服务

26+阅读 · 2022年2月20日

WWW21最新「比较学习」教程，135页PPT阐述从排名数据中学习

专知会员服务

36+阅读 · 2021年4月27日

【论文推荐】文本摘要简述

【论文推荐】文本摘要简述

专知会员服务

67+阅读 · 2020年7月20日

【KDD2020】多任务多关系嵌入的Twitter意识形态检测，TIMME-Twitter Ideology-detection via Multi-task Multi-relational Embedding

【KDD2020】多任务多关系嵌入的Twitter意识形态检测，TIMME-Twitter Ideology-detection via Multi-task Multi-relational Embedding

专知会员服务

17+阅读 · 2020年6月8日

【WWW2020-北京大学】多模态多轮对话系统，Multi-Modality in Multi-Turn Dialog

【WWW2020-北京大学】多模态多轮对话系统，Multi-Modality in Multi-Turn Dialog

专知会员服务

56+阅读 · 2020年3月13日

【NLP| 推荐文章】神经网络方法的机器阅读理解：方法与趋势（Neural Machine Reading Comprehension：Methods and Trends）

专知会员服务

40+阅读 · 2019年11月24日

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

专知会员服务

12+阅读 · 2019年11月15日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

39+阅读 · 2019年10月9日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

14+阅读 · 2019年4月13日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【论文推荐】最新六篇命名实体识别相关论文—跨专业医学、阿拉伯命名实体、中国临床、深度多任务学习、多模态、图卷积网络

【论文推荐】最新六篇命名实体识别相关论文—跨专业医学、阿拉伯命名实体、中国临床、深度多任务学习、多模态、图卷积网络

专知

54+阅读 · 2018年5月21日

【论文推荐】最新七篇推荐系统相关论文—影响兴趣、知识Embeddings、音乐推荐、非结构化、一致性、显式和隐式特征、知识图谱

【论文推荐】最新七篇推荐系统相关论文—影响兴趣、知识Embeddings、音乐推荐、非结构化、一致性、显式和隐式特征、知识图谱

专知

14+阅读 · 2018年3月28日

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

专知

12+阅读 · 2018年3月24日

【论文推荐】最新7篇聊天机器人（Chatbot）相关论文—触动你的心、DeepProbe、饮食推荐、知识学习、交互、挑战、管理

【论文推荐】最新7篇聊天机器人（Chatbot）相关论文—触动你的心、DeepProbe、饮食推荐、知识学习、交互、挑战、管理

专知

12+阅读 · 2018年3月15日

面向社会媒体的情感倾向分析方法研究

国家自然科学基金

3+阅读 · 2014年12月31日

基于格理论的社交网络访问控制方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于多语用户模型的个性化跨语言信息检索研究

国家自然科学基金

2+阅读 · 2013年12月31日

基于用户模型的移动设备可用性评估方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

中文自动口语摘要技术研究

国家自然科学基金

1+阅读 · 2011年12月31日

基于多模态概率主题模型的实体相关文本可视化

国家自然科学基金

1+阅读 · 2011年12月31日

音频信号处理中基于模型的语音与音乐信号分离算法

国家自然科学基金

1+阅读 · 2009年12月31日

核因子κ#20171;导醛固酮通过NHE1所致肾小球硬化的研究

国家自然科学基金

0+阅读 · 2009年12月31日

面向查询的XML文本自动文摘研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于压缩域听觉谱的音频分类与检索算法研究

国家自然科学基金

0+阅读 · 2008年12月31日

On the Reliability of Watermarks for Large Language Models

Arxiv

0+阅读 · 2023年6月7日

The Role of Relevance in Fair Ranking

Arxiv

0+阅读 · 2023年6月6日

Scalable Concept Extraction in Industry 4.0

Arxiv

0+阅读 · 2023年6月6日

In Search of Insights, Not Magic Bullets: Towards Demystification of the Model Selection Dilemma in Heterogeneous Treatment Effect Estimation

Arxiv

0+阅读 · 2023年6月6日

Which Argumentative Aspects of Hate Speech in Social Media can be reliably identified?

Arxiv

0+阅读 · 2023年6月5日

Understanding Self-Efficacy in the Context of Software Engineering: A Qualitative Study in the Industry

Arxiv

0+阅读 · 2023年6月2日

Do We Need Explainable AI in Companies? Investigation of Challenges, Expectations, and Chances from Employees' Perspective

Arxiv

0+阅读 · 2023年6月2日

"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

Arxiv

0+阅读 · 2023年6月1日

A Survey of Human-in-the-loop for Machine Learning

Arxiv

35+阅读 · 2021年8月2日

Blockchain for Future Smart Grid: A Comprehensive Survey

Blockchain for Future Smart Grid: A Comprehensive Survey

Arxiv

20+阅读 · 2019年11月8日

VIP会员

文章信息

相关主题

相关VIP内容

多模态人机交互综述

多模态人机交互综述

专知会员服务

127+阅读 · 2022年7月3日

【Meta AI】多模态理解研究进展，Advances in multimodal understanding research at Meta AI

【Meta AI】多模态理解研究进展，Advances in multimodal understanding research at Meta AI

专知会员服务

59+阅读 · 2022年3月20日

Artificial Intelligence: Ready to Ride the Wave? BCG 28页PPT

Artificial Intelligence: Ready to Ride the Wave? BCG 28页PPT

专知会员服务

26+阅读 · 2022年2月20日

WWW21最新「比较学习」教程，135页PPT阐述从排名数据中学习

专知会员服务

36+阅读 · 2021年4月27日

【论文推荐】文本摘要简述

【论文推荐】文本摘要简述

专知会员服务

67+阅读 · 2020年7月20日

【KDD2020】多任务多关系嵌入的Twitter意识形态检测，TIMME-Twitter Ideology-detection via Multi-task Multi-relational Embedding

【KDD2020】多任务多关系嵌入的Twitter意识形态检测，TIMME-Twitter Ideology-detection via Multi-task Multi-relational Embedding

专知会员服务

17+阅读 · 2020年6月8日

【WWW2020-北京大学】多模态多轮对话系统，Multi-Modality in Multi-Turn Dialog

【WWW2020-北京大学】多模态多轮对话系统，Multi-Modality in Multi-Turn Dialog

专知会员服务

56+阅读 · 2020年3月13日

【NLP| 推荐文章】神经网络方法的机器阅读理解：方法与趋势（Neural Machine Reading Comprehension：Methods and Trends）

专知会员服务

40+阅读 · 2019年11月24日

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

专知会员服务

12+阅读 · 2019年11月15日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

39+阅读 · 2019年10月9日

热门VIP内容

相关资讯

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

14+阅读 · 2019年4月13日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【论文推荐】最新六篇命名实体识别相关论文—跨专业医学、阿拉伯命名实体、中国临床、深度多任务学习、多模态、图卷积网络

【论文推荐】最新六篇命名实体识别相关论文—跨专业医学、阿拉伯命名实体、中国临床、深度多任务学习、多模态、图卷积网络

专知

54+阅读 · 2018年5月21日

【论文推荐】最新七篇推荐系统相关论文—影响兴趣、知识Embeddings、音乐推荐、非结构化、一致性、显式和隐式特征、知识图谱

【论文推荐】最新七篇推荐系统相关论文—影响兴趣、知识Embeddings、音乐推荐、非结构化、一致性、显式和隐式特征、知识图谱

专知

14+阅读 · 2018年3月28日

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

专知

12+阅读 · 2018年3月24日

【论文推荐】最新7篇聊天机器人（Chatbot）相关论文—触动你的心、DeepProbe、饮食推荐、知识学习、交互、挑战、管理

【论文推荐】最新7篇聊天机器人（Chatbot）相关论文—触动你的心、DeepProbe、饮食推荐、知识学习、交互、挑战、管理

专知

12+阅读 · 2018年3月15日

相关论文

On the Reliability of Watermarks for Large Language Models

Arxiv

0+阅读 · 2023年6月7日

The Role of Relevance in Fair Ranking

Arxiv

0+阅读 · 2023年6月6日

Scalable Concept Extraction in Industry 4.0

Arxiv

0+阅读 · 2023年6月6日

In Search of Insights, Not Magic Bullets: Towards Demystification of the Model Selection Dilemma in Heterogeneous Treatment Effect Estimation

Arxiv

0+阅读 · 2023年6月6日

Which Argumentative Aspects of Hate Speech in Social Media can be reliably identified?

Arxiv

0+阅读 · 2023年6月5日

Understanding Self-Efficacy in the Context of Software Engineering: A Qualitative Study in the Industry

Arxiv

0+阅读 · 2023年6月2日

Do We Need Explainable AI in Companies? Investigation of Challenges, Expectations, and Chances from Employees' Perspective

Arxiv

0+阅读 · 2023年6月2日

"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

Arxiv

0+阅读 · 2023年6月1日

A Survey of Human-in-the-loop for Machine Learning

Arxiv

35+阅读 · 2021年8月2日

Blockchain for Future Smart Grid: A Comprehensive Survey

Blockchain for Future Smart Grid: A Comprehensive Survey

Arxiv

20+阅读 · 2019年11月8日

相关基金

面向社会媒体的情感倾向分析方法研究

国家自然科学基金

3+阅读 · 2014年12月31日

基于格理论的社交网络访问控制方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于多语用户模型的个性化跨语言信息检索研究

国家自然科学基金

2+阅读 · 2013年12月31日

基于用户模型的移动设备可用性评估方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

中文自动口语摘要技术研究

国家自然科学基金

1+阅读 · 2011年12月31日

基于多模态概率主题模型的实体相关文本可视化

国家自然科学基金

1+阅读 · 2011年12月31日

音频信号处理中基于模型的语音与音乐信号分离算法

国家自然科学基金

1+阅读 · 2009年12月31日

核因子κ#20171;导醛固酮通过NHE1所致肾小球硬化的研究

国家自然科学基金

0+阅读 · 2009年12月31日

面向查询的XML文本自动文摘研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于压缩域听觉谱的音频分类与检索算法研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员