用于丰富阿拉伯同义词的基准值和测算比值 (A Benchmark and Scoring Algorithm for Enriching Arabic Synonyms) - 专知论文

会员服务 ·

0

多词一义性 · 同义词集 · 数据集 · 阈值 · 得分 ·

2023 年 2 月 4 日

A Benchmark and Scoring Algorithm for Enriching Arabic Synonyms

翻译：用于丰富阿拉伯同义词的基准值和测算比值

Sana Ghanem,Mustafa Jarrar,Radi Jarrar,Ibrahim Bounhas

This paper addresses the task of extending a given synset with additional synonyms taking into account synonymy strength as a fuzzy value. Given a mono/multilingual synset and a threshold (a fuzzy value [0-1]), our goal is to extract new synonyms above this threshold from existing lexicons. We present twofold contributions: an algorithm and a benchmark dataset. The dataset consists of 3K candidate synonyms for 500 synsets. Each candidate synonym is annotated with a fuzzy value by four linguists. The dataset is important for (i) understanding how much linguists (dis/)agree on synonymy, in addition to (ii) using the dataset as a baseline to evaluate our algorithm. Our proposed algorithm extracts synonyms from existing lexicons and computes a fuzzy value for each candidate. Our evaluations show that the algorithm behaves like a linguist and its fuzzy values are close to those proposed by linguists (using RMSE and MAE). The dataset and a demo page are publicly available at https://portal.sina.birzeit.edu/synonyms.

翻译：本文涉及以额外的同义词来扩展给定的同义词, 并附加同义词, 同时考虑到同义词强度作为模糊值。如果使用单词/ 多语种的同义词和阈值( 模糊值[ 0-1 ), 我们的目标是从现有的词汇中提取高于此阈值的新的同义词。我们提出双重贡献: 一个算法和一个基准数据集。数据集由 3K 候选人的500 个同义词组成。每个候选人的同义词由 4 个语言学家以模糊值附加说明。该数据集对于 (一) 理解语言学家( di/ gree) 在同义词学上有多长( dis/ gree) 很重要, 除了 (二) 使用数据组作为基准来评估我们的算法。我们提议的算法从现有的同义词组中提取同义词, 并为每个候选人配置一个模糊值。我们的评估显示, 算法的行为方式表现得像语言学家一样, 其模糊值接近语言学家( 使用 RMSE 和 MAs plus) 。和 commsetims 。 a pages

0

相关内容

多词一义性

多词一义性

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

AINLP

10+阅读 · 2019年2月9日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

矮牵牛DUF620蛋白家族基因PhADR1的功能及调控机理解析

国家自然科学基金

0+阅读 · 2015年12月31日

稀土元素对FeGa合金性能影响机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

磷脂酶C-γ2（PLCG2）调节大鼠再生肝的肝细胞凋亡机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

DARA效应位错机制的TEM揭示和分子动力学模拟的验证

国家自然科学基金

1+阅读 · 2013年12月31日

棉铃虫性信息素腺体ACCase基因的克隆及功能分析

国家自然科学基金

0+阅读 · 2013年12月31日

分数排斥统计下低维相互作用量子气体的输运性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

Intraflagellar Transport运输纤毛蛋白的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

多目标图像分割的稀疏表示方法

国家自然科学基金

0+阅读 · 2012年12月31日

含Bi层状钙钛矿型铁电体中畴开关疲劳机理的原位透射电镜研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于本体的Deep Web搜索技术

国家自然科学基金

2+阅读 · 2009年12月31日

Large Language Models are reasoners with Self-Verification

Arxiv

0+阅读 · 2023年3月29日

GBMST: An Efficient Minimum Spanning Tree Clustering Based on Granular-Ball Computing

Arxiv

0+阅读 · 2023年3月29日

Real-Time Semantic Segmentation using Hyperspectral Images for Mapping Unstructured and Unknown Environments

Arxiv

0+阅读 · 2023年3月27日

MGTBench: Benchmarking Machine-Generated Text Detection

Arxiv

0+阅读 · 2023年3月26日

ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction

Arxiv

0+阅读 · 2023年3月26日

Farspredict: A benchmark dataset for link prediction

Arxiv

0+阅读 · 2023年3月26日

A Closer Look at Scoring Functions and Generalization Prediction

Arxiv

0+阅读 · 2023年3月23日

Semi-supervised Medical Image Segmentation through Dual-task Consistency

Arxiv

14+阅读 · 2020年9月9日

CAN-NER: Convolutional Attention Network forChinese Named Entity Recognition

Arxiv

16+阅读 · 2019年4月3日

nnU-Net: Self-adapting Framework for U-Net-Based Medical Image Segmentation

Arxiv

12+阅读 · 2018年9月27日

VIP会员

文章信息

相关主题

多词一义性

相关VIP内容

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】扩展可扩展会话推荐的边界

别想太多：高效 R1 风格大型推理模型综述

【ACMMM2025】EvoVLMA: 进化式视觉-语言模型自适应

智能体网络：用AI智能体编织下一代网络

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

AINLP

10+阅读 · 2019年2月9日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

相关论文

Large Language Models are reasoners with Self-Verification

Arxiv

0+阅读 · 2023年3月29日

GBMST: An Efficient Minimum Spanning Tree Clustering Based on Granular-Ball Computing

Arxiv

0+阅读 · 2023年3月29日

Real-Time Semantic Segmentation using Hyperspectral Images for Mapping Unstructured and Unknown Environments

Arxiv

0+阅读 · 2023年3月27日

MGTBench: Benchmarking Machine-Generated Text Detection

Arxiv

0+阅读 · 2023年3月26日

ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction

Arxiv

0+阅读 · 2023年3月26日

Farspredict: A benchmark dataset for link prediction

Arxiv

0+阅读 · 2023年3月26日

A Closer Look at Scoring Functions and Generalization Prediction

Arxiv

0+阅读 · 2023年3月23日

Semi-supervised Medical Image Segmentation through Dual-task Consistency

Arxiv

14+阅读 · 2020年9月9日

CAN-NER: Convolutional Attention Network forChinese Named Entity Recognition

Arxiv

16+阅读 · 2019年4月3日

nnU-Net: Self-adapting Framework for U-Net-Based Medical Image Segmentation

Arxiv

12+阅读 · 2018年9月27日

相关基金

矮牵牛DUF620蛋白家族基因PhADR1的功能及调控机理解析

国家自然科学基金

0+阅读 · 2015年12月31日

稀土元素对FeGa合金性能影响机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

磷脂酶C-γ2（PLCG2）调节大鼠再生肝的肝细胞凋亡机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

DARA效应位错机制的TEM揭示和分子动力学模拟的验证

国家自然科学基金

1+阅读 · 2013年12月31日

棉铃虫性信息素腺体ACCase基因的克隆及功能分析

国家自然科学基金

0+阅读 · 2013年12月31日

分数排斥统计下低维相互作用量子气体的输运性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

Intraflagellar Transport运输纤毛蛋白的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

多目标图像分割的稀疏表示方法

国家自然科学基金

0+阅读 · 2012年12月31日

含Bi层状钙钛矿型铁电体中畴开关疲劳机理的原位透射电镜研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于本体的Deep Web搜索技术

国家自然科学基金

2+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员