LAILA：一个基于多维特征的大规模阿拉伯语自动作文评分数据集 (LAILA: A Large Trait-Based Dataset for Arabic Automated Essay Scoring)

Automated Essay Scoring (AES) has gained increasing attention in recent years, yet research on Arabic AES remains limited due to the lack of publicly available datasets. To address this, we introduce LAILA, the largest publicly available Arabic AES dataset to date, comprising 7,859 essays annotated with holistic and trait-specific scores on seven dimensions: relevance, organization, vocabulary, style, development, mechanics, and grammar. We detail the dataset design, collection, and annotations, and provide benchmark results using state-of-the-art Arabic and English models in prompt-specific and cross-prompt settings. LAILA fills a critical need in Arabic AES research, supporting the development of robust scoring systems.

翻译：自动作文评分（AES）近年来受到越来越多的关注，但由于缺乏公开可用的数据集，针对阿拉伯语的AES研究仍然有限。为应对这一问题，我们推出了LAILA，这是迄今为止规模最大的公开阿拉伯语AES数据集，包含7,859篇作文，并在七个维度上标注了整体分数与特征专项分数：相关性、组织结构、词汇、文体、内容展开、书写规范及语法。我们详细阐述了数据集的设计、收集与标注流程，并提供了在题目相关及跨题目设定下使用最先进的阿拉伯语与英语模型所得的基准测试结果。LAILA填补了阿拉伯语AES研究领域的关键空白，为开发稳健的评分系统提供了支持。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【CVPR 2022】基于粗粒度和细粒度特征匹配的视频描述评估，EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching

专知会员服务

10+阅读 · 2022年3月19日