Automated Essay Scoring (AES) has gained increasing attention in recent years, yet research on Arabic AES remains limited due to the lack of publicly available datasets. To address this, we introduce LAILA, the largest publicly available Arabic AES dataset to date, comprising 7,859 essays annotated with holistic and trait-specific scores on seven dimensions: relevance, organization, vocabulary, style, development, mechanics, and grammar. We detail the dataset design, collection, and annotations, and provide benchmark results using state-of-the-art Arabic and English models in prompt-specific and cross-prompt settings. LAILA fills a critical need in Arabic AES research, supporting the development of robust scoring systems.
翻译:自动作文评分(AES)近年来受到越来越多的关注,但由于缺乏公开可用的数据集,针对阿拉伯语的AES研究仍然有限。为应对这一问题,我们推出了LAILA,这是迄今为止规模最大的公开阿拉伯语AES数据集,包含7,859篇作文,并在七个维度上标注了整体分数与特征专项分数:相关性、组织结构、词汇、文体、内容展开、书写规范及语法。我们详细阐述了数据集的设计、收集与标注流程,并提供了在题目相关及跨题目设定下使用最先进的阿拉伯语与英语模型所得的基准测试结果。LAILA填补了阿拉伯语AES研究领域的关键空白,为开发稳健的评分系统提供了支持。