Readability assessment aims to evaluate the reading difficulty of a text. In recent years, while deep learning technology has been gradually applied to readability assessment, most approaches fail to consider either the length of the text or the ordinal relationship of readability labels. This paper proposes a bidirectional readability assessment mechanism that captures contextual information to identify regions with rich semantic information in the text, thereby predicting the readability level of individual sentences. These sentence-level labels are then used to assist in predicting the overall readability level of the document. Additionally, a pairwise sorting algorithm is introduced to model the ordinal relationship between readability levels through label subtraction. Experimental results on Chinese and English datasets demonstrate that the proposed model achieves competitive performance and outperforms other baseline models.
翻译:可读性评估旨在评估文本的阅读难度。近年来,尽管深度学习技术已逐步应用于可读性评估,但大多数方法未能同时考虑文本长度或可读性标签的序数关系。本文提出了一种双向可读性评估机制,通过捕捉上下文信息来识别文本中富含语义信息的区域,从而预测单个句子的可读性等级。这些句子级标签随后用于辅助预测文档的整体可读性等级。此外,引入了一种成对排序算法,通过标签差分来建模可读性等级之间的序数关系。在中文和英文数据集上的实验结果表明,所提出的模型实现了具有竞争力的性能,并优于其他基线模型。