重新思考文本属性传输:词汇分析 (Rethinking Text Attribute Transfer: A Lexical Analysis)

Text attribute transfer is modifying certain linguistic attributes (e.g. sentiment, style, authorship, etc.) of a sentence and transforming them from one type to another. In this paper, we aim to analyze and interpret what is changed during the transfer process. We start from the observation that in many existing models and datasets, certain words within a sentence play important roles in determining the sentence attribute class. These words are referred to as \textit{the Pivot Words}. Based on these pivot words, we propose a lexical analysis framework, \textit{the Pivot Analysis}, to quantitatively analyze the effects of these words in text attribute classification and transfer. We apply this framework to existing datasets and models and show that: (1) the pivot words are strong features for the classification of sentence attributes; (2) to change the attribute of a sentence, many datasets only requires to change certain pivot words; (3) consequently, many transfer models only perform the lexical-level modification, while leaving higher-level sentence structures unchanged. Our work provides an in-depth understanding of linguistic attribute transfer and further identifies the future requirements and challenges of this task\footnote{Our code can be found at https://github.com/FranxYao/pivot_analysis}.

翻译：文本属性传输正在修改某句的某些语言属性( 如情绪、风格、作者等), 并将其从一种类型转换为另一种类型。在本文中, 我们的目标是分析和解释在传输过程中变化的内容。我们从这样的观察开始: 在许多现有的模型和数据集中, 句子中的某些词在确定句子属性类别方面起着重要作用。这些词被称为 \ textit{ the Pivot Words} 。基于这些主词, 我们提议了一个词汇分析框架,\ textit{ the Pivot 分析}, 以定量方式分析这些词在文本属性分类和传输中的效果。我们将这个框架应用到现有的数据集和模型中, 并显示:(1) 语重词是判决属性分类的强性特征; (2) 更改句子属性, 许多数据集只需要更改某些参数; (3) 因此, 许多传输模型只执行词汇级修改, 而同时保持更高层次的句级结构不变。我们的工作提供了对语言属性传输的深入理解, 并进一步确定了未来定义/ MAGI/ frotoimal 。