Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation

ChatGPT, a large-scale language model based on the advanced GPT-3.5 architecture, has shown remarkable potential in various Natural Language Processing (NLP) tasks. However, there is currently a dearth of comprehensive studies exploring its potential in Grammatical Error Correction (GEC). To showcase its capabilities in GEC, we design zero-shot chain-of-thought (CoT) and few-shot CoT settings using in-context learning for ChatGPT. We evaluate ChatGPT on five official test sets in three different languages, along with three document-level GEC test sets in English. Our experimental results and human evaluations demonstrate that ChatGPT has excellent error detection capabilities and corrects errors freely, producing very fluent sentences, possibly because it tends to over-correct and does not adhere to the principle of minimal edits. Additionally, its performance in non-English and low-resource settings highlights its potential in multilingual GEC tasks. However, further analysis of error types at the document level shows that ChatGPT cannot effectively correct agreement, coreference, and tense errors across sentences, or cross-sentence boundary errors.
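The tension between fluency and the principle of minimal edits can be made concrete with a small sketch. The Python below is only an illustration, not the evaluation procedure used in the paper: the helper `count_edits` and the three example sentences are hypothetical, and the word-level diff from the standard library's `difflib` is a rough stand-in for a real GEC scorer.

```python
from difflib import SequenceMatcher


def count_edits(source: str, corrected: str) -> int:
    """Count word-level edit spans (replace, insert, delete) between two sentences.

    Hypothetical helper for illustration only; not a standard GEC metric.
    """
    src_tokens = source.split()
    cor_tokens = corrected.split()
    matcher = SequenceMatcher(a=src_tokens, b=cor_tokens, autojunk=False)
    # Every opcode that is not "equal" marks one contiguous edited span.
    return sum(1 for tag, *_ in matcher.get_opcodes() if tag != "equal")


# Invented example sentences (not taken from any test set).
source = "She go to school yesterday ."
minimal = "She went to school yesterday ."    # fixes only the verb form
fluent = "Yesterday , she went to school ."   # also reorders the sentence

print("minimal-edit correction:", count_edits(source, minimal))
print("fluent rewrite:", count_edits(source, fluent))
```

Both outputs are grammatical, but the fluent rewrite changes more of the source than the error requires. A reference-based scorer that expects minimal edits would count those extra changes against the system, which is one way over-correction can lower scores even when the output reads well.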

