IndicGEC: Powerful Models, or a Measurement Mirage?

From arXiv, technical report

This paper reports the results of TeamNRC's participation in the BHASHA-Task 1 Grammatical Error Correction shared task (https://github.com/BHASHA-Workshop/IndicGEC2025/) for 5 Indian languages. Our approach, which focuses on zero/few-shot prompting of language models of varying sizes (from 4B to large proprietary models), achieved Rank 4 in Telugu and Rank 2 in Hindi, with GLEU scores of 83.78 and 84.31 respectively. We also extend the experiments to the other three languages of the shared task (Tamil, Malayalam and Bangla) and take a closer look at the data quality and the evaluation metric used. Our results primarily highlight the potential of small language models, and summarize the concerns involved in creating good-quality datasets and appropriate metrics for this task that suit Indian language scripts.
