PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark



Robert Belanec, Branislav Pecher, Ivan Srba, Maria Bielikova

Despite the state-of-the-art performance of Large Language Models (LLMs) achieved on many tasks, their massive scale often leads to high computational and environmental costs, limiting their accessibility. Parameter-efficient fine-tuning (PEFT) methods address this challenge by reducing the number of trainable parameters while maintaining strong downstream performance. Despite the increased development in PEFT methods, current evaluations remain limited (in terms of evaluated models and datasets) and difficult to reproduce. To bridge this gap, we introduce PEFT-Bench, a unified end-to-end benchmark for evaluating diverse PEFT methods on autoregressive LLMs. We demonstrate its usage across 27 NLP datasets and 6 PEFT methods. To account for different PEFT training and inference factors, we also introduce the PEFT Soft Score Penalties (PSCP) metric, which takes trainable parameters, inference speed, and training memory usage into account.
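
To make the claim about "reducing the number of trainable parameters" concrete, the following is a minimal sketch that compares the trainable weight count of fully fine-tuning one weight matrix with that of a low-rank adapter on the same matrix. Low-rank adaptation is used here only as a familiar illustration of a PEFT method; the abstract does not name the six evaluated methods, so this is an assumption rather than a description of the benchmark. The hidden size and rank are hypothetical values, and the sketch is not the PSCP metric, whose formula the abstract does not give.

```python
def full_finetune_params(d_in: int, d_out: int) -> int:
    """Trainable weights when the whole d_out x d_in matrix is updated."""
    return d_in * d_out


def low_rank_adapter_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable weights for a low-rank update B @ A, where
    A has shape (rank, d_in) and B has shape (d_out, rank)."""
    return rank * (d_in + d_out)


def trainable_fraction(d_in: int, d_out: int, rank: int) -> float:
    """Share of the full matrix's weights that the adapter trains."""
    return low_rank_adapter_params(d_in, d_out, rank) / full_finetune_params(d_in, d_out)


if __name__ == "__main__":
    hidden = 4096  # hypothetical hidden size, chosen for illustration
    rank = 8       # hypothetical adapter rank, chosen for illustration
    print("full fine-tuning:", full_finetune_params(hidden, hidden))
    print("low-rank adapter:", low_rank_adapter_params(hidden, hidden, rank))
    print(f"trainable fraction: {trainable_fraction(hidden, hidden, rank):.4%}")
```

Under these assumed sizes the adapter trains 65,536 weights against 16,777,216 for the full matrix, which is 1/256 of the original count. Parameter count is only one of the factors the abstract lists; inference speed and training memory usage have to be measured separately.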

