LogME: 对培训前转让学习模式的实际评估 (LogME: Practical Assessment of Pre-trained Models for Transfer Learning)

This paper studies task adaptive pre-trained model selection, an underexplored problem of assessing pre-trained models for the target task and select best ones from the model zoo \emph{without fine-tuning}. A few pilot works addressed the problem in transferring supervised pre-trained models to classification tasks, but they cannot handle emerging unsupervised pre-trained models or regression tasks. In pursuit of a practical assessment method, we propose to estimate the maximum value of label evidence given features extracted by pre-trained models. Unlike the maximum likelihood, the maximum evidence is \emph{immune to over-fitting}, while its expensive computation can be dramatically reduced by our carefully designed algorithm. The Logarithm of Maximum Evidence (LogME) can be used to assess pre-trained models for transfer learning: a pre-trained model with a high LogME value is likely to have good transfer performance. LogME is \emph{fast, accurate, and general}, characterizing itself as the first practical method for assessing pre-trained models. Compared with brute-force fine-tuning, LogME brings at most $3000\times$ speedup in wall-clock time and requires only $1\%$ memory footprint. It outperforms prior methods by a large margin in their setting and is applicable to new settings. It is general enough for diverse pre-trained models (supervised pre-trained and unsupervised pre-trained), downstream tasks (classification and regression), and modalities (vision and language). Code is available at this repository: \href{https://github.com/thuml/LogME}{https://github.com/thuml/LogME}.

翻译：本文研究任务适应预先培训模式选择适应适应受培训模式任务适应受培训模式, 一个未充分探讨的问题 : 评估目标任务。一些试点工作解决了将受监督接受培训模式转到分类任务的问题, 但是它们无法处理新出现的未经监督接受培训模式或回归任务。在追求一个实用的评估方法时, 我们提议估计由事先培训模式提取的标签证据的最大值。与最大可能性不同, 最大证据是 : 评估目标任务 : 评估目标任务 : 目标受培训 : 最大目标 : 最大定义 : 最大定义定义 : 将自己描述为第一个实际评估的。与定义相比, 最强的修正, 最昂贵的计算方法。最昂贵的 3 000\ 时间。最大证据 (L ) ) 用于最最最最的最的最的最的最的的的最的的模式,,, 最最最最的的最的的的的的的的,, 最的最最的的的的的。