Always Strengthen Your Strengths: A Drift-Aware Incremental Learning Framework for CTR Prediction

CTR prediction · CTR · Incremental learning · Streaming data · Data distribution

April 17, 2023

Congcong Liu,Fei Teng,Xiwei Zhao,Zhangang Lin,Jinghe Hu,Jingping Shao

From arXiv. This work has been accepted by SIGIR '23.

Click-through rate (CTR) prediction is of great importance in recommendation systems and online advertising platforms. In industrial settings, the user-generated data observed by the CTR model typically arrives as a stream. A characteristic of streaming data is that the underlying distribution drifts over time and may recur. This can lead to catastrophic forgetting if the model simply keeps adapting to the newest data distribution. It is also inefficient to relearn a distribution that has already occurred. Because of memory constraints and the diversity of data distributions in large-scale industrial applications, conventional strategies against catastrophic forgetting, such as replay, parameter isolation, and knowledge distillation, are difficult to deploy. In this work, we design a novel drift-aware incremental learning framework based on ensemble learning to address catastrophic forgetting in CTR prediction. With explicit error-based drift detection on streaming data, the framework further strengthens well-adapted ensembles and freezes ensembles that do not match the input distribution, avoiding catastrophic interference. Evaluations in both offline experiments and an A/B test show that our method outperforms all baselines considered.
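The abstract names the mechanism (error-based drift detection, strengthening well-adapted ensemble members, freezing mismatched ones) but gives no implementation details. The following Python sketch illustrates that general idea only and is not the authors' method. Everything in it is an assumption made for illustration: the `Expert` class (a tiny logistic regression standing in for a CTR model), the `drift_aware_step` function, the `tolerance` threshold, the "within tolerance of the best batch loss" rule used as the drift signal, and the synthetic `make_batch` data generator.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


class Expert:
    """Hypothetical ensemble member: logistic regression trained by SGD."""

    def __init__(self, dim, lr=0.5):
        self.w = [0.0] * dim
        self.lr = lr

    def predict(self, x):
        return sigmoid(sum(wi * xi for wi, xi in zip(self.w, x)))

    def log_loss(self, batch):
        # Mean binary cross-entropy on a batch of (features, label) pairs.
        eps = 1e-7
        total = 0.0
        for x, y in batch:
            p = min(max(self.predict(x), eps), 1.0 - eps)
            total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
        return total / len(batch)

    def update(self, batch):
        # One SGD pass over the batch.
        for x, y in batch:
            g = self.predict(x) - y
            self.w = [wi - self.lr * g * xi for wi, xi in zip(self.w, x)]


def drift_aware_step(experts, batch, tolerance=0.05):
    """Score all experts on the incoming batch before training on it.

    Assumed rule: experts whose loss is within `tolerance` of the best
    loss are treated as well adapted and updated; the rest are frozen.
    """
    losses = [e.log_loss(batch) for e in experts]
    best = min(losses)
    updated = []
    for i, (expert, loss) in enumerate(zip(experts, losses)):
        if loss - best <= tolerance:
            expert.update(batch)
            updated.append(i)
    return losses, updated


def make_batch(rng, flipped, n=200):
    # Synthetic stream: label is 1 when x0 > 0; `flipped` inverts the concept.
    batch = []
    for _ in range(n):
        x0 = rng.uniform(-1.0, 1.0)
        y = int(x0 > 0) ^ int(flipped)
        batch.append(([x0, 1.0], y))
    return batch


rng = random.Random(0)
experts = [Expert(dim=2), Expert(dim=2)]
# Warm-up: expert 0 sees the original concept, expert 1 the flipped one.
for _ in range(5):
    experts[0].update(make_batch(rng, flipped=False))
    experts[1].update(make_batch(rng, flipped=True))

frozen_before = list(experts[1].w)
# The original concept recurs: only the matching expert should be trained.
losses, updated = drift_aware_step(experts, make_batch(rng, flipped=False))
print("batch losses:", losses)
print("updated experts:", updated)
print("expert 1 unchanged:", experts[1].w == frozen_before)
```

In this toy setup the expert trained on the recurring concept is updated, while the expert trained on the flipped concept keeps its weights, so the recurring distribution does not overwrite what the other expert learned. The sketch omits parts a real system would need, such as how ensemble members are combined at prediction time and what happens when no existing member fits a new distribution.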
