正则化的EM算法 (Regularized EM algorithm) - 专知论文

会员服务 ·

0

EM算法 · 协方差矩阵 · 方差 · 正则化 · 算法 ·

2023 年 3 月 27 日

Regularized EM algorithm

翻译：正则化的EM算法

Pierre Houdouin,Esa Ollila,Frederic Pascal

from arxiv, ICASSP Conference, 4 pages, 8 figures

Expectation-Maximization (EM) algorithm is a widely used iterative algorithm for computing (local) maximum likelihood estimate (MLE). It can be used in an extensive range of problems, including the clustering of data based on the Gaussian mixture model (GMM). Numerical instability and convergence problems may arise in situations where the sample size is not much larger than the data dimensionality. In such low sample support (LSS) settings, the covariance matrix update in the EM-GMM algorithm may become singular or poorly conditioned, causing the algorithm to crash. On the other hand, in many signal processing problems, a priori information can be available indicating certain structures for different cluster covariance matrices. In this paper, we present a regularized EM algorithm for GMM-s that can make efficient use of such prior knowledge as well as cope with LSS situations. The method aims to maximize a penalized GMM likelihood where regularized estimation may be used to ensure positive definiteness of covariance matrix updates and shrink the estimators towards some structured target covariance matrices. We show that the theoretical guarantees of convergence hold, leading to better performing EM algorithm for structured covariance matrix models or with low sample settings.

翻译：EM算法是用于计算（局部）最大似然估计的广泛使用的迭代算法。它可在广泛的问题中使用，包括基于高斯混合模型（GMM）的数据聚类。在样本大小与数据维度不相差的情况下，可能会出现数值不稳定性和收敛问题。在这种低样本支撑（LSS）情况下，EM-GMM算法中的协方差矩阵更新可能变得奇异或病态，导致算法崩溃。另一方面，在许多信号处理问题中，先验信息可以可用，指示不同聚类协方差矩阵的某些结构。在本文中，我们提出了一种GMM的正则化EM算法，可以高效利用这种先验知识，并处理LSS情况。该方法旨在最大化罚函数的GMM似然函数，其中可以使用正则化估计来确保协方差矩阵更新的正定性，并将估计器收缩到一些结构化的目标协方差矩阵。我们证明了收敛的理论保证，从而为结构化协方差矩阵模型或低样本设置提供更好的性能EM算法。

0

相关内容

EM算法

em算法指的是最大期望算法（Expectation Maximization Algorithm，又译期望最大化算法），是一种迭代算法，用于含有隐变量（latent variable）的概率参数模型的最大似然估计或极大后验概率估计。

干货书！基于单调算子的大规模凸优化，348页pdf

干货书！基于单调算子的大规模凸优化，348页pdf

专知会员服务

49+阅读 · 2022年7月24日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【经典书】数据挖掘和机器学习:基本概念和算法，附电子书与PPT

【经典书】数据挖掘和机器学习:基本概念和算法，附电子书与PPT

专知会员服务

167+阅读 · 2021年2月23日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】图模型: 指数族和变分推断，305页pdf

专知会员服务

52+阅读 · 2020年12月10日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

专知会员服务

77+阅读 · 2020年6月28日

【NeurIPS 2019|经典论文奖】正则随机学习和在线优化的双重平均法（Dual Averaging Method for Regularized Stochastic Learning and Online Optimization），微软研究院Lin Xiao

【NeurIPS 2019|经典论文奖】正则随机学习和在线优化的双重平均法（Dual Averaging Method for Regularized Stochastic Learning and Online Optimization），微软研究院Lin Xiao

专知会员服务

17+阅读 · 2019年12月9日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】免费书(草稿)：数据科学的数学基础

【推荐】免费书(草稿)：数据科学的数学基础

机器学习研究会

20+阅读 · 2017年10月1日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

方差正则化的分类模型选择方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

变步长和变正则化因子的子带自适应滤波算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

高阶图像去噪模型的快速数值算法研究

国家自然科学基金

1+阅读 · 2015年12月31日

实际复杂系统不确定量化中的降阶建模理论

国家自然科学基金

0+阅读 · 2013年12月31日

非凸稀疏先验图像恢复建模理论和算法

国家自然科学基金

0+阅读 · 2012年12月31日

基于稀疏分解及子空间的多项式相位信号的参数估计及其快速算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

非凸对称锥优化的最优性理论和算法

国家自然科学基金

0+阅读 · 2009年12月31日

无线通信物理层网络编码与低复杂度迭代可译信道编码联合设计

国家自然科学基金

0+阅读 · 2009年12月31日

稀疏逼近及其应用

国家自然科学基金

0+阅读 · 2008年12月31日

Optimal Weighted Random Forests

Arxiv

0+阅读 · 2023年5月17日

Stochastic Ratios Tracking Algorithm for Large Scale Machine Learning Problems

Arxiv

0+阅读 · 2023年5月17日

Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage

Arxiv

0+阅读 · 2023年5月16日

Manifold Regularized Tucker Decomposition Approach for Spatiotemporal Traffic Data Imputation

Arxiv

0+阅读 · 2023年5月16日

Nearly Optimal VC-Dimension and Pseudo-Dimension Bounds for Deep Neural Network Derivatives

Arxiv

0+阅读 · 2023年5月15日

Gradient-enhanced physics-informed neural networks based on transfer learning for inverse problems of the variable coefficient differential equations

Arxiv

0+阅读 · 2023年5月15日

Tight and fast generalization error bound of graph embedding in metric space

Arxiv

0+阅读 · 2023年5月13日

Private and Communication-Efficient Algorithms for Entropy Estimation

Arxiv

0+阅读 · 2023年5月12日

Random Smoothing Regularization in Kernel Gradient Descent Learning

Arxiv

0+阅读 · 2023年5月12日

Optimization for deep learning: theory and algorithms

Optimization for deep learning: theory and algorithms

Arxiv

105+阅读 · 2019年12月19日

VIP会员

文章信息

相关主题

协方差矩阵

相关VIP内容

干货书！基于单调算子的大规模凸优化，348页pdf

干货书！基于单调算子的大规模凸优化，348页pdf

专知会员服务

49+阅读 · 2022年7月24日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【经典书】数据挖掘和机器学习:基本概念和算法，附电子书与PPT

【经典书】数据挖掘和机器学习:基本概念和算法，附电子书与PPT

专知会员服务

167+阅读 · 2021年2月23日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】图模型: 指数族和变分推断，305页pdf

专知会员服务

52+阅读 · 2020年12月10日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

专知会员服务

77+阅读 · 2020年6月28日

【NeurIPS 2019|经典论文奖】正则随机学习和在线优化的双重平均法（Dual Averaging Method for Regularized Stochastic Learning and Online Optimization），微软研究院Lin Xiao

【NeurIPS 2019|经典论文奖】正则随机学习和在线优化的双重平均法（Dual Averaging Method for Regularized Stochastic Learning and Online Optimization），微软研究院Lin Xiao

专知会员服务

17+阅读 · 2019年12月9日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【新手册】机器学习：讲义笔记

文本生成与编辑图像：综述

决策智能中的时间序列预测大模型

【ICML2025】迈向多模态通用人工智能之路：通用级别与通用基准

相关资讯

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】免费书(草稿)：数据科学的数学基础

【推荐】免费书(草稿)：数据科学的数学基础

机器学习研究会

20+阅读 · 2017年10月1日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

相关论文

Optimal Weighted Random Forests

Arxiv

0+阅读 · 2023年5月17日

Stochastic Ratios Tracking Algorithm for Large Scale Machine Learning Problems

Arxiv

0+阅读 · 2023年5月17日

Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage

Arxiv

0+阅读 · 2023年5月16日

Manifold Regularized Tucker Decomposition Approach for Spatiotemporal Traffic Data Imputation

Arxiv

0+阅读 · 2023年5月16日

Nearly Optimal VC-Dimension and Pseudo-Dimension Bounds for Deep Neural Network Derivatives

Arxiv

0+阅读 · 2023年5月15日

Gradient-enhanced physics-informed neural networks based on transfer learning for inverse problems of the variable coefficient differential equations

Arxiv

0+阅读 · 2023年5月15日

Tight and fast generalization error bound of graph embedding in metric space

Arxiv

0+阅读 · 2023年5月13日

Private and Communication-Efficient Algorithms for Entropy Estimation

Arxiv

0+阅读 · 2023年5月12日

Random Smoothing Regularization in Kernel Gradient Descent Learning

Arxiv

0+阅读 · 2023年5月12日

Optimization for deep learning: theory and algorithms

Optimization for deep learning: theory and algorithms

Arxiv

105+阅读 · 2019年12月19日

相关基金

方差正则化的分类模型选择方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

变步长和变正则化因子的子带自适应滤波算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

高阶图像去噪模型的快速数值算法研究

国家自然科学基金

1+阅读 · 2015年12月31日

实际复杂系统不确定量化中的降阶建模理论

国家自然科学基金

0+阅读 · 2013年12月31日

非凸稀疏先验图像恢复建模理论和算法

国家自然科学基金

0+阅读 · 2012年12月31日

基于稀疏分解及子空间的多项式相位信号的参数估计及其快速算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

非凸对称锥优化的最优性理论和算法

国家自然科学基金

0+阅读 · 2009年12月31日

无线通信物理层网络编码与低复杂度迭代可译信道编码联合设计

国家自然科学基金

0+阅读 · 2009年12月31日

稀疏逼近及其应用

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员