功能随机变量的分布等性检验 (Testing distributional equality for functional random variables) - 专知论文

会员服务 ·

0

统计量 · 泛函 · 随机变量 · 最大平均偏差 · Extensibility ·

2023 年 3 月 20 日

Testing distributional equality for functional random variables

翻译：功能随机变量的分布等性检验

In this article, we propose a two-sample test for functional observations modeled as elements of a separable Hilbert space. We present a general recipe for constructing a measure of dissimilarity between the distributions of two Hilbertian random variables and study the theoretical properties of one such measure which is constructed using Maximum Mean Discrepancy (MMD) on random linear projections of the distributions and aggregating them. We propose a data-driven estimate of this measure and use it as the test statistic. Large sample distributions of this statistic are derived both under null and alternative hypotheses. This test statistic involves a kernel function and the associated bandwidth. We prove that the resulting test has large-sample consistency for any data-driven choice of bandwidth that converges in probability to a positive number. Since the theoretical quantiles of the limiting null distribution are intractable, in practice, the test is calibrated using the permutation method. We also derive the limiting distribution of the permuted test statistic and the asymptotic power of the permutation test under local contiguous alternatives. This shows that the permutation test is consistent and statistically efficient in the Pitman sense. Extensive simulation studies are carried out and a real data set is analyzed to compare the performance of our proposed test with some state-of-the-art methods.

翻译：在这篇文章中，我们提出了一种用于函数观测值的两个样本测试，这些观测值被建模为可分Hilbert空间的元素。我们提出了一种构建两个Hilbert随机变量分布差异度量的通用方法，并研究了一种使用分布的最大均值差异度量（MMD）构建的度量之一的理论特性。我们提出了这种度量的数据驱动估计，将其用作检验统计量。在零假设和备择假设下导出了这个统计量的大样本分布。这个检验统计量涉及一个核函数和相关的带宽。我们证明了一个数据驱动带宽的选择可以以概率收敛到一个正数，从而得到的测试具有大样本一致性。由于极限空值分布的理论分位数难以计算，因此在实践中，使用置换方法进行校准。我们还导出了置换测试统计量的极限分布和在局部连续备择假设下的渐近功率。这表明置换测试在Pitman意义下是一致和统计有效的。我们进行了大量的模拟研究，并分析了一个真实数据集，比较了我们提出的测试方法和一些最先进的方法的表现。

0

相关内容

统计量

【干货书】工程和科学中的概率和统计，

【干货书】工程和科学中的概率和统计，

专知会员服务

58+阅读 · 2022年12月24日

【2022新书】基于模糊随机变量的模糊统计推理，295页pdf

【2022新书】基于模糊随机变量的模糊统计推理，295页pdf

专知会员服务

62+阅读 · 2022年10月17日

WWW21最新「比较学习」教程，135页PPT阐述从排名数据中学习

专知会员服务

37+阅读 · 2021年4月27日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【ICML2020】用于图结构化数据的卷积核网络，Convolutional Kernel Networks for Graph-Structured Data

【ICML2020】用于图结构化数据的卷积核网络，Convolutional Kernel Networks for Graph-Structured Data

专知会员服务

44+阅读 · 2020年6月29日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

专知会员服务

32+阅读 · 2020年3月30日

【MIT】时间序列GAN，Subadditivity of Probability Divergences

专知会员服务

63+阅读 · 2020年3月4日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

复杂数据模型中的分布逼近方法

国家自然科学基金

3+阅读 · 2014年12月31日

关于面板(纵向）数据的动态统计分析

国家自然科学基金

0+阅读 · 2014年12月31日

统计深度函数的若干推广及其计算方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

高维数据下多因变量回归模型的统计推断

国家自然科学基金

5+阅读 · 2013年12月31日

随机变量结构的模型论

国家自然科学基金

0+阅读 · 2013年12月31日

满足开集条件的自相似结构上的分析

国家自然科学基金

0+阅读 · 2013年12月31日

一类半参数时间序列模型的统计推断

国家自然科学基金

0+阅读 · 2012年12月31日

高维数据的假设检验

国家自然科学基金

0+阅读 · 2012年12月31日

似然方法的有限样本研究

国家自然科学基金

0+阅读 · 2011年12月31日

因果推断的统计方法

国家自然科学基金

26+阅读 · 2011年12月31日

Score-based calibration testing for multivariate forecast distributions

Arxiv

0+阅读 · 2023年5月9日

Causal Discovery with Unobserved Variables: A Proxy Variable Approach

Arxiv

0+阅读 · 2023年5月9日

Axiomatization of Interventional Probability Distributions

Arxiv

0+阅读 · 2023年5月8日

Contrastive losses as generalized models of global epistasis

Arxiv

0+阅读 · 2023年5月8日

Wasserstein multivariate auto-regressive models for modeling distributional time series and its application in graph learning

Arxiv

0+阅读 · 2023年5月6日

A new approach to shooting methods for terminal value problems of fractional differential equations

Arxiv

0+阅读 · 2023年5月6日

Distribution-free and Model-free Multivariate Feature Screening via Multivariate Rank Distance Correlation

Arxiv

0+阅读 · 2023年5月5日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Towards Out-Of-Distribution Generalization: A Survey

Arxiv

38+阅读 · 2021年8月31日

VIP会员

文章信息

相关主题

最大平均偏差

相关VIP内容

【干货书】工程和科学中的概率和统计，

【干货书】工程和科学中的概率和统计，

专知会员服务

58+阅读 · 2022年12月24日

【2022新书】基于模糊随机变量的模糊统计推理，295页pdf

【2022新书】基于模糊随机变量的模糊统计推理，295页pdf

专知会员服务

62+阅读 · 2022年10月17日

WWW21最新「比较学习」教程，135页PPT阐述从排名数据中学习

专知会员服务

37+阅读 · 2021年4月27日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【ICML2020】用于图结构化数据的卷积核网络，Convolutional Kernel Networks for Graph-Structured Data

【ICML2020】用于图结构化数据的卷积核网络，Convolutional Kernel Networks for Graph-Structured Data

专知会员服务

44+阅读 · 2020年6月29日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

专知会员服务

32+阅读 · 2020年3月30日

【MIT】时间序列GAN，Subadditivity of Probability Divergences

专知会员服务

63+阅读 · 2020年3月4日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

从社会学实验到行为仿真：理解基于Agent的观点动力学建模思维

中英文版《GPT-5 System Card速览》报告

ACL 2025 | 大模型结构化知识提示的泛化能力研究

【普林斯顿博士论文】大型模型的高效推理

相关资讯

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

相关论文

Score-based calibration testing for multivariate forecast distributions

Arxiv

0+阅读 · 2023年5月9日

Causal Discovery with Unobserved Variables: A Proxy Variable Approach

Arxiv

0+阅读 · 2023年5月9日

Axiomatization of Interventional Probability Distributions

Arxiv

0+阅读 · 2023年5月8日

Contrastive losses as generalized models of global epistasis

Arxiv

0+阅读 · 2023年5月8日

Wasserstein multivariate auto-regressive models for modeling distributional time series and its application in graph learning

Arxiv

0+阅读 · 2023年5月6日

A new approach to shooting methods for terminal value problems of fractional differential equations

Arxiv

0+阅读 · 2023年5月6日

Distribution-free and Model-free Multivariate Feature Screening via Multivariate Rank Distance Correlation

Arxiv

0+阅读 · 2023年5月5日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Towards Out-Of-Distribution Generalization: A Survey

Arxiv

38+阅读 · 2021年8月31日

相关基金

复杂数据模型中的分布逼近方法

国家自然科学基金

3+阅读 · 2014年12月31日

关于面板(纵向）数据的动态统计分析

国家自然科学基金

0+阅读 · 2014年12月31日

统计深度函数的若干推广及其计算方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

高维数据下多因变量回归模型的统计推断

国家自然科学基金

5+阅读 · 2013年12月31日

随机变量结构的模型论

国家自然科学基金

0+阅读 · 2013年12月31日

满足开集条件的自相似结构上的分析

国家自然科学基金

0+阅读 · 2013年12月31日

一类半参数时间序列模型的统计推断

国家自然科学基金

0+阅读 · 2012年12月31日

高维数据的假设检验

国家自然科学基金

0+阅读 · 2012年12月31日

似然方法的有限样本研究

国家自然科学基金

0+阅读 · 2011年12月31日

因果推断的统计方法

国家自然科学基金

26+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员