Z-Dip：一种经过验证的Dip检验推广方法 (Z-Dip: a validated generalization of the Dip Test) - 专知论文

会员服务 ·

0

样本 · 分析 · 经验分布 · 基础问题 · 聚类分析 ·

Z-Dip: a validated generalization of the Dip Test

翻译：Z-Dip：一种经过验证的Dip检验推广方法

Edoardo Di Martino,Matteo Cinelli,Roy Cerqueti

Detecting multimodality in empirical distributions is a fundamental problem in statistics and data analysis, with applications ranging from clustering to social science. Hartigan's Dip Test is a classical nonparametric procedure for testing unimodality versus multimodality, but its interpretation is hindered by strong dependence on sample size and the need for lookup tables. We introduce the Z-Dip, a standardized extension of the Dip Test that removes sample-size dependence by comparing observed Dip values to simulated null distributions. We calibrate a universal decision threshold for Z-Dip via simulation and bootstrap resampling, providing a unified criterion for multimodality detection. In the final section, we also propose a downsampling-based approach to further mitigate residual sample-size effects in very large datasets. Lookup tables and software implementations are made available for efficient use in practice.

翻译：检测经验分布中的多峰性是统计学与数据分析中的基础问题，其应用范围涵盖聚类分析至社会科学。Hartigan的Dip检验是用于检验单峰性与多峰性的经典非参数方法，但其解释受到样本量高度依赖性和需查表使用的限制。我们提出了Z-Dip，作为Dip检验的标准化扩展，通过将观测到的Dip值与模拟零分布进行比较，消除了样本量依赖性。我们通过模拟和自助重采样校准了Z-Dip的通用决策阈值，为多峰性检测提供了统一标准。在最后部分，我们还提出了一种基于下采样的方法，以进一步减轻超大样本数据中残留的样本量效应。查表工具和软件实现已提供，便于实际高效使用。

0

相关内容

UnHiPPO：面向不确定性的状态空间模型初始化方法

UnHiPPO：面向不确定性的状态空间模型初始化方法

专知会员服务

11+阅读 · 6月6日

【AAAI 2022】 GeomGCL:用于分子性质预测的几何图对比学习

【AAAI 2022】 GeomGCL:用于分子性质预测的几何图对比学习

专知会员服务

24+阅读 · 2022年2月27日

IEEE TPAMI | 基于标注偏差估计的实例相关PU学习

IEEE TPAMI | 基于标注偏差估计的实例相关PU学习

专知会员服务

12+阅读 · 2021年10月23日

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

专知会员服务

44+阅读 · 2020年4月30日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【ICML2021】因果匹配领域泛化

【ICML2021】因果匹配领域泛化

专知

12+阅读 · 2021年8月12日

【CVPR2021】CausalVAE: 引入因果结构的解耦表示学习

【CVPR2021】CausalVAE: 引入因果结构的解耦表示学习

专知

19+阅读 · 2021年3月28日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知

11+阅读 · 2020年8月28日

【NeurIPS2019】图变换网络：Graph Transformer Network

【NeurIPS2019】图变换网络：Graph Transformer Network

专知

245+阅读 · 2019年11月18日

NAACL 2019 | 一种考虑缓和KL消失的简单VAE训练方法

NAACL 2019 | 一种考虑缓和KL消失的简单VAE训练方法

PaperWeekly

20+阅读 · 2019年4月24日

半线性广义Tricomi方程Cauchy问题解的生命跨度估计研究

国家自然科学基金

0+阅读 · 2017年12月31日

两类Markov排队模型的衰减性质

国家自然科学基金

1+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

一般误差分布下若干半参数模型的复合分位数方法

国家自然科学基金

0+阅读 · 2014年12月31日

Poisson流形上的修正Hamilton方法

国家自然科学基金

0+阅读 · 2014年12月31日

Boltzmann-Shannon Index: A Geometric-Aware Measure of Clustering Balance

Arxiv

0+阅读 · 12月17日

Beyond I-Con: Exploring New Dimension of Distance Measures in Representation Learning

Arxiv

0+阅读 · 12月4日

A Fast Kernel-based Conditional Independence test with Application to Causal Discovery

Arxiv

0+阅读 · 12月4日

LPCD: Unified Framework from Layer-Wise to Submodule Quantization

Arxiv

0+阅读 · 12月1日

An Approach to Variable Clustering: K-means in Transposed Data and its Relationship with Principal Component Analysis

Arxiv

0+阅读 · 11月30日

VIP会员

文章信息

相关主题

相关VIP内容

UnHiPPO：面向不确定性的状态空间模型初始化方法

UnHiPPO：面向不确定性的状态空间模型初始化方法

专知会员服务

11+阅读 · 6月6日

【AAAI 2022】 GeomGCL:用于分子性质预测的几何图对比学习

【AAAI 2022】 GeomGCL:用于分子性质预测的几何图对比学习

专知会员服务

24+阅读 · 2022年2月27日

IEEE TPAMI | 基于标注偏差估计的实例相关PU学习

IEEE TPAMI | 基于标注偏差估计的实例相关PU学习

专知会员服务

12+阅读 · 2021年10月23日

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

专知会员服务

44+阅读 · 2020年4月30日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

热门VIP内容

开通专知VIP会员享更多权益服务

前沿人工智能趋势报告（Frontier AI Trends Report）

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

Andrej Karpathy：2025 年 LLM 年度回顾（2025 LLM Year in Review）

音退化问题：基于输入操控的鲁棒语音转换综述

相关资讯

【ICML2021】因果匹配领域泛化

【ICML2021】因果匹配领域泛化

专知

12+阅读 · 2021年8月12日

【CVPR2021】CausalVAE: 引入因果结构的解耦表示学习

【CVPR2021】CausalVAE: 引入因果结构的解耦表示学习

专知

19+阅读 · 2021年3月28日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知

11+阅读 · 2020年8月28日

【NeurIPS2019】图变换网络：Graph Transformer Network

【NeurIPS2019】图变换网络：Graph Transformer Network

专知

245+阅读 · 2019年11月18日

NAACL 2019 | 一种考虑缓和KL消失的简单VAE训练方法

NAACL 2019 | 一种考虑缓和KL消失的简单VAE训练方法

PaperWeekly

20+阅读 · 2019年4月24日

相关论文

Boltzmann-Shannon Index: A Geometric-Aware Measure of Clustering Balance

Arxiv

0+阅读 · 12月17日

Beyond I-Con: Exploring New Dimension of Distance Measures in Representation Learning

Arxiv

0+阅读 · 12月4日

A Fast Kernel-based Conditional Independence test with Application to Causal Discovery

Arxiv

0+阅读 · 12月4日

LPCD: Unified Framework from Layer-Wise to Submodule Quantization

Arxiv

0+阅读 · 12月1日

An Approach to Variable Clustering: K-means in Transposed Data and its Relationship with Principal Component Analysis

Arxiv

0+阅读 · 11月30日

相关基金

半线性广义Tricomi方程Cauchy问题解的生命跨度估计研究

国家自然科学基金

0+阅读 · 2017年12月31日

两类Markov排队模型的衰减性质

国家自然科学基金

1+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

一般误差分布下若干半参数模型的复合分位数方法

国家自然科学基金

0+阅读 · 2014年12月31日

Poisson流形上的修正Hamilton方法

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员