后勤后退的核心设置 (On Coresets for Logistic Regression) - 专知论文

会员服务 ·

0

对数几率回归 · Continuity · 均匀采样 · Performer · UniFormer ·

2021 年 3 月 8 日

On Coresets for Logistic Regression

翻译：后勤后退的核心设置

Alexander Munteanu,Chris Schwiegelshohn,Christian Sohler,David P. Woodruff

Coresets are one of the central methods to facilitate the analysis of large data sets. We continue a recent line of research applying the theory of coresets to logistic regression. First, we show a negative result, namely, that no strongly sublinear sized coresets exist for logistic regression. To deal with intractable worst-case instances we introduce a complexity measure $\mu(X)$, which quantifies the hardness of compressing a data set for logistic regression. $\mu(X)$ has an intuitive statistical interpretation that may be of independent interest. For data sets with bounded $\mu(X)$-complexity, we show that a novel sensitivity sampling scheme produces the first provably sublinear $(1\pm\varepsilon)$-coreset. We illustrate the performance of our method by comparing to uniform sampling as well as to state of the art methods in the area. The experiments are conducted on real world benchmark data for logistic regression.

翻译：核心数据集是便于分析大型数据集的核心方法之一。我们继续最近的一系列研究, 将核心数据集理论应用于物流回归。首先, 我们显示一个负结果, 即不存在用于物流回归的强烈亚线型核心数据集。为了处理棘手的最坏案例, 我们引入了一个复杂度为$\mu( X) 的计量标准, 该计量了压缩一套物流回归数据集的难度。 $\ mu( X) $ 具有可能具有独立兴趣的直观统计解释。对于有约束值$mu( X) $( X) 的复杂度的数据集, 我们显示一个新的敏感度取样计划产生了第一个可移动的亚线性亚线性( $(1\\ p\ pm\ varrepsilon) $( $) 核心。我们通过比较统一的取样以及该地区艺术方法的状态来说明我们方法的性能。实验是根据真实的世界物流回归基准数据进行的。

0

相关内容

对数几率回归

对数几率回归

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【卡内基梅隆大学-CMU】机器学习中的公平性，Learning Fair Representations

【卡内基梅隆大学-CMU】机器学习中的公平性，Learning Fair Representations

专知会员服务

38+阅读 · 2020年2月29日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

TensorFlow深度学习，从线性回归到强化学习的深度学习（TensorFlow for Deep Learning From Linear Regression to Reinforcement Learning），附页256页pdf

TensorFlow深度学习，从线性回归到强化学习的深度学习（TensorFlow for Deep Learning From Linear Regression to Reinforcement Learning），附页256页pdf

专知会员服务

46+阅读 · 2020年1月1日

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

专知会员服务

104+阅读 · 2019年12月30日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

逻辑回归（Logistic Regression）模型简介

逻辑回归（Logistic Regression）模型简介

全球人工智能

5+阅读 · 2017年11月1日

【推荐】免费书(草稿)：数据科学的数学基础

【推荐】免费书(草稿)：数据科学的数学基础

机器学习研究会

20+阅读 · 2017年10月1日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Logistic回归第一弹——二项Logistic Regression

Logistic回归第一弹——二项Logistic Regression

机器学习深度学习实战原创交流

3+阅读 · 2015年10月22日

The Raise Regression: Justification, properties and application

The Raise Regression: Justification, properties and application

Arxiv

0+阅读 · 2021年4月29日

Towards a more efficient approach for the satisfiability of two-variable logic

Arxiv

0+阅读 · 2021年4月29日

Bandit-Based Monte Carlo Optimization for Nearest Neighbors

Arxiv

0+阅读 · 2021年4月28日

Rule-based Shielding for Partially Observable Monte-Carlo Planning

Arxiv

1+阅读 · 2021年4月28日

A Kernel-based Consensual Aggregation for Regression

Arxiv

0+阅读 · 2021年4月28日

Modeling the dynamics of language change: logistic regression, Piotrowski's law, and a handful of examples in Polish

Arxiv

0+阅读 · 2021年4月28日

Spatially Clustered Regression

Arxiv

0+阅读 · 2021年4月28日

Improved log-Gaussian approximation for over-dispersed Poisson regression: application to spatial analysis of COVID-19

Arxiv

0+阅读 · 2021年4月28日

Robust estimation for semi-functional linear regression models

Arxiv

0+阅读 · 2021年4月28日

Logically-Constrained Reinforcement Learning

Arxiv

5+阅读 · 2018年4月22日

VIP会员

文章信息

相关主题

对数几率回归

相关VIP内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【卡内基梅隆大学-CMU】机器学习中的公平性，Learning Fair Representations

【卡内基梅隆大学-CMU】机器学习中的公平性，Learning Fair Representations

专知会员服务

38+阅读 · 2020年2月29日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

TensorFlow深度学习，从线性回归到强化学习的深度学习（TensorFlow for Deep Learning From Linear Regression to Reinforcement Learning），附页256页pdf

TensorFlow深度学习，从线性回归到强化学习的深度学习（TensorFlow for Deep Learning From Linear Regression to Reinforcement Learning），附页256页pdf

专知会员服务

46+阅读 · 2020年1月1日

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

专知会员服务

104+阅读 · 2019年12月30日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《概率数值计算：贝叶斯求积法与人机协作》最新博士论文

【NTU博士论文】多模态神经三维资产合成

人工智能：实时战斗适应

《运用作战人员数字孪生与生成式人工智能预测任务成果》最新文献

相关资讯

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

逻辑回归（Logistic Regression）模型简介

逻辑回归（Logistic Regression）模型简介

全球人工智能

5+阅读 · 2017年11月1日

【推荐】免费书(草稿)：数据科学的数学基础

【推荐】免费书(草稿)：数据科学的数学基础

机器学习研究会

20+阅读 · 2017年10月1日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Logistic回归第一弹——二项Logistic Regression

Logistic回归第一弹——二项Logistic Regression

机器学习深度学习实战原创交流

3+阅读 · 2015年10月22日

相关论文

The Raise Regression: Justification, properties and application

The Raise Regression: Justification, properties and application

Arxiv

0+阅读 · 2021年4月29日

Towards a more efficient approach for the satisfiability of two-variable logic

Arxiv

0+阅读 · 2021年4月29日

Bandit-Based Monte Carlo Optimization for Nearest Neighbors

Arxiv

0+阅读 · 2021年4月28日

Rule-based Shielding for Partially Observable Monte-Carlo Planning

Arxiv

1+阅读 · 2021年4月28日

A Kernel-based Consensual Aggregation for Regression

Arxiv

0+阅读 · 2021年4月28日

Modeling the dynamics of language change: logistic regression, Piotrowski's law, and a handful of examples in Polish

Arxiv

0+阅读 · 2021年4月28日

Spatially Clustered Regression

Arxiv

0+阅读 · 2021年4月28日

Improved log-Gaussian approximation for over-dispersed Poisson regression: application to spatial analysis of COVID-19

Arxiv

0+阅读 · 2021年4月28日

Robust estimation for semi-functional linear regression models

Arxiv

0+阅读 · 2021年4月28日

Logically-Constrained Reinforcement Learning

Arxiv

5+阅读 · 2018年4月22日

微信扫码咨询专知VIP会员