Benign Autoencoders - Zhuanzhi Papers

October 2, 2022

Benign Autoencoders

Semyon Malamud, Andreas Schrimpf, Andrea Xu, Giuseppe Matera, Antoine Didisheim

Source: arXiv. arXiv admin note: substantial text overlap with arXiv:2110.08884

The success of modern machine learning algorithms depends crucially on efficient data representation and compression through dimensionality reduction. This practice seemingly contradicts the conventional intuition suggesting that data processing always leads to information loss. We prove that this intuition is wrong. For any non-convex problem, there exists an optimal, benign auto-encoder (BAE) extracting a lower-dimensional data representation that is strictly beneficial: Compressing model inputs improves model performance. We prove that BAE projects data onto a manifold whose dimension is the compressibility dimension of the learning model. We develop and implement an efficient algorithm for computing BAE and show that BAE improves model performance in every dataset we consider. Furthermore, by compressing "malignant" data dimensions, BAE makes learning more stable and robust.
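To make "extracting a lower-dimensional data representation" concrete, here is a minimal sketch of a generic linear autoencoder that compresses 2-D inputs to a 1-D code. It is not the BAE algorithm, which the abstract does not spell out. The encoder direction is found by power iteration on the second-moment matrix, a stand-in technique chosen for illustration. The names `make_data`, `top_direction`, `encode`, and `decode`, as well as the toy data, are hypothetical.

```python
import math
import random


def make_data(n=200, seed=0):
    """Toy data (an assumption): strong signal along (0.6, 0.8), weak noise orthogonal to it."""
    rng = random.Random(seed)
    signal_dir = (0.6, 0.8)
    noise_dir = (-0.8, 0.6)
    rows = []
    for _ in range(n):
        t = rng.uniform(-5.0, 5.0)   # informative coordinate
        e = rng.gauss(0.0, 0.1)      # small nuisance coordinate
        rows.append([
            t * signal_dir[0] + e * noise_dir[0],
            t * signal_dir[1] + e * noise_dir[1],
        ])
    return rows


def top_direction(rows, iters=100):
    """Power iteration for the leading eigenvector of the uncentered second-moment matrix X^T X."""
    dim = len(rows[0])
    v = [1.0 / math.sqrt(dim)] * dim
    for _ in range(iters):
        # scores = X v, then w = X^T scores
        scores = [sum(r[j] * v[j] for j in range(dim)) for r in rows]
        w = [sum(s * r[j] for s, r in zip(scores, rows)) for j in range(dim)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v


def encode(x, v):
    """Encoder: map an input vector to a 1-D code (its coordinate along v)."""
    return sum(a * b for a, b in zip(x, v))


def decode(z, v):
    """Decoder: map the 1-D code back into the original input space."""
    return [z * b for b in v]


if __name__ == "__main__":
    rows = make_data()
    v = top_direction(rows)
    x = rows[0]
    print("direction:", v)
    print("input:", x)
    print("reconstruction:", decode(encode(x, v), v))
```

Composing `decode` with `encode` gives a projection onto a one-dimensional subspace: applying it twice changes nothing further, and the discarded coordinate is the low-variance noise direction. Whether such a compression helps a downstream model depends on the learning problem; the sketch only shows the mechanics of encoding and decoding.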
