科学数据网络内缓存的科学数据存取趋势 (Access Trends of In-network Cache for Scientific Data) - 专知论文

会员服务 ·

0

Storage · Networking · 可约的 · AIM · 迹 ·

2022 年 5 月 11 日

Access Trends of In-network Cache for Scientific Data

翻译：科学数据网络内缓存的科学数据存取趋势

Ruize Han,Alex Sim,Kesheng Wu,Inder Monga,Chin Guok,Frank Würthwein,Diego Davila,Justas Balcas,Harvey Newman

Scientific collaborations are increasingly relying on large volumes of data for their work and many of them employ tiered systems to replicate the data to their worldwide user communities. Each user in the community often selects a different subset of data for their analysis tasks; however, members of a research group often are working on related research topics that require similar data objects. Thus, there is a significant amount of data sharing possible. In this work, we study the access traces of a federated storage cache known as the Southern California Petabyte Scale Cache. By studying the access patterns and potential for network traffic reduction by this caching system, we aim to explore the predictability of the cache uses and the potential for a more general in-network data caching. Our study shows that this distributed storage cache is able to reduce the network traffic volume by a factor of 2.35 during a part of the study period. We further show that machine learning models could predict cache utilization with an accuracy of 0.88. This demonstrates that such cache usage is predictable, which could be useful for managing complex networking resources such as in-network caching.

翻译：科学协作越来越多地依靠大量数据开展工作,其中许多人利用分层系统将数据复制给世界各地的用户社区。社区中的每个用户往往为分析任务选择不同的一组数据;然而,一个研究小组成员往往在研究需要类似数据对象的相关研究课题;因此,可以进行大量的数据共享。在这项工作中,我们研究了称为南加利福尼亚Petabyte比例缓存的联结存储缓存的存取痕迹。通过研究这个缓存系统的存取模式和网络流量减少潜力,我们的目标是探讨缓存用途的可预测性和网络内数据更一般缓存的可能性。我们的研究显示,在研究期的一部分时间里,分散的存储缓存能够将网络流量减少2.35倍。我们进一步表明,机器学习模型可以预测0.88的缓存利用准确度。这说明,这种缓存的使用是可预测的,有助于管理网络内缓存等复杂的网络资源。

0

相关内容

Storage

Storage

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

80+阅读 · 2020年7月26日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

大块板状样品中子核数据宏观基准检验研究

国家自然科学基金

0+阅读 · 2015年12月31日

高血压血管重塑中血管保护分子CREG基因表达的上游调控机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Cadherin 11在骨关节炎滑膜炎及关节软骨破坏中的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

Serp-2 调控apoptosis和pyroptosis 对肝脏缺血再灌注损伤的保护作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

MiR-139-5p通过调控Rho/ROCK信号通路参与高血压心肌重塑

国家自然科学基金

0+阅读 · 2014年12月31日

miR125a-5p对血管内皮细胞和膜微粒功能的调控及其在缺血性脑卒中发病中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

Long non-coding RNA MEG3分子对胶质瘤干细胞调控作用的研究

国家自然科学基金

0+阅读 · 2013年12月31日

ADS中检测快中子束的GEM探测器的研制

国家自然科学基金

0+阅读 · 2013年12月31日

PPARγ和ANGPTL4基因表达在急性胰腺炎肺损伤发病机制中的作用及清胰汤的干预作用

国家自然科学基金

0+阅读 · 2011年12月31日

miR-29调控TGF-β1与COLI基因表达在增生性瘢痕形成中的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

Using Machine Learning to Anticipate Tipping Points and Extrapolate to Post-Tipping Dynamics of Non-Stationary Dynamical Systems

Using Machine Learning to Anticipate Tipping Points and Extrapolate to Post-Tipping Dynamics of Non-Stationary Dynamical Systems

Arxiv

0+阅读 · 2022年7月1日

Towards an Architecture-centric Methodology for Migrating to Microservices

Towards an Architecture-centric Methodology for Migrating to Microservices

Arxiv

0+阅读 · 2022年7月1日

Panning for gold: Lessons learned from the platform-agnostic automated detection of political content in textual data

Arxiv

0+阅读 · 2022年7月1日

A Time Series Forecasting Approach to Minimize Cold Start Time in Cloud-Serverless Platform

A Time Series Forecasting Approach to Minimize Cold Start Time in Cloud-Serverless Platform

Arxiv

0+阅读 · 2022年6月30日

There and Back Again: On Applying Data Reduction Rules by Undoing Others

There and Back Again: On Applying Data Reduction Rules by Undoing Others

Arxiv

0+阅读 · 2022年6月29日

Statistical Evaluation of Privacy-preserving Publication and Sharing of Three Types of COVID-19 Pandemic Data: Methods and Case Studies

Arxiv

0+阅读 · 2022年6月29日

In-network Computation for Large-scale Federated Learning over Wireless Edge Networks

Arxiv

0+阅读 · 2022年6月28日

Smart Application for Fall Detection Using Wearable ECG & Accelerometer Sensors

Arxiv

0+阅读 · 2022年6月28日

A Survey of Methods for Low-Power Deep Learning and Computer Vision

A Survey of Methods for Low-Power Deep Learning and Computer Vision

Arxiv

14+阅读 · 2020年3月24日

A Survey of Model Compression and Acceleration for Deep Neural Networks

Arxiv

66+阅读 · 2019年9月8日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

80+阅读 · 2020年7月26日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

卫星导航技术发展综述

《美军"僚机"联合能力技术演示项目：有人-无人火炮作战》41页报告

美军条令《火力指挥》116页

可解释的人工智能在生物医学图像分析中的应用综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

相关论文

Using Machine Learning to Anticipate Tipping Points and Extrapolate to Post-Tipping Dynamics of Non-Stationary Dynamical Systems

Using Machine Learning to Anticipate Tipping Points and Extrapolate to Post-Tipping Dynamics of Non-Stationary Dynamical Systems

Arxiv

0+阅读 · 2022年7月1日

Towards an Architecture-centric Methodology for Migrating to Microservices

Towards an Architecture-centric Methodology for Migrating to Microservices

Arxiv

0+阅读 · 2022年7月1日

Panning for gold: Lessons learned from the platform-agnostic automated detection of political content in textual data

Arxiv

0+阅读 · 2022年7月1日

A Time Series Forecasting Approach to Minimize Cold Start Time in Cloud-Serverless Platform

A Time Series Forecasting Approach to Minimize Cold Start Time in Cloud-Serverless Platform

Arxiv

0+阅读 · 2022年6月30日

There and Back Again: On Applying Data Reduction Rules by Undoing Others

There and Back Again: On Applying Data Reduction Rules by Undoing Others

Arxiv

0+阅读 · 2022年6月29日

Statistical Evaluation of Privacy-preserving Publication and Sharing of Three Types of COVID-19 Pandemic Data: Methods and Case Studies

Arxiv

0+阅读 · 2022年6月29日

In-network Computation for Large-scale Federated Learning over Wireless Edge Networks

Arxiv

0+阅读 · 2022年6月28日

Smart Application for Fall Detection Using Wearable ECG & Accelerometer Sensors

Arxiv

0+阅读 · 2022年6月28日

A Survey of Methods for Low-Power Deep Learning and Computer Vision

A Survey of Methods for Low-Power Deep Learning and Computer Vision

Arxiv

14+阅读 · 2020年3月24日

A Survey of Model Compression and Acceleration for Deep Neural Networks

Arxiv

66+阅读 · 2019年9月8日

相关基金

大块板状样品中子核数据宏观基准检验研究

国家自然科学基金

0+阅读 · 2015年12月31日

高血压血管重塑中血管保护分子CREG基因表达的上游调控机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Cadherin 11在骨关节炎滑膜炎及关节软骨破坏中的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

Serp-2 调控apoptosis和pyroptosis 对肝脏缺血再灌注损伤的保护作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

MiR-139-5p通过调控Rho/ROCK信号通路参与高血压心肌重塑

国家自然科学基金

0+阅读 · 2014年12月31日

miR125a-5p对血管内皮细胞和膜微粒功能的调控及其在缺血性脑卒中发病中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

Long non-coding RNA MEG3分子对胶质瘤干细胞调控作用的研究

国家自然科学基金

0+阅读 · 2013年12月31日

ADS中检测快中子束的GEM探测器的研制

国家自然科学基金

0+阅读 · 2013年12月31日

PPARγ和ANGPTL4基因表达在急性胰腺炎肺损伤发病机制中的作用及清胰汤的干预作用

国家自然科学基金

0+阅读 · 2011年12月31日

miR-29调控TGF-β1与COLI基因表达在增生性瘢痕形成中的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员