通过部分线索保护在GPPPPPU应用中增强软件复原力 (Enabling Software Resilience in GPGPU Applications via Partial Thread Protection) - 专知论文

会员服务 ·

0

通用GPU · 可辨认的 · SOFT · Performance · Processing（编程语言） ·

2021 年 3 月 4 日

Enabling Software Resilience in GPGPU Applications via Partial Thread Protection

翻译：通过部分线索保护在GPPPPPU应用中增强软件复原力

Lishan Yang,Bin Nie,Adwait Jog,Evgenia Smirni

from arxiv, Accepted to the 43rd International Conference on Software Engineering (ICSE 2021)

Graphics Processing Units (GPUs) are widely used by various applications in a broad variety of fields to accelerate their computation but remain susceptible to transient hardware faults (soft errors) that can easily compromise application output. By taking advantage of a general purpose GPU application hierarchical organization in threads, warps, and cooperative thread arrays, we propose a methodology that identifies the resilience of threads and aims to map threads with the same resilience characteristics to the same warp. This allows engaging partial replication mechanisms for error detection/correction at the warp level. By exploring 12 benchmarks (17 kernels) from 4 benchmark suites, we illustrate that threads can be remapped into reliable or unreliable warps with only 1.63% introduced overhead (on average), and then enable selective protection via replication to those groups of threads that truly need it. Furthermore, we show that thread remapping to different warps does not sacrifice application performance. We show how this remapping facilitates warp replication for error detection and/or correction and achieves an average reduction of 20.61% and 27.15% execution cycles, respectively comparing to standard duplication/triplication.

翻译：图形处理器( GPU) 被各种应用在广泛的领域广泛广泛使用, 以加快计算速度, 但仍易发生瞬时硬件故障( 软错误), 容易影响应用输出。我们利用一般目的 GPU 应用分级组织在线条、扭曲器和合作线条阵列中, 提议了一种方法, 用以识别线条的弹性, 并用相同的弹性特性绘制线条到同一个扭曲器。这样可以使用部分复制机制在扭曲级别探测/校正错误。通过从 4 个基准套中探索 12 个基准( 17 内核 ), 我们说明线条可以重新绘制成可靠或不可靠的扭曲点( 平均), 只引入1.63% 的管理费, 然后通过复制真正需要它的那些线条来进行选择性保护。此外, 我们显示, 向不同的线条重新绘制不会牺牲应用程序的性能。我们展示了这种重新绘制如何促进在错误检测和/ 校正点上进行重复制, 并实现平均减少 20.61% 和27. 15% 执行周期, 。

0

相关内容

通用GPU

Google最新《机器学习对偶性》报告，48页ppt

Google最新《机器学习对偶性》报告，48页ppt

专知会员服务

36+阅读 · 2020年11月29日

Effective.Modern.C++ 中英文版，334页pdf

Effective.Modern.C++ 中英文版，334页pdf

专知会员服务

68+阅读 · 2020年11月4日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

分布式并行架构Ray介绍

分布式并行架构Ray介绍

CreateAMind

10+阅读 · 2019年8月9日

revelation of MONet

revelation of MONet

CreateAMind

5+阅读 · 2019年6月8日

计算机 | USENIX Security 2020等国际会议信息5条

计算机 | USENIX Security 2020等国际会议信息5条

Call4Papers

7+阅读 · 2019年4月25日

【TED】生命中的每一年的智慧

【TED】生命中的每一年的智慧

英语演讲视频每日一推

10+阅读 · 2019年1月29日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

计算机类 | 11月截稿会议信息9条

计算机类 | 11月截稿会议信息9条

Call4Papers

6+阅读 · 2018年10月14日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

已删除

将门创投

4+阅读 · 2017年12月12日

Multi-Objective Reconstruction Of Software Architecture

Arxiv

0+阅读 · 2021年4月28日

Achieving High Throughput and Elasticity in a Larger-than-Memory Store

Arxiv

0+阅读 · 2021年4月27日

SoK: Cryptojacking Malware

Arxiv

0+阅读 · 2021年4月26日

A PGAS Communication Library for Heterogeneous Clusters

Arxiv

0+阅读 · 2021年4月26日

Efficient Replication via Timestamp Stability (Extended Version)

Arxiv

0+阅读 · 2021年4月25日

ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio

Arxiv

0+阅读 · 2021年4月23日

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Arxiv

16+阅读 · 2020年3月12日

Efficient Parameter-free Clustering Using First Neighbor Relations

Efficient Parameter-free Clustering Using First Neighbor Relations

Arxiv

7+阅读 · 2019年2月28日

Deep Feature Aggregation with Heat Diffusion for Image Retrieval

Arxiv

7+阅读 · 2018年6月2日

Learning to Evade Static PE Machine Learning Malware Models via Reinforcement Learning

Arxiv

3+阅读 · 2018年1月30日

VIP会员

文章信息

相关主题

Processing（编程语言）

相关VIP内容

Google最新《机器学习对偶性》报告，48页ppt

Google最新《机器学习对偶性》报告，48页ppt

专知会员服务

36+阅读 · 2020年11月29日

Effective.Modern.C++ 中英文版，334页pdf

Effective.Modern.C++ 中英文版，334页pdf

专知会员服务

68+阅读 · 2020年11月4日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《无人机集群配置对模拟作战环境任务效能的影响研究》最新50页

《俄罗斯作战模式解析：对俄特别军事行动的观察报告》最新325页

军用无人机集群技术尚未成熟——但潜力可期

《无人机改变战争规则，但无法破解陆战固有挑战》最新报告

相关资讯

分布式并行架构Ray介绍

分布式并行架构Ray介绍

CreateAMind

10+阅读 · 2019年8月9日

revelation of MONet

revelation of MONet

CreateAMind

5+阅读 · 2019年6月8日

计算机 | USENIX Security 2020等国际会议信息5条

计算机 | USENIX Security 2020等国际会议信息5条

Call4Papers

7+阅读 · 2019年4月25日

【TED】生命中的每一年的智慧

【TED】生命中的每一年的智慧

英语演讲视频每日一推

10+阅读 · 2019年1月29日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

计算机类 | 11月截稿会议信息9条

计算机类 | 11月截稿会议信息9条

Call4Papers

6+阅读 · 2018年10月14日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

已删除

将门创投

4+阅读 · 2017年12月12日

相关论文

Multi-Objective Reconstruction Of Software Architecture

Arxiv

0+阅读 · 2021年4月28日

Achieving High Throughput and Elasticity in a Larger-than-Memory Store

Arxiv

0+阅读 · 2021年4月27日

SoK: Cryptojacking Malware

Arxiv

0+阅读 · 2021年4月26日

A PGAS Communication Library for Heterogeneous Clusters

Arxiv

0+阅读 · 2021年4月26日

Efficient Replication via Timestamp Stability (Extended Version)

Arxiv

0+阅读 · 2021年4月25日

ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio

Arxiv

0+阅读 · 2021年4月23日

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Arxiv

16+阅读 · 2020年3月12日

Efficient Parameter-free Clustering Using First Neighbor Relations

Efficient Parameter-free Clustering Using First Neighbor Relations

Arxiv

7+阅读 · 2019年2月28日

Deep Feature Aggregation with Heat Diffusion for Image Retrieval

Arxiv

7+阅读 · 2018年6月2日

Learning to Evade Static PE Machine Learning Malware Models via Reinforcement Learning

Arxiv

3+阅读 · 2018年1月30日

微信扫码咨询专知VIP会员