EXMA: 精确匹配的基因组加速器 (EXMA: A Genomics Accelerator for Exact-Matching) - 专知论文

会员服务 ·

0

行 · state-of-the-art · 可约的 · 哈尔滨工业大学（HIT） · Processing（编程语言） ·

2021 年 1 月 13 日

EXMA: A Genomics Accelerator for Exact-Matching

翻译：EXMA: 精确匹配的基因组加速器

Lei Jiang,Farzaneh Zokaee

from arxiv, IEEE International Symposium on High-Performance Computer Architecture, 2021

Genomics is the foundation of precision medicine, global food security and virus surveillance. Exact-match is one of the most essential operations widely used in almost every step of genomics such as alignment, assembly, annotation, and compression. Modern genomics adopts Ferragina-Manzini Index (FM-Index) augmenting space-efficient Burrows-Wheeler transform (BWT) with additional data structures to permit ultra-fast exact-match operations. However, FM-Index is notorious for its poor spatial locality and random memory access pattern. Prior works create GPU-, FPGA-, ASIC- and even process-in-memory (PIM)-based accelerators to boost FM-Index search throughput. Though they achieve the state-of-the-art FM-Index search throughput, the same as all prior conventional accelerators, FM-Index PIMs process only one DNA symbol after each DRAM row activation, thereby suffering from poor memory bandwidth utilization. In this paper, we propose a hardware accelerator, EXMA, to enhance FM-Index search throughput. We first create a novel EXMA table with a multi-task-learning (MTL)-based index to process multiple DNA symbols with each DRAM row activation. We then build an accelerator to search over an EXMA table. We propose 2-stage scheduling to increase the cache hit rate of our accelerator. We introduce dynamic page policy to improve the row buffer hit rate of DRAM main memory. We also present CHAIN compression to reduce the data structure size of EXMA tables. Compared to state-of-the-art FM-Index PIMs, EXMA improves search throughput by $4.9\times$, and enhances search throughput per Watt by $4.8\times$.

翻译：基因组是精密医学、全球粮食安全和病毒监控的基础。精确匹配是几乎所有基因组学步骤( 如对齐、组装、批注和压缩) 中广泛使用的最重要的操作之一。现代基因组学采用Ferragina- Manzini指数( FM- Index), 增加空间高效的 Burrows- Wheeleler 转换( BWT), 增加数据结构, 允许超快的精确匹配操作。然而, FM- Index 因其空间位置差和随机存储访问模式而臭名昭著。先前的工作创建了 GPPU、 FPGA、 ASIC- 甚至是进程模拟( PIM) 。以基于 GPP- Index 的加速器, 提升了调频- Index 搜索速度。尽管它们实现了最先进的调频调频的搜索进程, 也通过 We- Indler PIMS 程序, 将一个DNA符号升级到现在的调动的调频- IMA 。

0

相关内容

2020数据工程师成长路线图

专知会员服务

38+阅读 · 2020年9月6日

低秩稀疏矩阵优化问题的模型与算法

专知会员服务

41+阅读 · 2020年7月29日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

75+阅读 · 2020年7月26日

图机器学习-图拉普拉斯算子的离散正则性，141页ppt，Discrete regularity graph Laplacians

专知会员服务

28+阅读 · 2020年6月4日

【CCL 2019】ATT-第19期：Frontiers in Network Embedding and GCN （崔鹏）

【CCL 2019】ATT-第19期：Frontiers in Network Embedding and GCN （崔鹏）

专知会员服务

42+阅读 · 2019年11月12日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

23+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

31+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

53+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年6月24日

计算机 | IUI 2020等国际会议信息4条

计算机 | IUI 2020等国际会议信息4条

Call4Papers

6+阅读 · 2019年6月17日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

13+阅读 · 2019年5月15日

CCF A类 | 顶级会议RTSS 2019诚邀稿件

CCF A类 | 顶级会议RTSS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年4月17日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

Generative Adversarial Text to Image Synthesis论文解读

Generative Adversarial Text to Image Synthesis论文解读

统计学习与视觉计算组

13+阅读 · 2017年6月9日

Fast and Accurate Model Scaling

Arxiv

0+阅读 · 2021年3月11日

Acceleration Methods

Arxiv

0+阅读 · 2021年3月11日

Pruning and Quantization for Deep Neural Network Acceleration: A Survey

Arxiv

0+阅读 · 2021年3月11日

A Computed Tomography Vertebral Segmentation Dataset with Anatomical Variations and Multi-Vendor Scanner Data

Arxiv

0+阅读 · 2021年3月10日

Faster Binary Embeddings for Preserving Euclidean Distances

Arxiv

0+阅读 · 2021年3月10日

$FM^2$: Field-matrixed Factorization Machines for Recommender Systems

Arxiv

16+阅读 · 2021年2月20日

Neural Architecture Generator Optimization

Arxiv

6+阅读 · 2020年10月8日

Horizontal Pyramid Matching for Person Re-identification

Arxiv

3+阅读 · 2018年4月30日

Cross-Domain Image Matching with Deep Feature Maps

Arxiv

13+阅读 · 2018年4月6日

Practical sketching algorithms for low-rank matrix approximation

Arxiv

4+阅读 · 2018年1月2日

VIP会员

文章信息

相关主题

state-of-the-art

哈尔滨工业大学（HIT）

Processing（编程语言）

相关VIP内容

2020数据工程师成长路线图

专知会员服务

38+阅读 · 2020年9月6日

低秩稀疏矩阵优化问题的模型与算法

专知会员服务

41+阅读 · 2020年7月29日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

75+阅读 · 2020年7月26日

图机器学习-图拉普拉斯算子的离散正则性，141页ppt，Discrete regularity graph Laplacians

专知会员服务

28+阅读 · 2020年6月4日

【CCL 2019】ATT-第19期：Frontiers in Network Embedding and GCN （崔鹏）

【CCL 2019】ATT-第19期：Frontiers in Network Embedding and GCN （崔鹏）

专知会员服务

42+阅读 · 2019年11月12日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

23+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

31+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

53+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

相关资讯

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年6月24日

计算机 | IUI 2020等国际会议信息4条

计算机 | IUI 2020等国际会议信息4条

Call4Papers

6+阅读 · 2019年6月17日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

13+阅读 · 2019年5月15日

CCF A类 | 顶级会议RTSS 2019诚邀稿件

CCF A类 | 顶级会议RTSS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年4月17日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

Generative Adversarial Text to Image Synthesis论文解读

Generative Adversarial Text to Image Synthesis论文解读

统计学习与视觉计算组

13+阅读 · 2017年6月9日

相关论文

Fast and Accurate Model Scaling

Arxiv

0+阅读 · 2021年3月11日

Acceleration Methods

Arxiv

0+阅读 · 2021年3月11日

Pruning and Quantization for Deep Neural Network Acceleration: A Survey

Arxiv

0+阅读 · 2021年3月11日

A Computed Tomography Vertebral Segmentation Dataset with Anatomical Variations and Multi-Vendor Scanner Data

Arxiv

0+阅读 · 2021年3月10日

Faster Binary Embeddings for Preserving Euclidean Distances

Arxiv

0+阅读 · 2021年3月10日

$FM^2$: Field-matrixed Factorization Machines for Recommender Systems

Arxiv

16+阅读 · 2021年2月20日

Neural Architecture Generator Optimization

Arxiv

6+阅读 · 2020年10月8日

Horizontal Pyramid Matching for Person Re-identification

Arxiv

3+阅读 · 2018年4月30日

Cross-Domain Image Matching with Deep Feature Maps

Arxiv

13+阅读 · 2018年4月6日

Practical sketching algorithms for low-rank matrix approximation

Arxiv

4+阅读 · 2018年1月2日

微信扫码咨询专知VIP会员