Hospitals around the world collect massive amount of physiological data from their patients every day. Recently, there has been increasing research interest to subject this data into statistical analysis for gaining more insights and providing improved medical diagnoses. Enabling such advancements in healthcare require efficient data processing systems. In this paper, we show that currently available data processing solutions either fail to meet the performance requirements or lack simple and flexible programming interfaces. To address this problem, we propose LifeStream, a high performance stream processing engine for physiological data. LifeStream hits the sweet spot between ease of programming by providing a rich temporal query language support and performance by employing optimizations that exploit the constant frequency nature of physiological data. LifeStream demonstrates end-to-end performance up to $7.5\times$ higher than state-of-the-art streaming engines and $3.2\times$ than hand-optimized numerical libraries.
翻译:世界各地的医院每天从病人那里收集大量的生理数据。最近,人们越来越有兴趣将这些数据纳入统计分析,以获得更多的洞察力和更好的医学诊断。在保健领域实现这种进步需要高效的数据处理系统。在本文中,我们表明,目前现有的数据处理解决方案要么达不到性能要求,要么缺乏简单灵活的编程界面。为解决这一问题,我们提议了生命系统,这是一个高性能流处理生理数据的引擎。生命系统通过利用利用生理数据常年频率的优化手段提供丰富的时间查询语言支持和性能,在编程的方便度之间达到一个甜点。生命系统显示,其端到端的性能比最先进的流动引擎高出7.5美元,比手操作的数字图书馆高出3.2美元。