Advances in information technology and its widespread growth in several areas of business, engineering, medical and scientific studies are resulting in information/data explosion. Knowledge discovery and decision making from such rapidly growing voluminous data is a challenging task in terms of data organization and processing, which is an emerging trend known as Big Data Computing; a new paradigm which combines large scale compute, new data intensive techniques and mathematical models to build data analytics. Big Data computing demands a huge storage and computing for data curation and processing that could be delivered from on-premise or clouds infrastructures. This paper discusses the evolution of Big Data computing, differences between traditional data warehousing and Big Data, taxonomy of Big Data computing and underpinning technologies, integrated platform of Big Data and Clouds known as Big Data Clouds, layered architecture and components of Big Data Cloud and finally discusses open technical challenges and future directions.
翻译:信息技术的进步及其在若干商业、工程、医学和科学研究领域的广泛增长正在导致信息/数据爆炸,从这种迅速增长的大量数据中发现知识和作出决策,在数据组织和处理方面是一项具有挑战性的任务,这是被称为大数据计算的新趋势;一种新的模式,结合大规模计算、新的数据密集型技术和数学模型来建立数据分析;大数据计算需要庞大的储存和计算,用于数据整理和处理,而数据整理和处理可以来自局部或云层基础设施;本文讨论了大数据计算的变化、传统数据仓储和大数据之间的差异、大数据计算和基础技术的分类学、称为大数据云的大型数据和云的综合平台、多层结构以及大数据云的组成部分,并最后讨论了开放的技术挑战和今后的方向。