Performance evaluation is a key issue for designers and users of Database Management Systems (DBMSs). Performance is generally assessed with software benchmarks that help, e.g., test architectural choices, compare different technologies or tune a system. In the particular context of data warehousing and On-Line Analytical Processing (OLAP), although the Transaction Processing Performance Council (TPC) aims at issuing standard decision-support benchmarks, few benchmarks do actually exist. We present in this chapter the Data Warehouse Engineering Benchmark (DWEB), which allows generating various ad-hoc synthetic data warehouses and workloads. DWEB is fully parameterized to fulfill various data warehouse design needs. However, two levels of parameterization keep it relatively easy to tune. We also expand on our previous work on DWEB by presenting its new Extract, Transform, and Load (ETL) feature as well as its new execution protocol. A Java implementation of DWEB is freely available on-line, which can be interfaced with most existing relational DMBSs. To the best of our knowledge, DWEB is the only easily available, up-to-date benchmark for data warehouses.
翻译:对数据库管理系统(DBMS)的设计者和用户来说,业绩评估是一个关键问题。一般用软件基准评估业绩,这些软件基准有助于测试建筑选择、比较不同的技术或调控一个系统。在数据仓储和在线分析处理(OLAP)的特殊背景下,尽管交易处理业绩理事会(TPC)的目标是发布标准决定支持基准,但实际上几乎没有基准存在。我们在本章中提出了数据仓库工程基准(DWEB),它允许产生各种临时合成数据仓库和工作量。DWEB被充分参数化,以满足各种数据仓库的设计需要。然而,两个参数化等级使得它相对容易调和。我们还扩大了我们以前关于DWEB的工作,介绍了其新的抽取、变换和加载(ETL)特征及其新的执行协议。DWEB的Java实施在网上是免费的,可以与大多数现有的关系型数据库连接。据我们所知,DWEB是数据仓库唯一容易获得的最新基准。