The data warehousing and OLAP technologies are now moving onto handling complex data that mostly originate from the Web. However, intagrating such data into a decision-support process requires their representation under a form processable by OLAP and/or data mining techniques. We present in this paper a complex data warehousing methodology that exploits XML as a pivot language. Our approach includes the integration of complex data in an ODS, under the form of XML documents; their dimensional modeling and storage in an XML data warehouse; and their analysis with combined OLAP and data mining techniques. We also address the crucial issue of performance in XML warehouses.
翻译:目前,数据仓储和OLAP技术正在转向处理主要来自网络的复杂数据,然而,将这些数据拖入决策支持进程,要求以可采用OLAP和(或)数据挖掘技术处理的形式代表这些数据。我们在本文件中提出了一个复杂的数据仓储方法,利用XML作为主言。我们的方法包括将复杂的数据纳入正式文件系统,采用XML文件的形式;在XML数据仓库中进行尺寸建模和储存;结合OLAP和数据挖掘技术进行分析。我们还处理XML仓库中业绩的关键问题。