Transforming Statistical Linked Data for Use in OLAP Systems
Published: 2011 September
Buchtitel: I-SEMANTICS 2011, 7th Int. Conf. on Semantic Systems
The amount of available Linked Data on the Web is increasing, and data providers start to publish statistical datasets that comprise numerical data. Such statistical datasets differ significantly from the currently predominant network-style data published on the Web. We explore the possibility of integrating statistical data from multiple Linked Data sources. We provide a mapping from statistical Linked Data into the Multidimensional Model used in data warehouses. We use an extract-transform-load (ETL) pipeline to convert statistical Linked Data into a format suitable for loading into an open-source OLAP system, and thus demonstrate how standard OLAP infrastructure can be used for elaborate querying and visualisation of integrated statistical Linked Data. We discuss lessons learned from three experiments and identify areas which require future work to ultimately arrive at a well-interlinked set of statistical data from multiple sources which is processable with standard OLAP systems.
Download: Media:Kaempgen harth isem11 olap.pdf
Presentation given at conference: Media:Kaempgen_harth_isem11_presentation_final.pdf