Data Warehouse Hybrid Modeling Methodology

Authors

  • Viktor László Takács Institute of Applied Informatics and Logistics, Faculty of Economics and Business, University of Debrecen, Debrecen https://orcid.org/0000-0001-8433-6115
  • Katalin Bubnó School of Mathematical and Computational Sciences, University of Debrecen, Debrecen https://orcid.org/0000-0002-5445-2374
  • Gergely Gábor Ráthonyi Institute of Applied Informatics and Logistics, Faculty of Economics and Business, University of Debrecen, Debrecen https://orcid.org/0000-0002-1184-1660
  • Éva Bácsné Bába Institute of Rural Development, Tourism and Sports Management, Faculty of Economics and Business, University of Debrecen, Debrecen
  • Róbert Szilágyi Institute of Applied Informatics and Logistics, Faculty of Economics and Business, University of Debrecen, Debrecen https://orcid.org/0000-0002-1783-6483

DOI:

https://doi.org/10.5334/dsj-2020-038

Keywords:

Formal methods, Data science, Information Management

Abstract

The classic conceptual modeling around business processes followed by the ‘bus matrix’ methodology of designing the data cubes of data warehouses (Kimball & Ross 2013). For a serious system, such a quantity of management questions and dimensions, the bus matrix results a difficult-to-understand conceptual data model. The subject of automation and conceptual design – to which many individual methods already have been developed – are relevant topics in today’s literature also.

In the 2010s data warehouse projects were realized in Hungarian higher education to inform the decision makers of the universities about their own institutions. As we participated in this project in 2009–2010, we faced that our bus matrix at the end contained about 80–120 indicators with nearly 200 dimensions (dimensional attributes), therefore we worked on the early stenography to formalize the management question.

We provide a kind of ‘business intelligence problem solving thinking’ and a kind of descriptive language that can serve it and present a method which has two novelties compared to formers:

  1. It is based on the management questions and its visualization.
  2. As a kind of stenography, it is always based on the terminology corresponding to the current problem, so it forms an intermediate language for the data model.

We introduce our method through an example in a popular research area which is activity tracking.

Downloads

Published

2020-10-13

Issue

Section

Research Papers