University of Twente Student Theses


The Evolution of Data Storage Architectures : Examining the Value of the Data Lakehouse

Janssen, N.E. (2022) The Evolution of Data Storage Architectures : Examining the Value of the Data Lakehouse.

Abstract: In today’s world, data has become one of the most important resources a company can have, and the desire to be data-driven has grown. However, choosing a suitable storage architecture to efficiently store, process, and manage data from numerous sources remains challenging. Currently, there are three generations of storage architecture, of which the newest (known as the data lakehouse) was introduced in 2020. Given its novelty, limited research has been done into the rationale behind its introduction, its strengths, and its weaknesses. To fill this gap, this study answers the following research question: “What is the added value of the data lakehouse architecture in your data management platform?” A systematic literature review and expert interviews were conducted to answer this question. As a result, this study presents two models: 1) a data storage evolution model and 2) a fine-grained reference architecture of the data lakehouse. In essence, the value of the data lakehouse can be explained through 1) the combination of best practices from the data warehouse and the data lake and 2) the introduction of a data management layer on the storage object.
Item Type: Essay (Master)
Avanade, Amsterdam, Netherlands
Faculty: EEMCS: Electrical Engineering, Mathematics and Computer Science
Subject: 54 computer science
Programme: Business Information Technology MSc (60025)