University of Twente Student Theses
The Evolution of Data Storage Architectures : Examining the Value of the Data Lakehouse
Janssen, N.E. (2022) The Evolution of Data Storage Architectures : Examining the Value of the Data Lakehouse.
Abstract: | In today’s world, data has become an important resource for companies, and the desire to be data-driven has grown. However, choosing a suitable storage architecture to efficiently store, process, and manage data from numerous sources remains challenging. Currently, there are three generations of storage architecture, of which the newest (known as the data lakehouse) was introduced in 2020. Given its novelty, limited research has been done into the rationale behind its introduction, its strengths, and its weaknesses. To fill this gap, this study answers the following research question: “What is the added value of the data lakehouse architecture in your data management platform?” A systematic literature review and expert interviews were conducted to answer this question. As a result, this study presents two models: 1) a data storage evolution model and 2) a fine-grained reference architecture of the data lakehouse. In essence, the value of the data lakehouse can be explained through 1) the combination of best practices from the data warehouse and data lake and 2) the introduction of a data management layer on the storage object. |
Item Type: | Essay (Master) |
Clients: | Avanade, Amsterdam, Netherlands |
Faculty: | EEMCS: Electrical Engineering, Mathematics and Computer Science |
Subject: | 54 computer science |
Programme: | Business Information Technology MSc (60025) |
Link to this item: | https://purl.utwente.nl/essays/92801 |