University of Twente Student Theses

Login

The evaluation of the conditioning task regarding probabilistic databases

Nieuwenhuizen, Julian van den (2024) The evaluation of the conditioning task regarding probabilistic databases.

[img] PDF
454kB
Abstract:Probabilistic Databases aim to solve the problem of inconsistent or uncertain data collection and storage by using data representation on a probabilistic basis. By using this representation the database is able to store, retrieve and change uncertain data . Probabilistic data Integration (PDI) is a type of data integration that achieves this goal. The PDI process contains 2 phases , the integration of data where data quality problems are not immediately solved but instead are represented as uncertainty in the probabilistic database. Afterwards the data will be continuously improved by gathering evidence through for example user feedback and improving the data accordingly. This research will focus primarily on the second step of the PDI process. The second phase of the PDI process is called conditioning. This paper shows results regarding the scalability of conditioning as well as the scalability of optimizing the database after a conditioning cycle. Furthermore, it shows in what situations optimizing the conditioned database has a positive effect.
Item Type:Essay (Bachelor)
Faculty:EEMCS: Electrical Engineering, Mathematics and Computer Science
Subject:30 exact sciences in general, 54 computer science
Programme:Business & IT BSc (56066)
Link to this item:https://purl.utwente.nl/essays/100775
Export this item as:BibTeX
EndNote
HTML Citation
Reference Manager

 

Repository Staff Only: item control page