University of Twente Student Theses
The evaluation of the conditioning task regarding probabilistic databases
Nieuwenhuizen, Julian van den (2024) The evaluation of the conditioning task regarding probabilistic databases.
Abstract: | Probabilistic databases aim to solve the problem of inconsistent or uncertain data collection and storage by representing data probabilistically. With this representation, the database can store, retrieve and change uncertain data. Probabilistic Data Integration (PDI) is a type of data integration that achieves this goal. The PDI process consists of two phases. In the first phase, data is integrated, and data quality problems are not solved immediately but are instead represented as uncertainty in the probabilistic database. In the second phase, the data is continuously improved by gathering evidence, for example through user feedback, and updating the data accordingly. This research focuses primarily on the second phase, which is called conditioning. This paper shows results regarding the scalability of conditioning as well as the scalability of optimizing the database after a conditioning cycle. Furthermore, it shows in which situations optimizing the conditioned database has a positive effect. |
Item Type: | Essay (Bachelor) |
Faculty: | EEMCS: Electrical Engineering, Mathematics and Computer Science |
Subject: | 30 exact sciences in general, 54 computer science |
Programme: | Business & IT BSc (56066) |
Link to this item: | https://purl.utwente.nl/essays/100775 |