University of Twente Student Theses

Login

Predicting New Web Pages on the World Wide Web Using Topological Features

Sustronk, J.J. (2019) Predicting New Web Pages on the World Wide Web Using Topological Features.

[img] PDF
1MB
Abstract:Every day, numerous new web pages are created on the World Wide Web. These new web pages may contain information relevant to many people. However, finding these new pages is a non-trivial task. This research focuses on finding a function based on topological features that predicts where in the World Wide Web new pages can be found. Using various supervised Machine Learning techniques we show that we can predict with an accuracy and precision of over 90% where new pages can be found. The Random Forest algorithm obtained the highest performance.
Item Type:Essay (Bachelor)
Faculty:EEMCS: Electrical Engineering, Mathematics and Computer Science
Subject:31 mathematics, 54 computer science
Programme:Applied Mathematics BSc (56965)
Link to this item:https://purl.utwente.nl/essays/78964
Export this item as:BibTeX
EndNote
HTML Citation
Reference Manager

 

Repository Staff Only: item control page