University of Twente Student Theses
Machine driven predictions of the socio-economic status of Twitter users
Mentink, Fons (2016) Machine driven predictions of the socio-economic status of Twitter users.
PDF
2MB |
Abstract: | In this study, we examine the possibilities of creating predictions of the Socio-Economic Status of twitter users with the use of machine learning algorithms. The study starts with an ethical evaluation of the process and data collection. After the data is collected we explore the data and create features and labels. We then employ machine learning algorithms out of the box and combine the prediction probabilities via a soft-voting ensemble method. We find that the model outperforms dummies, but has troubles with the skewed data. We run additional tests to examine the performance of the classifiers when the tasks increase in probability, where we find the random forest to show the most promising results. |
Item Type: | Essay (Master) |
Faculty: | EEMCS: Electrical Engineering, Mathematics and Computer Science |
Subject: | 54 computer science |
Programme: | Business Information Technology MSc (60025) |
Link to this item: | https://purl.utwente.nl/essays/70144 |
Export this item as: | BibTeX EndNote HTML Citation Reference Manager |
Repository Staff Only: item control page