University of Twente Student Theses
Solving an order acceptance sequential decision-making problem with Q-learning
Calvino Sobrido, Raul (2024) Solving an order acceptance sequential decision-making problem with Q-learning.
This is the latest version of this item.
PDF
1MB |
Abstract: | On this thesis we explored how can a develops a sequential decision-making problem for a fast-moving consumer good delivery company be solved using Reinforcement Learning methods. We will implement an offline tabular Q-learning algorithm that learns the optimal policy based on a specific state space combination and a point in time of the day. Additionally, we present a simulation environment for the Q-learning algorithm to learn the policy and compare the performance of the Q-learning agent with a company derived policy. With this information, we present a series of recommendations to the company on what conclusions can be made from the policy derived by the Q-learning algorithm. |
Item Type: | Essay (Bachelor) |
Faculty: | BMS: Behavioural, Management and Social Sciences |
Subject: | 50 technical science in general, 54 computer science, 85 business administration, organizational science |
Programme: | Industrial Engineering and Management BSc (56994) |
Link to this item: | https://purl.utwente.nl/essays/102140 |
Export this item as: | BibTeX EndNote HTML Citation Reference Manager |
Repository Staff Only: item control page