University of Twente Student Theses

Login

Solving an order acceptance sequential decision-making problem with Q-learning

Calvino Sobrido, Raul (2024) Solving an order acceptance sequential decision-making problem with Q-learning.

This is the latest version of this item.

[img] PDF
1MB
Abstract:On this thesis we explored how can a develops a sequential decision-making problem for a fast-moving consumer good delivery company be solved using Reinforcement Learning methods. We will implement an offline tabular Q-learning algorithm that learns the optimal policy based on a specific state space combination and a point in time of the day. Additionally, we present a simulation environment for the Q-learning algorithm to learn the policy and compare the performance of the Q-learning agent with a company derived policy. With this information, we present a series of recommendations to the company on what conclusions can be made from the policy derived by the Q-learning algorithm.
Item Type:Essay (Bachelor)
Faculty:BMS: Behavioural, Management and Social Sciences
Subject:50 technical science in general, 54 computer science, 85 business administration, organizational science
Programme:Industrial Engineering and Management BSc (56994)
Link to this item:https://purl.utwente.nl/essays/102140
Export this item as:BibTeX
EndNote
HTML Citation
Reference Manager

 

Repository Staff Only: item control page