University of Twente Student Theses

Solving an order acceptance sequential decision-making problem with Q-learning

Calvino Sobrido, Raul (2024) Solving an order acceptance sequential decision-making problem with Q-learning.

This is the latest version of this item.


Abstract:	In this thesis we explore how a sequential decision-making problem for a fast-moving consumer goods delivery company can be solved using reinforcement learning methods. We implement an offline tabular Q-learning algorithm that learns the optimal policy as a function of the state-space combination and the time of day. Additionally, we present a simulation environment in which the Q-learning algorithm learns the policy, and we compare the performance of the Q-learning agent with that of a company-derived policy. Based on these results, we present a series of recommendations to the company on what conclusions can be drawn from the policy derived by the Q-learning algorithm.
Item Type:	Essay (Bachelor)
Faculty:	BMS: Behavioural, Management and Social Sciences
Subject:	50 technical science in general, 54 computer science, 85 business administration, organizational science
Programme:	Industrial Engineering and Management BSc (56994)
Link to this item:	https://purl.utwente.nl/essays/102140
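
For readers unfamiliar with the method named in the abstract, the following is a minimal sketch of tabular Q-learning on a toy order acceptance problem. It is not the thesis's implementation: this record does not give the state space, rewards, or simulator, so the state `(capacity, period, order value)`, the constants `HORIZON`, `CAPACITY`, and `ORDER_VALUES`, and the helper names `actions`, `step`, `train`, and `greedy_action` are all hypothetical choices made for illustration.

```python
import random
from collections import defaultdict

REJECT, ACCEPT = 0, 1
HORIZON = 6             # decision periods per day (assumed)
CAPACITY = 2            # orders that can be served per day (assumed)
ORDER_VALUES = (1, 5)   # possible order revenues (assumed)


def actions(state):
    """Legal actions: an order can only be accepted if capacity remains."""
    capacity, _period, _value = state
    return (REJECT, ACCEPT) if capacity > 0 else (REJECT,)


def step(state, action, rng):
    """Toy simulator: apply the decision, then draw the next arriving order."""
    capacity, period, value = state
    reward = float(value) if action == ACCEPT else 0.0
    if action == ACCEPT:
        capacity -= 1
    next_period = period + 1
    done = next_period == HORIZON
    next_state = (capacity, next_period, rng.choice(ORDER_VALUES))
    return next_state, reward, done


def train(episodes=20000, alpha=0.1, gamma=1.0, epsilon=0.1, seed=0):
    """Learn a Q-table in simulation with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = defaultdict(float)  # Q-table keyed by (state, action), default 0.0
    for _ in range(episodes):
        state = (CAPACITY, 0, rng.choice(ORDER_VALUES))
        done = False
        while not done:
            legal = actions(state)
            if rng.random() < epsilon:
                action = rng.choice(legal)
            else:
                action = max(legal, key=lambda a: q[(state, a)])
            next_state, reward, done = step(state, action, rng)
            # Q-learning target: reward plus the best estimated future value.
            best_next = 0.0 if done else max(
                q[(next_state, a)] for a in actions(next_state)
            )
            target = reward + gamma * best_next
            q[(state, action)] += alpha * (target - q[(state, action)])
            state = next_state
    return q


def greedy_action(q, state):
    """Read the learned policy out of the Q-table for one state."""
    return max(actions(state), key=lambda a: q[(state, a)])
```

After `q = train()`, a call such as `greedy_action(q, (1, HORIZON - 1, 5))` returns the learned decision for a day with one remaining slot, in the last period, facing an order of value 5. Including the period in the state is what lets the learned policy depend on the time of day, as the abstract describes.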