University of Twente Student Theses

Login
As of Friday, 8 August 2025, the current Student Theses repository is no longer available for thesis uploads. A new Student Theses repository will be available starting Friday, 15 August 2025.

From Prompt to Pwn : Browser-Empowered LLM Agents for Web Penetration Testing

Kalopisis, D. (2025) From Prompt to Pwn : Browser-Empowered LLM Agents for Web Penetration Testing.

[img] PDF
359kB
Abstract:Penetration testing simulates attacks on systems to uncover vulnerabilities before adversaries exploit them. It is labor-intensive, as testers must gather intelligence, plan exploits, interact with interfaces, and document actions precisely. Recent advances in language models (LLMs) have spurred efforts to automate parts of this workflow. Studies show LLM agents can generate commands, interpret tool output, and chain tasks, yet they lack programmatic control over a web browser. That omission limits their ability to test client-side logic critical in web applications. We explore whether adding browser access to LLM-driven agents closes that gap. We build two agents: a command-line baseline and a browser-enabled variant that can click, type, read the DOM, and monitor network traffic. Both are evaluated on twenty-seven Web Security labs covering nine common web vulnerabilities. The browser-enabled agent solves 66.7 percent of the labs, compared to 40.7 percent for the baseline, expands coverage from three to seven vulnerability classes, and shows the biggest gains on medium-difficulty tasks. These results confirm that giving agents control over a web browser is a significant advance for automated penetration testing. The next challenges include adding precise timing analysis and support for out-of-band interactions to achieve autonomy.
Item Type:Essay (Bachelor)
Faculty:EEMCS: Electrical Engineering, Mathematics and Computer Science
Subject:54 computer science
Programme:Computer Science BSc (56964)
Link to this item:https://purl.utwente.nl/essays/107568
Export this item as:BibTeX
EndNote
HTML Citation
Reference Manager

 

Repository Staff Only: item control page