Fine-tuning transformer models for commit message generation and autocompletion
Miksik, M. (2023)
Commit messages provide insight into the developer’s intentions and motivations — a fundamental source of information in the exceptionally collaborative discipline of software development. To assist the process of commit message writing, we fine-tune two transformer models for commit message generation and integrate them into popular code editors. We 1) collect and publish a dataset of commit message and patch pairs for 6 different programming languages, 2) fine-tune two generative language models for commit message autocompletion and generation tasks, and 3) provide integration for these models with popular code editors (IntelliJ, VSCode). Lastly, we show that on the test dataset, our fine-tuned models perform 2x and 10x times better than the base models for the completion and generation tasks, respectively.
miksik_BA_eemc.pdf