University of Twente Student Theses

Login

Fisher Information aware dynamic compression of Language Transformer networks using SVD

Hofstee, S.B.H.C. (2024) Fisher Information aware dynamic compression of Language Transformer networks using SVD.

[img] PDF
419kB
Abstract:We propose a novel approach to Singular Value Decomposition (SVD) for low-rank compression, addressing limitations in previous methods such as Fisher Weighted SVD (FWSVD) and True Fisher Weighted SVD (TFWSVD), which apply uniform compression ratios across layers without considering intra- and inter-layer information characteristics. Unlike FWSVD and TFWSVD, our approach dynamically determines layer-wise compression ratios based on these characteristics, enhancing task performance efficiency. Specifically, we explore three novel methods in which ranks are dynamically determined for low-rank compression based on the inter- or intra-layer Fisher Information (FI): (1) dynamically determining the rank for low-rank compression based on inter-layer Fisher Information (FI), (2) maintaining a fixed percentage of intra-layer FI and (3) optimizing to maximize total (or layer-wise) FI given a fixed overall compression ratio. These methods are evaluated on a transformer-based language model and benchmarked against the state-of-the-art. One of the proposed methods, relying on specifying a fixed percentage of FI to keep per layer, has been shown to outperform the current state of the art on average in excess of 5%, and very significantly for inference and similarity tasks. The work furthermore provides valuable insights for future work to further explore the dynamic compression of layers in transformer networks using FI, in particular by displaying the effectiveness of dynamic compression using intra-layer FI.
Item Type:Essay (Master)
Faculty:EEMCS: Electrical Engineering, Mathematics and Computer Science
Subject:54 computer science
Programme:Computer Science MSc (60300)
Link to this item:https://purl.utwente.nl/essays/101837
Export this item as:BibTeX
EndNote
HTML Citation
Reference Manager

 

Repository Staff Only: item control page