University of Twente Student Theses
Memory and inference time considerations in a C translated QKeras model
Dotsika, Adamantia (2022) Memory and inference time considerations in a C translated QKeras model.
PDF
1MB |
Abstract: | With the growth of applications using neural networks, there is an increase in need for compact C models. The work of [1] presents interesting results on QKeras models and its impact on memory footprint. With that in mind this paper presents a modified version of the keras2c library to adapt to the needs of QKeras models. The modified library is used for studying the influence of data representation on memory and inference time in C-translated Qkeras models. The results show a memory reduction of 2.5x in case of the fixed-point representation with no loss in inference time. Even though the output of the inference could not be studied in accuracy, the study shows interesting and promising results that need further investigation. |
Item Type: | Essay (Bachelor) |
Faculty: | EEMCS: Electrical Engineering, Mathematics and Computer Science |
Subject: | 53 electrotechnology |
Programme: | Electrical Engineering BSc (56953) |
Link to this item: | https://purl.utwente.nl/essays/93645 |
Export this item as: | BibTeX EndNote HTML Citation Reference Manager |
Repository Staff Only: item control page