Machine learning techniques for the analysis of affective components of sign language

Author(s): Deshpande, N.D. (2021)

Abstract:
In this thesis, several machine learning techniques were studied to reach a better accuracy for a facial expression recognition model trained on a sign language dataset. Various machine learning techniques such as fine-tuning, data augmentation, class balancing, as well as image preprocessing were used to reach a better accuracy for recognizing the 6 basic Ekman emotions of 'fear', 'disgust', 'surprise', 'sadness', 'happiness', 'anger' along with the 'neutral' class. The models were evaluated using K-fold cross-validation to get a more accurate conclusion. This thesis also presents a comparison of the above-mentioned techniques based on two different architectures, namely MobileNet and EfficientNet. It is experimentally demonstrated that fine-tuning a pre-trained model along with data augmentation by horizontally flipping images, and applying image normalization, helps in providing the best accuracy on the sign language dataset for both MobileNet and EfficientNet architectures.

Document(s):

Deshpande_MA_EEMCS.pdf