Human Emotion Recognition Utilizing Transfer Learning

Authors

DOI:

https://doi.org/10.64296/vijir.v1i2.05

Keywords:

Facial emotion recognition, Image classification, Ensemble Learning

Abstract

Facial Emotion Recognition (FER) is a crucial area of research in the field of Computer Vision (CV) and Deep Learning (DL), aiming to identify and classify emotions such as happiness, sadness, anger, and surprise from facial expressions. This study focuses on developing a model for recognizing facial emotions using DL techniques along with Transfer Learning (TL). For this work, we have used CKPLUS dataset. We have used two pre-trained State-of-the-art (SOTA) Convolutional Neural Network (CNN) models, namely, InceptionV3 and MobileNet, to transfer their learning in the task of feature extraction from our image data. For the classification task, we have used extracted features from these models both individually and fusing them together. For classifying these sets of features, we have used multiple classifiers, namely, Logistic Regression, a custom 1-dimensional (1D) CNN, and hard voting classifier that ensembles the decisions of Linear Support Vector Machine (LSVM), Logistic Regression, perceptron, quadratically penalized SVM, and SVM with quadratically smooth loss. Moreover we have utilized a soft voting classifier that ensembles the decision of LSVM, and SVM with quadratically smooth loss. The fused features extracted from InceptionV3 and MobileNet achieve the highest performance when classified using the voting classifier, with a classification accuracy of 98.48%, a weighted precision of 99.53%, a recall of 98.57%, and an F1-score of 98.48%. This work indicates the ensemble learning technique outperforms individual models across the dataset, identifying and classifying human emotion from an image of facial expression more accurately.

References

[1] P. Ekman and W. V. Friesen, “Constants across cultures in the face and emotion,” J. Pers. Soc. Psychol., vol. 17, no. 2, pp. 124–129, 1971.

[2] G. Zhao and M. Pietikäinen, “Dynamic texture recognition using local binary patterns with an application to facial expressions,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 29, no. 6, pp. 915–928, 2007.

[3] A. Mollahosseini, B. Hasani, and M. H. Mahoor, “AffectNet: A database for facial expression, valence, and arousal computing in the wild,” IEEE Trans. Affect. Comput., vol. 10, no. 1, pp. 18–31, 2019.

[4] S. Li and W. Deng, “Deep Facial Expression Recognition: A Survey,” IEEE Trans. Affect. Comput., vol. 13, no. 3, pp. 1195–1215, 2022.

[5] B. C. Ko, “A brief review of facial emotion recognition based on visual information,” Sensors (Basel), vol. 18, no. 2, 2018.

[6] D. Kollias and S. Zafeiriou, “Exploiting deep learning for facial expression recognition,” Image and Vision Computing, vol. 79, pp. 38–48, 2019.

[7] I. Kotsia and I. Pitas, “Facial expression recognition in image sequences using geometric deformation features and Support Vector Machines,” IEEE Trans. Image Process., vol. 16, no. 1, pp. 172–187, 2007.

[8] J. Kim and H. Kang, “Ensemble Learning for Emotion Recognition from Facial Expressions,” Expert Systems with Applications, vol. 184, 2021.

[9] H. Guo and X. Sun, “Improving Facial Emotion Recognition with Transfer Learning and Explainability Techniques,” Applied Intelligence, vol. 52, no. 3, pp. 1234–1245, 2022.

[10] J. Noor, M. Daud, R. Rashid, H. Mir, S. Nazir, and S. A. Velastin, “Facial expression recognition using hand-crafted features and supervised feature encoding,” in 2020 International Conference on Electrical, Communication, and Computer Engineering (ICECCE), 2020.

[11] J. Chen, D. Chen, Y. Gong, M. Yu, K. Zhang, and L. Wang, “Facial expression recognition using geometric and appearance features,” in Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012.

[12] O. Ezerceli and M. T. Eskil, “Convolutional neural network (CNN) algorithm based facial emotion recognition (FER) system for FER-2013 dataset,” in 2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME), 2022.

[13] M. Wafi, F. A. Bachtiar, and F. Utaminingrum, “Feature extraction comparison for facial expression recognition using adaptive extreme learning machine,” International Journal of Electrical & Computer Engineering, 2023.

[14] R. A. Borgalli and S. Surve, “Custom CNN Architecture for FER Using FER13, CK+, and JAFFE Datasets,” Journal of Physics: Conference Series, vol. 2236, no. 1, 2022.

[15] G. Subramanian, N. Cholendiran, K. Prathyusha, N. Balasubramanain, and J. Aravinth, “Multimodal emotion recognition using different fusion techniques,” in 2021 Seventh International Conference on Bio Signals, Images, and Instrumentation (ICBSII), 2021.

[16] Kaggle.com. [Online]. Available: [https://www.kaggle.com/datasets/shawon10/ckplus](https://www.kaggle.com/datasets/shawon10/ckplus) [Accessed: 15-Jan-2025].

Published

2026-06-14