This paper explores the enhancement of emotion recognition under challenging real-world conditions where traditional models often falter. It investigates the impact of factors such as occlusions, lighting variations, poses, expression intensities, and image quality on the performance of deep learning models, utilising the BAUM-1 dataset to simulate these real-world scenarios. Modifications and filters were applied to the dataset, including various occlusions like sunglasses and blurred rectangles, as well as changes in illumination and image quality. A Convolutional Neural Network (CNN) was specifically adapted to address these real-world challenges. The model underwent training and testing across a spectrum of conditions, revealing variable accuracy levels in response to the different challenges, particularly noting a significant impact from occlusions. Despite this, the model showed a notable resilience against certain variations in illumination and occlusions. |
*** Title, author list and abstract as submitted during Camera-Ready version delivery. Small changes that may have occurred during processing by Springer may not appear in this window.