EmoNet: Deep Learning-based Emotion Climate Recognition Using Peers' Conversational Speech, Affect Dynamics, and Physiological Data
Alhussein G., Alkhodari M., Saleem S., Roumeliotou E., Hadjileontiadis LJ.
Understanding the emotional dynamics within social interactions is crucial for meaningful interpretation. Despite progress in emotion recognition systems, recognizing the collective emotional climate among peers has been understudied. Addressing this gap, we propose EmoNet, an AI model transcending traditional emotion identification. EmoNet employs Mel-frequency cepstral coefficients and a Temporal Convolutional Network to extract deep features from speech signals. It uniquely integrates affect dynamics and physiological inputs (heart rate, electrodermal activity), providing a holistic view of emotion climates. Tested on K-EmoCon dataset, EmoNet excels in arousal and valence classification, achieving 87.82% and 83.79% accuracy, respectively. These results position EmoNet as a valuable tool for understanding and influencing emotion climates in real-world conversations, with applications in healthcare and human-computer interactions.